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GGTTTGGTAG CTATCCACTT GATGGTCAAA TTGAAAATGG GGCAAGGCCA TCAGTAAATT 4860 

CACACCCTGC TCAAAAAAGG TCAGAGGGAA AAAGAGGTGC CGACCTTGGT TTTGGAAAAA 4920 

GAGGTCTGGA ACCAGCCCTT CCTCCGTTAG TCCGTGCAAG AAAGTCAAAA GTTCTTGGCT 4980 

GGCATCATCA AAGGCTTCCC AAGAAAGAGA CTCCTCATAA ATCTTGCCAA TCATATACGA 5040 

CTTTCTCTGC TCGACAATCC TTAAAAAAAG TGGAATATCC CGAATGACAT AGTATTTTTG 5100 

GCTATTGATT TGGCCGATTC TCAGAGTCCA CAAGATATGA TTGCTTCCTG CTTCCACCTG 5160 

ACCCACAGCT GATAACTCAT AGGCGCATTC TGATTTTGGA GATAAAATTC GATCCAAAAA 5220 

CTTGCCACCC AAGGTCACCT TGGTTTCAAC AGCCTCTTTT TCTTCATGAC CTTCTTCCAG 5280 

ACTCCACAAG ATTTCCTGAC CACGCTCATC ATTTTTCAGA AAATGCTCTA GCGCTGCCAA 5340 

ATGCACACAG TAGCCCCTCT TTTGAAAAAA ATCACAGGCA CAAAAAACCA AATCATCCTC 5400 

TAAACTATAG CGCAGTTCTT CTTCTGCAAC GCGAGCGTAG AGCCGATTGT TCTTTTCCTT 5460 

GATGATATCA ACCTTACCAG TTTCATAAAG GGCAACACCT TCGATACGAA TTTTCCCCGG 5520 

AATCAATTTA GCCATATTTT CACCTTTACC TTATCTTTTT ATTATACCAT ATTTTCGCCT 5580 

ATGAAAATAG CCTTCTAGGA AGACTTTTCT CCTAGAAGCC TGGATTTTTA ACGTTTGGCA 5640 

AAAGTAGCCA CAATCCGCTG ACAGACTTCT TGCAACAGAG ATTTGGGCAT AGCTATATTG 5700 

ATGCGGGCAT GGAGACTTCC TTCCTCTCCA AAATCCAAAC CACGGTTGAG GATAACCTTG 5760 

GCTTCATTTC TCAACAACTC TTGCAATGTT TCATCAGTCA GCTCATAAGC TGAAAAGTCA 5820 

AGCCAAATCA AGTAGGTACC TTGCGGTTTC ATGACCTTGA TTTTAGTCTC TTTTCCAAAT 5880 

AGATCCATCA CATAATTGAT CTGGTCTTCA AAGACTTGCT TGAGTTCCTC TAGCCAATCT 5940 

TTACCGTATC GATAGGCAGC TTCTGTCGCC AAATAACCCA AGCCTGAAAT TTCATGCTGA 6000 

TTATTGGCCA ACAGGCGTTT CTGGAAAGCC AGTCTCAACT TAGGATTTTC AATGACTGCA 6060 

TAGGAATTTT TTCTTCCAGC AAT ATT AAAT GTTTTACTGG CACTGCTCAA GACGATAGCA 6120 

AAATTTTTGA AGGCAGGATT CATCGTATTG AAAGACTGGT GTTTGTGACC AAAGAGGGTC 6180 

AAATCTTGGT GAATCTCATC CGAAACTAAC AAAACACCGT GTTTTTGGCA GAGTTGGCCA 6240 

ATCTTCTCCA ACACTTCTTT TTCCCAAACA CGTCCACCAG GATTGTGAGG GTTGCAAAGA 6300 

ACATAGAGTT TAACCTCCTC TTCCACCAAA TCCTTTTCAA GTTGGTCAAA GTCAATCTCA 6360 

AACAGACTAT CCTTTTCCAC TAAGGAATTA GTAATCAATC TACGATTATT CAACTTGACA 6420 

CTGCGAGCAA ACGGTGGGTA GACAGGCGTG TTAATTAAAA CCGCCTCGCC TTCTTTTGTA 6480 

AAGGTTTGAA TAGCTGTTGA GATGGCTGGT ACCACACCCT CGATAAAGAC AAGAGCCTCT 6540 



WO 98/18931 



PCT/US97/19588 



528 

TTGTCAAAGT TGTAACCGTA TTGTGTAGCT TCCCACTTTT GAACTTCCTT AATTAAGTCT 6600 

TCACTGGCAT ACGTATAACC ATAAACCAGT TGGTCTGCGT AAGTTTGCAC GGCTTGGCGG 6660 

ATTTCAGGCA AGACCACAAA GTCCATATCC GCTATCCAAG CTGGTAGAAC TTCACTATCC 6720 

GTTTCTGTTT CTTTCCATTT ATAGGTATGG TCCCCTAAAC GGTTGGGCAG GCTTGTAAAA 6780 

TCATATTTTC CCATCTTTGT CTTATCCTTC TATGGCTTGG CGCAAATCTG CAATCAAATC 6840 

TCTAGCATCC TCAATCCCAA TAGACAAACG CAAGAGGTCA TCTGTCAAAC CATAAGAATG 6900 

GCGTACCTCT GCTGGAATAT CAGCATGAGT TTGAGTCGTT GGATAACTAA TAAGACTTTC 6960 

CACTCCACCC AAACTTTCCG CAAAAGAGAA GACCTTGAGA CTGTTCAAAA TATGAGGAAT 7020 

GCGTGTTTCA TCGGCTACTT TAAAGGAAAT CATGCCTCCA CGACCAGTGT AGAGAACTTC 7080 

CTTAACTGCT GGAGAATCCT TCAAAAAGGC AACCACTTCT TGGGCGTTAG CTGTTGAGCG 7140 

CTCCATACGA AGAGACAAGG TCTTGAGACC ACGAAGCAAC TGGTAGCTGT CAAATGGAGA 7200 

CAAGACTGCC CCTGTTGTAT TAAGATTGTA AAAAAGCTTC TCGTATAGTT CTAAACTATT 7260 

GGTCACAACC ACTCCAGCCA AGACATCATT GTGGCCTGCT AGATACTTGG TTGCTGAATG 7320 

GAGAACGATA TCTGCTCCAT CTTCAATCGG ACGTTGGTAG ATAGGGCTAT AGAAGGTATT 7 380 

CTCCACCACC ACTTTGGCAC CCTTAGCATG AGCCAATTTT GCTAGTTTTT CCATATCAAA 7440 

TTCCAACATC AAGGGATTGG TTGGGCTTTC CATATAGAGA ACATCCACAT CCTTTTCTAA 7500 

CTCGGCAATC AACTCTTCTT CTGTATTGGC ATAGGTAAAA TGGAAATGAC CTTCCTGCTC 7560 

CACTTGCTTA AACCAGCGAA AAGAACCACC GTAAAGATCA CGCACTGCCA AGACCTTACT 7620 

TCCTACTCGA AAGACCCTAA AGCCCAGTAC AATAGCTGAC ATCCCTGAGC TAGTCGCTAC 7680 

GGCATAGTCT GCTGACTCAA TAGCCCCCAA GACTTCCTCA GCCTTACTAC GAGTTGGATT 7740 

TTTAGTGCGC GTATAGTCAA AC CC AG? AG A TCGACCAAAC TCTGGATGCT GATAGGTCGT 7800 

TGAAAAATGA AGTGGTGTCA CCAAAGCACC TGTTGCCTCA TCAGACTTGA TCCCTGCTTG 7860 

TGCTAAAATT GTGTTAATGT GTAATTCCTT GCTCATACAA TTCCTCCAAA TCTATAGTAA 7 920 

CTATTGTACC ACTTATTTTG TATCCTTCGT TTTCTTGTTT TCAAGAGCTA GTTATAGTTT 7980 

CAAACTATAT AAAAAGGGAG TTTTTCCTGC TCCCTTTAAT AGACTATAAA ATGGTGAATC 8040 

TCAAAAGACA CCTTCACTCT ATCATTTGCT CCTGCACAAA ACGAGCATAA CGCTCATGAT 8100 

TTTCCAGTAG TTCCTTATGA GTTCCTGAGC CAGTGATTTT CCCCTCCTCT AAGAAGAAAA 8160 

TACAATCCAC ATCTTTTACC GTTGACAAAC GATGCGCTAT AATCACAACC GTCTTCTCCT 8220 

TTAGTACAGA ATAGAGGCTA CTGATAATCG CATACTCAGA ATCCCCATCA AGATTAGCAG 8280 

TGGCTTCATC AAATATAAGA ATTTCAGCAT CTTTTAAGTA GGCTCTACCT ATTTGAAGTC 8340 
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ACGGTTCTTC TCAATGATGC TGTTCCCCCA CGCGAAACAC TTGGAAGAAC TAAAGAACTA 10140 

TCAATCAACC AAGCTTGAAT ATCAGCATAT TTCTCATAAC GTTTGGCCGG ATCTTGCTCT 10200 

TTATTAGCTT CTTCCAACAT TTGAGTATAG ACATCCAGTC CAACTGCCTT AGCCTTGTCA 10260 

TTGGCCTCAC CAGGCTCTAG TCCAAGATTT TGCAGAAATC CTCCACTATT AGTATTAAAA 10320 

ATATCGAGAT AGGTTGACGG GTCTTGATAA TCAGCTCCCC AACCGCCATG ATATAAATCA 10380 

TAATCTTTCT GAGCAGCTGT TTGAGCAAAG TAGCCTGAAC TGTCAAACTC ATCTGATGTT 10440 

AATTGCTGAA TGTCAATCAC TACATTATCA GAACCTAAAA CAGATTCAAT TGATTGTTTG 10500 

ATAGAACTAA CTCCTTGTAT GCCTACTTTA TCTGTTACTT CCACAGTCTT ATCCAAGTGG 10560 

ATTGGGAATT GAACACCCTT TGCTTCGAGT TCTTTCTTAG CTTCCGCAAA CTTAGCCTTG 10620 

GCTTTCTCAG GATTGTAGTA AGGGTCTTGA CCATCCGCAA AGTTGATACC TTGCCATTCC 10680 

TTACCATAGT TGACCATCTT AGAGGCTACA ACTTCACCAA AGTCTTTTCC CTTGATACTG 10740 

ACAAACTTTG GAGGAACCAC TAGGTTACGC AAAATCTTTG TTGCACCTTC TTTCCCTTCA 10800 

GACTGAGCCC CATAAGATGT TCTGTCAAAA GCAAAATTGA TAGCCTGACG GAAGTTTTTA 10860 

TTGAGAACTG CTTCCTGAGT CGATTTCTTT TCAATGTCAC TTGTTTTAGA ACTATAATTG 10920 

TAAGACTTCC TATCTAGGTT AAAATTAAAG AAATATGAAG TTGAATTTTG CATACTATAG 10980 

ATGATATTGT TTTTGTATTT TTCTTTAATC CCTTCATAGC TCGAGCTGTT AGGAAAAAGA 11040 

CGAGCCGTAG TATAAGCACC AGCTGTAAAA TTACGTTCCA GTGATTCTTG GTCGCTACCA 11100 

TCATAGTAGG TCAATTTCAC ATCGTCTACA AAGACATTCT TAGCATCCCA GTAATTAGGG 11160 

TTTTTCTTAT ATTCAATAGC AGATTTTGAG ACAAGTGCTT TCATCAAGAA AGGTCCATTC 11220 

TACAAAATAC TAGATGGATC CGCCTTCCCA AAATCATCCC CTTTTGATTT CAGGAAATCT 11280 

GCATTAACAG GAAAAAGTAT CGTTGCAAGT GTTTTTCAAT TCCAGTAAAG TTCTGGTTTA 11340 

ACCAAAGTAT ATTGAACCGT TTGCTCATCA AGTGCCTTGA CACCGACAGT TGAAAAGTCG 11400 

CTTGTTTTAC CAGTGATATA GTCATCCAAA CCAGCAACAG AGTCCTGCAC TAGATACAAC 114 60 

GCTTCTGATT TTTTATCAGC TGCATATTGC AAACCTGTCA CAAAATCCTG GGCAGTTACA 11520 

GGCGCATATT CTTCTCCCTC AGAAGTAAAC CACTTGGCAT CCTTACGAAG TTTGTAGGTA 11580 

TAGGTCAAAC CGTCCTGAGA AACAGTCCAA TCCTCTGCTA ATGATGGAAT AATATTCCCA 11640 

TATTGGTCAT TTTCTAATAA CCCGTCTACC AAATTTGCAA CAATATCGGA TGTTGCTGCG 11700 

CGGTTTTCTG CTAGATAGTT CAAGCTAGAT GGATCACTTG AATAAACATA GTTGTAGGTT 11760 

TTTGACGCCG TGCTAGAATT TCCACACGCG CTCAATAAAA CTCCTGTACC CAGGACAAGA 11820 

CCTGCCAAGG TTAGATATTT CCTCTTAGAC TTTTTCATTT CCGG 11864 
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(2> INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
ID) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

TAACTGCACT AAACATAATA TAAGGAGAGA AAATGTCTGC AATAGAACGT ATTACAAAAG 60 

CTGCTCACTT AATTGATATG AACGATATTA TCCGTGAAGG GAATCCTACT CTACGCGCGA 120 

TTGCTGAGGA ACTCACTTTC CCCCTATCTG ACCAGGAAAT CATCCTAGGC GAAAAGATGA 180 

TGCAATTCCT TAAACATTCC CAAGATCCTG TCATGGCTGA AAAAATGGGA CTCCGCGGTG 240 

GTGTTGGACT GGCTGCTCCC CAGTTAGATA TCTCAAAACC CATTATCGCT GTTTTGGTAC 300 

CTAATATTCT TGAAGAAGGC GAAACTCCAC AGGAAGCCTA CGATTTGGAA GCCATTATGT 360 

ACAATCCAAA AATCGTCTCT CACTCTGTTC AAGATGCTGC TCTTGGCGAA GGAGAAGGTT 4 20 

GCCTGTCTGT TGACCGTAAC GTGCCTGGCT ATCTTGTTCG CCATGCCCGC GTTACTGTTG 4 80 

ACTACTTTGA CAAAGATGGA GAAAAACACC GTATCAAACT CAAAGGCTAC AACTCCATTG 540 

TTGTTCAGCA TGAAATTGAC CACATTAACG GTATCATGTT TTACGATCGC ATCAATGAAA 600 

AAGACCCATT TGCAGTTAAA GATGGTTTAC TCATTCTTGA ATAAAGAAAA TCCCGTTGCA 660 

AGACGGGGTT TTGTGTTATA ATACAGGCAT CAAAACAAA7 GATATTCTCT ATGGTGTCCA 720 

CCCCCTTACC GAAGCCCTCC TTGCAAATAC AGGAAACAAA CTCTACCTCC AAGAAGATCT 780 

CCGACGTAAG AATGTTGAGA AAGTCAAGGA ACTAGCTACA GAAAAGAAGG TGTCCATTTC 840 

TTGGACATCA AAAAAATCTC TCTCTGAGAT TACTGAAGGT GCTGTTCATC AAGGTTTTGT 900 

TCTACGAGTG TCTGAATTTG CCTATAGCGA GCTAGATTAC ATCCTTGCAA AAACACGCCA 960 

AGAAGAAAAT CCACTTCTAT TGATTCTAGA TGGTCTAACC GATCCCCATA ATCTGGGTTC 1020 

TATCTTGCGA ACAGCCGATG CGACCAATGT TTCAGGTGTC ATCATTCCCA AGCACCGTAC 1080 

TGTCGGAGTA ACTCCTGTCG TTGCCAAAAC AGCCACAGGT CCTATTGAAC ACGTtCCAAT 1140 

TGCCCGAGTG ACCAACCTCA GTCAAACCTT AGGATAAACT TAAGGATGAA GGTTTCTGGA 1200 

CCTTTGGAAC GGATATGAAC GGTACTCCTT GCCACAAGTG GAATACAAAA GGGAAAATCG 1260 

CCCTCATCAT TGGAAATGAA GGAAAAGGTA TCTCTACCAA CATCAAAAAA CAGGTCGATG 1320 

AAATGATTAC CATTCCGATG AATGGACATG TTCAAAGCCT TAATGCCAGT GTTGCTGCGG 1380 
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CCATTCTCAT GTACGAAGTT TTCCGAAATA GACTATAAAA AAGTTTCCAG TCATCTGATT 144 0 

GGAAACTTTT TTATGATTAA CTATGTTCTG TAATGAATTT ATAGGCTTCT TGACCAGCGA 1500 

TAGCTCCATC TCCAACCGCT GTTGTTACTT GGCGAAGGTC TTTCAAGCGA ACATCTCCAA 1560 

CTGCAAAGAT ACCGTCGACT GCAGTTTTCA TGTGGTTATC TGTCACAATC CATCCTGCCT 1620 

GATCTTGGAT ATTCAATTCT TTAACAAAAT CGCTAAGAGG GTCCAAACCA ACATAGATAA 1680 

AGACACCACC GAAGGCTTGT TCTGTCACTT GACCTGTTTT CACATTTTCA AATACGACTG 1740 

ATTCTACTCG GTTTTCACCC TTGATTTCCC TTACTACAGA ATCCCAGATA AAGCTGATTT 1800 

TTTCATTCGC AAAGGCGCGA TCTTGTAAAA CCTTTTGGGC ACGAAGTTGG TCACGACGGT I860 

GAACAATGGT AACAGTCTTA GCAAAACGAG TCAAGAAGAG GCCTTCTTCA ACAGCTGAAT 1920 

CTCCACCACC AACTACCAAT AAATCTTGGT CACGGAAGAA AGCACCATCA CACACAGCAC 1980 

AGTAAGAAAC ACCACGACTG TTCAGTTCTT CTTCTCCAGG CACTCCCAAA GGACGGTGTT 2040 

TAGAACCAGT TGCTACGATA ACTGTACGTG TTTCATATCT TTGGTCATCA GTCATCACTT 2100 

TCTTAAAATC ACCATGGCTT CGACATTTTC AACATAACCA TAAATGTGCT CAACACCAAG 2160 

ATTTTCAAGT GGTTCAAACA TCTTTTCAGC CAATTCAGGT CCACTAATAT TAGCGTATCC 2220 

TGGGTAATTT TCGATATCAG ATGTATTATT CATCTGACCA CCTGGCAGAC CACCTTCAAT 2280 

CAAAGCTACT TTTAGATTGC TTCGAGCACC ATACAAGGCC GCAGTCATCC cTGCAGGTCC 2340 

AGCACCGATA ATAATAGTAT CGTACATATA GATTCCTTCT TTCTTGCTGT AACTATCTTT 2400 

2412 

ATTCTAACTC TG 

(2) INFORMATION FOR SEQ ID NO: 63: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7760 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

Ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

CCGATTTGGT GGAATTTTTG TCTCATCATT TAGAAGGTGT TGCAAGACCA GAGTTTACCT 60 

TGGTGCTTCA TACCAAATTG GGAGAAGCCT CTGTTTTGGC AAATATTGTA CATGTAAACA 120 

AGGATGAATG GATTTTAGGA ACAGTTGCTG GTGCCAATAC CTTATTGGTT ATTTGTCGAG 180 

ATCAGCACGT TGCCAAACTC ATGGAAGATC GTTTGCTAGA TTTGATGAAA GATAAGTAAG 240 

GTCTTGGGAG TTGCTCTCAA GACTTATTTT TGAAAAGGAG AGACAGAAAA TGGCGATAGA 300 

AAAGTTATCA CCCGGCATGC AACAGTATGT GG AT ATT AAA AAGCAATATC CAGATGCTTT 360 
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TTTGCTCTTT CGGATGGGTG ATTTTTATGA ATTATTTTAT GAGGATGCGG TCAATGCTGC 420 

GCACATTCTG GAAATTTCCT TAACGAGTCG CAACAAGAAT GCCGACAATC CGATCCCTAT 4 80 

GGCGGGTGTT CCCTATCATT CTGCCCAACA GTATATCGAT GTCTTGATTG AGCAGGGTTA 540 

TAAGGTGGCT ATCGCAGAGC AGATGGAAGA TCCTAAACAA GCAGTTGGGG TTGTTAAACG 600 

AGAGGTTGTT CAGGTCATTA CGCCAGGGAC AGTGGTCGAT AGCAGTAAGC CGGACAGTCA 660 

GAATAATTTT TTGGTTTCCA TAGACCGCGA AGGCAATCAA TTTGGCCTAG CTTATATGGA 720 

TTTGGTGACG GGTGACTTTT ATGTGACAGG TCTTTTGGAT TTCACGCTGG TTTGTGGGGA 780 

AATCCGTAAC CTCAAGGCTC GAGAAGTGGT CTTGGGTTAT GACTTGTCTG AGGAAGAAGA 840 

ACAAATCCTC AGCCGCCAGA TGAATCTGGT ACTCTCTTAT GAAAAAGAAA CCTTTGAAGA 900 

CCTTCATTTA TTGGATTTGC GATTGGCAAC GGTGGAGCAA ACGGCATCTA GTAAGCTGCT 960 

CCAGTATGTT CATCGGACTC AGATGAGGGA ATTGAACCAC CTCAAACCTG TTATCCGCTA 1020 

CGAAATTAAG GATTTCTTGC AGATGGATTA TCCGACCAAG GCTAGTCTGG ATTTGGTTGA 1080 

GAATGCTCGC TCAGGTAAGA AACAAGGCAG TCTTTTCTGG CTTTTGGATG AAACCAAAAC 1140 

GGCTATGGGG ATGCGTCTCT TGCGTTCTTG GATTCATCGC CCCTTGATTG ATAAGGAACG 1200 

AATCGTCCAA CGTCAAGAAG TAGTGCAGGT CTTTCTCGAC CATTTCTTTG AGCGTAGTGA 1260 

CTTGACAGAC AGTCTCAAGG GTGTTTATGA CATTGAGCGC TTGGCTAGTC GTGTTTCTTT 1320 

TGGCAAAACC AATCCAAAGG ATCTCTTGCA GTTGGCGACT ACCTTGTCTA GTGTGCCACG 1380 

GATTCGTGCG ATTTTACAAG GGATGGAGCA ACCTACTCTA GCCTATCTCA TCGCACAACT 1440 

GGATGCAATC CCTGAGTTGG AGAGTTTGAT TAGCGCAGCG ATTCCTCCTG AAGCTCCTCA 1500 

TGTGATTACA GATGGGGGAA TTATCCGGAC TGGATTTGAT GAGACTTTAG ACAAGTATCC 1560 

TTGCGTTCTC AGAGAAGGGA CTAGCTGGAT TGCTGAGATT GAGGCTAAGG AGCGAGAAAA 1620 

CTCTGGTATC AGCACGCTCA AGATTGACTA CAATAAAAAG GATGCCTACT ATTTTCATGT 1680 

GACCAATTCG CAACTAGGAA ATCTGCCAGC TCACTTTTTC CGCAAGGCGA CGCTGAAAAA 1740 

CTCAGAACGC TTTGGAACCG AAGAATTAGC CCGTATCGAG GGAGATATGC TTGAGGCGCG 1800 

TGAGAAGTCA GCCAACCTCG AATACGAAAT ATTTATGCGC ATTCGTGAAG AGGTCGGCAA 1860 

GTACATCCAG CGTTTACAAG CTCTAGCCCA AGGAATTGCG ACGGTTGATG TCTTACAGAG 1920 

TCTGGCGGTT GTGGCTGAAA CCCAGCATTT GATTCGACCT GAGTTTGGTG ACGATTCACA 1980 

AATTGATATC CGGAAAGGGC GCCATGCTGT CGTTGAAAAG GTTATGGGGG CTCAGACCTA 2040 

TATTCCAAAT ACGATTCAGA TGGCAGAAGA TACCAGTATT CAACTGGTTA CAGGGCCAAA 2100 
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CATGAGTGGG AAGTCTACCT ATATGCGTCA GTTAGCCATG ACGGCGGTTA TGGCCCAGCT 2160 

GGGTTCCTAT GTTCCTGCTG AAAGCGCCCA TTTACCCATT TTTGATGCGA TTTTTACCCG 2220 

TATCGGAGCA GCAGATGACT TGGTTTCGGG TCAGTCAACC TTTATGGTGG AGATGATGGA 2280 

GGCCAATAAT GCCATTTCGC ATGCGACCAA GAACTCTCTC ATTCTCTTTG ATGAATTGGG 2340 

ACGTGGAACT GCAACTTATG ACGGGATGGC TCTTGCTCAG TCCATCATCG AATATATCCA 24 00 

TGAGCACATC GGAGCTAAGA CCCTCTTTGC GACCCACTAC CATGAGTTGA CTAGTCTGGA 24 60 

GTCTAGTTTA CAACACTTGG TCAATGTCCA CGTGGCAACT TTGGAGCAGG ATGGGCAGGT 2520 

CACCTTCCTT CACAAGATTG AACCGGGACC AGCTGATAAA TCtACGGTAT CCATGTTGCC 2580 

AAGATTGCTG GCTTGCCAGC AGACCTTTTA GCAAGGGCGG ATAAGATTTT GACTCAGCTA 2640 

GAGAATCAAG GAACAGAGAG TCCTCCTCCC ATGAGACAAA CTAGTGCTGT CACTGAACAG 27 00 

ATTTCACTCT TTGATAGGGC AGAAGAGCAT CCTATCCTAG CAGAATTAGC TAAACTGGAT 2760 

GTGTATAATA TGACACCTAT GCAGGTTATG AATGTCTTAG TAGAGTTAAA ACAGAAACTA 2820 

TAAAACCAAG ACTCACTAGT TAATCTAGCT GTATCAAGGA GACTTCTTTC ACAATTCTCC 2880 

ACTTTTTTGC TAGAATAACA TCACACAAAC AGAATGAAAA GGAGCTGACG CATTGTCCCT 2940 

CCCTTTTGTC TATTTTTTAA GGAGAAAGTA TCCTCATTCA GAAAATAAAA ACCTACAAGT 3000 

GGCAGGCCCT GGCTTCGCTC CTGATGACAG GCTTGATGGT TGCTAGTTCA CTTCTGCAAC 3060 

CGCGTTATCT GCAGGAAGTC TTAGGCGCCC TCCTTACTGG GAAATATGAA GCTATTTATA 3120 

GTATCGGGGC TTGGTTGATT GGTGTCGCCG TAGTCGGTCT AGTTGCTGGT GGACTCAATG 3180 

TTGTCCTCGC ACCCTATATT GCCCAAGGAG TTTCATCCGA CCTTCGGCAG GATCCCTTCC 3240 

GTAAAATTCA AACCTTTTCT TATGCTGATA TTGAACAATT TAATGCGGGA AATCTAGTCG 3 300 

TTCGAATGAC AAATGATATC AACCAGATTC AGAACGTTGT CATCATCACC TTCCAAATTC 3 3 60 

TTTTCAGACT TCCCCTCTTG TTCATCGGTT CGTTTATCCT AGCGGTTCAA ACCTTACCTT 3 4 20 

CTCTGTGGTG GGTGATTGTT CTCATGGTAG TCTTGATTTT TGGTTTGACT GCTGTCATGA 34 80 

TGGGAATGAT GGGGCCTCGT TTTGCCAAGT TTCAAACCCT TCTTGAGCGC ATCAATGCCA 3540 

TTGCCAAGCA AAATTTACGT GGCGTTCGTG TGGTCAAGTC CTTTGTCCAA GAAAAAGAGC 3600 

AATTTGCTAA GTTTACAGAG GTCTCAGACG AGCTTCTTGG TCAAAACCTT TACATTGGTT 3660 

ATGCCTTTTC AGTAGTGGAA CCCTTTATGA TGTTGGTTGG TTACGGGGCG GTCTTCCTCT 3720 

CTATTTGGCT GGTCGCGGGA ATGGTTCAGT CGGATCCGTC TGTTGTTGGT TCCATCGCTT 3780 

CTTTTGTTAA TTACCTAAGC CAGATTATCT TTACCATTGT TATGGTTGCA TTTTTGGGAA 3840 

ATTCTGTCAG CCGTGCCATG ATTTCCATGC GTCGTATTCG AGAAATTCTT GACGCAGAGC 3900 
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CAGCTATGAC CTTCAAGGAT ATCCCAGATG AAGAGTTGGT TGGAAGTCTT ACCTTTGAAA 3960 

ATGTGACCTT TACCTATCCA ATGGACAAGG AACCGATGCT CAAAGATGTG AGCTTTACTA 4020 

TTGAACCTGG TCAAATGGTT GGTGTAGTTG GAGCGACTGG TGCAGGAAAG TCAACCTTGG 4080 

CTCAATTGAT TCCACGTCTC TTTGATCCAC AGGACGGGGC CATTAAAATC GGTGGCAAGG 4140 

# 

ATATTCGAGA AGTGAGTGAA GGAACCCTGC GTAAAACAGT TTCCATCGTT CTCCAACGTG 4 200 

CCATTCTTTT TAGTGGAACG ATTCCAGATA ACTTGAGACA GGGGAAGGGG AATGCTACTC 4260 

TATTTGAAAT GGAGCGCGCA GCCAATATTG CCCAGGCTAG TGAATTCATT CATCGTATGG 4320 

AGAAAACCTT TGAAAGTCCA GTTGAAGAAC GGGGAACCAA TTTCTCTGGT GGACAAAAAC 4 380 

AAAGGATGTC GATTGCGCGT GGGATTGTCA GCAATCCACG TATTCTGATT TTTGATGATT 4440 

CGACCTCAGC CTTGGATGCC AAATCAGAGC GCTTGGTGCA AGAACCTTTG AATAAGGACT 4500 

TGAAGGGGAC GACAACCATT ATTATTGCTC AAAAAATTAG CTCGGTTGTC CATGCAGACA 4 560 

AGATCTTGGT TCTAAATCAA GGACGATTGA TTGGTCAAGG TACGCATGCA GACTTGCTTG 4620 

CCAACAATGC CGTTTACCGT GAAATCTATG AAACACAGAA ATGAAACACA AACTATAAGA 4680 

AAAGTCAATA CTTTTATCTA AACTATTTCT TATTTCAATT TGATGATTTG GCGATGATTT 4740 

TAGAGCACGG CAAAAAGCCC TTGAAAAAGT CCATTTTTTC AAAGGTAATC CTGTGTTAAT 4 800 

TTCAGAAATT ACATCACTTT TTGTTCGTCA AATGGCAGCT CTTTTTTTAG GATATAAAAC 4860 

ACGGTTCGGA TAAGTTTTTT TGCAAGGTGG ATGATGGCTA CATTGTAATG TTTTCCTTGT 4920 

TCTAATTTAG TCTTAAGATA GGCCTTAAAA GCAGGCCAAA AGCGAGGGCA TGCTTTGGCA 4980 

CCTTGTATGA GTACCTACCG CAGATGAGGG GAACTCCGTT TGACCATTCT TCCTGCTAAA 5040 

TCAATCTGAT CTGACTGATA AATAGAAGAA TCCAGTCCAG CGAAAGCTTG TAATTGAGCA 5100 

GGATTATCAA AGGCATGAAT ATTTCGAATC TCAGCTAAAA TGACCGCCCC TAAACGATCC 5160 

CCAATCCCAG TAACCGTCGT GATGACCGAG TTGAACTCAG CCATCAAGTC ATTGACACAT 5220 

GTTTCCGCCT TGTCAATGAG CCTCTTGTAA TGTTTGATGT TTTCATTACA CGAGATAAAA 52 80 

CGTCTATGCG TTATCAAACT CATTACCAAT TAAAACAAAA AGCTGTGCTT AGATCCTTTC 53 40 

GGAAATTGTC AAGCGATTGG AGGAAATGAA CTAATCCACA GCGGCTTATT CCAAGTATAC 5400 

CACTTCGGCT TTGGCAGTAG CTAACTGCGC TAAATATAAT ATAAGGAGGA GTAAAATGAA 5460 

CACAGTTCAA TTTTTTTCGC ATTATTTTAA GCTCTACAAG TTCTCATTTG TAGTTGTCAT 5520 

CCTGATGATT GTTCTGCCGA CTTTTGCCCA AGCCCTCTTT CCAGTCTTTT CTGGACAAGC 5 580 

GGTGACGCAG CTAGCCAATT TAGTTCAAGC TTATCAAAAT GGCAATCCAG AACTTGTATG 5640 
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CCAAAGCCTA 


TCAGGAATCA 


TGGTCAATCT 


TGGCCTGCTG 


GTTTTGGTTC 


TATTTATCTC 


5700 


TAGTGTAATA 


TACATGTGTC 


TCATGACGCG 


CGTGATTGCA 


GAATCGACCA 


ACGAGATGCG 


5760 


CAAAGGCCTC 


TTTGGTAAGC 


TTGCTCAGTT 


GACGGTTTCT 


TTCTTTGACC 


GTCGACAAGA 


5820 


TGGCGATATC 


CTGTCTCATT 


TTACCAGTGA 


TTTGGATAAT 


ATCCTCCAAG 


CCTTTAACGA 


5880 


AAGCTTGATT 


CACGTCATGA 


GCAATATTGT 


TTTATACATT 


GGTCTGATTC 


TTGTCATGTT 


5940 


TTCGAGAAAT 


GTGACGCTGG 


CTCTCATCAC 


CATTGCCAGC 


ACCCCATTGG 


CTTTCCTTAT 


6000 


GCTGATTTTC 


ATCGTGAAAA 


TGGCACGCAA 


ATACACCAAC 


CTCCAGCAGA 


AAGAGGTAGG 


6060 


GAAGCTCAAC 


GCCTATATGG 


ATGAGAGCAT 


CTCAGGCCAA 


AAAGCCGTGA 


TTGTGCAAGG 


6120 


AATTCAAGAG 


GATATGATGG 


CAGGATTTCT 


TGAACAAAAT 


GAGCGCGTGC 


GCAAGGCAAC 


6180 


CTTTAAAGGA 


AGAATGTTCT 


CAGGAATTCT 


TTTCCCTGTC 


ATGAATGGGA 


TGAGCCTGAT 


6240 


TAATACAGCC 


ATCGTCATCT 


TTGCTGGTTC 


GGCTCTACTT 


TTGAATGATA 


AGTCTATTGA 


6300 


AACAAGTACA 


GCCCTAGGTT 


TGATTGTTAT 


GTTTGCACAA 


TTTTCACACC 


AGTACTACCA 


6360 


GCCTATTATC 


CAAGTTGCAG 


CGAGTTGGGG 


AAGCCTTCAG 


TTGGCCTTTA 


CTGGAGCTGA 


6420 


ACGAATTCAG 


GAAATCTTTG 


ATGCAGAGGA 


GGAAATCCGA 


CCTGAAAAGG 


CTCCAACCTT 


6480 


C ACT AAGTTG 


CAAGAAAGTG 


TTGAAATCAG 


TCATATCCTT 


TTTTCATACT 


TGCCTGATAA 


6540 


ACCTATTTTG 


AAAGATCTCA 


GCATTTCTGC 


CCCTAAAGGC 


CAGATGACAG 


CAGTTGTTGG 


6600 


GCCGACAGGT 


TCAGGAAAAA 


CGACTATTAT 


GAACCTCATC 


AATCGCTTTT 


ATGATGTTGA 


6660 


TGCTGGTGGT 


ATTTATTTTG 


ATGGTAAAGA 


CATTCGTGGC 


TATGACTTAG 


ATAGTCTTAG 


6720 


AAGCAAGGTG 


GGAATTGTAT 


TGCAAGATTC 


GGTCTTGTTT 


AGCGGAACGA 


TTAGAGACAA 


6780 


TATCCGATTT 


GGTGTGCCAG 


ATGCTAGTCA 


GGAAATGGTT 


GAGGTAGCAG 


CAAAAGCAAC 


6840 


CCACATTCAC 


GACTATATCG 


AAAGTTTGCC 


TGATAAGTAC 


GATACTCTTA 


TTGATGATGA 


6900 


CCAGAGCATC 


TTTTCAACAG 


CGCAGAAGCA 


ATTGATTTCA 


ATCGCTCGAA 


CCCTGATGAC 


6960 


AGATCCAGAA 


GTTCTCATTC 


T CGATGAAGC 




Li 1 nun 1 nV_Uv? 




7020 


CAAGATTCAG 


CATGCCATGG 


AGGTGGTTGT 


AGCACGTAGA 


ACTAGTTTCG 


TCATTGCCCA 


7080 


CCGCTTGAAA 


ACCATTCTCA 


ATGCAGATCA 


GATTATTGTC 


CTTAAAGATG 


GAGAAGTCAT 


7140 


TGAACGTGGT 


AACCACCATG 


AACTTTTGAA 


GCTAGGTGGC 


TTTTATTCAG 


AACTCTATCA 


7200 


CAATCAATTT 


GTTTTCGAAT 


AAGAAAGAAG 


TTGTCCTATG 


TGGCCAGCTT 


TTTCTTGTCC 


7260 


ATAAAAAATG 


TTTATCACAG 


CCTTAAAAAA 


AACATATTAG 


ACGAAAGTCA 


TTTTGAGTGA 


7320 


TATGATAGGA 


CTATCGTTAG 


CATTCGAAAG 


GAGAGGCATC 


ATGGCTAGAA 


CGGTTGTAGG 


7380 


AGTTGCTGCA 


AATCTATGTC 


CCGTAGACGC 


AGAAGGCAAA 


ATCATTCATT 


CATCTGTATC 


7440 
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TTGTAGATTC GCAGAGATCA TTCGTCAAGT CGGTGGTCTC CCTTTAGTCA TTCCTGTTGG 7500 

TGATGAGTCA GTTGTACGTG ATTATGTGGA AATGATTGAC AAACTCATTT TGACAGGAGG 7560 

CCAAAATGTT CATCCTCAGT TTTATGGAGA GAAAAAGACC GTCGAGAGCG ATGATTACAA 7620 

TCTGGTCCGT GACGAATTTG AATTGGCACT CTTGAAGGAA GCGCTTCGTC AGAATAAACC 7680 

AATTATGGCA ATCTGTCGCG GTGTCCAACT TGTCAATGTT GCCTTTGGTG GAACCCTCAA 7740 

TCAAGAAATC GAAGGTCAGG 7760 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 272 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 64: 

GAGGTTTTAA TTCACTTACC TCTsCCGTAT CTTTATTTAA AATGAATTCT TTTACGGTTG 60 

TATTTCTTGC AAAATCTTTT ACAACAATCT TAATGTTTAG TGTCTTGTCT ATTATTTGTT 120 

TAATATCATT AAATGATCTA TATTCTTTTC CATTTATATA AATATGTTGT TCTTGAATCT 180 

CACCATCGAA TCCATTATTT CTTTTATCAT TGATGTTAAA GACTACAGAT TTTCCATCAG 240 

CATATTCGAT ACTAGTATTT CCCTTAGGAT CAATGTTTAC TTCGGGTTTA ACATTATCAT 300 

ATAAAAACTG ATAGTGGACT CCAACTCCTT TAGCATTCAA ATCGCTATAG CCAGTTTGAA 360 

GATAAACATT TCCATCCATA TCTGTTACCT TATCTCGAAA TCCGTTTGCT TTATAGTCTT 420 

TCATTCCCCA GTCCATGATG TCACCGTCTT TAACATTCAC CTTAATATTA AAATCTCTAG 430 

TGTTATCAAT GTGTAAATCT CCGTAGATTA AATAATTATC TACAACCGAT TCATTAACTC 540 

TCAATTCCCA CTTAAAACCA CCCTTATCAG AAATCTTACC TCTTAAATAA AATTCTGGAT 600 

TTCGTACATA AATTTTATTA GATTTAGATG GATTAAAGTA GTTCTTATCC ATTGAAAGGT 660 

TTACTGGTTT GGTATCAATA AATAACATGG AGCCATCTTC TTTTATACCT TCTACATTGA 720 

ACTTATCCTC TCCAGTGTAT TCTTTATCAT CCTTACCAAA TAATACAAGT TTAGAAGAAT 780 

CTGTCACAAG ATTTCCGTCT TTATCGATAG CTTCCCCTTT ATCCTTCATT TTAAATGTAA 840 

ACACTTGATA CCTTATAATG TTAAAGCCGT CCAAAGCCGA CATTAATACA GATTGGGTAC 900 

TTCTTCCATC TTCAACATTT CTACTATCAG CATAAATTGT TGTTTCTGAA AGGGCTCTTA 960 

GATTAGGATT GGCCTTTTGT ATTTTTGCTA TATCTTCCTT GCTATAGACT CCATTTCCTT 1020 
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CTAACATATC 


CGTTTTTCCA 


GGATTATAGG 


TAGTCACTTT 


TAGTGCATAG 


CCTTTTCTTA 


1080 


GAATGATATT 


ATCCTTTAAC 


AGATATTGTT 


GTTTTTCTGA 


ATCAGAATAG 


ATTTTACCAG 


1140 


ATTCCATTTT 


AGTTAAATTG 


TCTGGTTTGT 


TTTTTGAAAG 


ATCTCCTTCC 


CCTAATTCTA 


1200 


TCACATTCCC 


ATAACTTGAT 


ACATAGGGAT 


ATTCTGATTT 


AGTTTCCTTA 


ATTTTTTCAG 


1260 


GCATTCTAAT 


TTTAATTTCA 


GCTTTTTTCT 


GATCATTATC 


TTTAACAAAT 


AATCTCATAT 


1320 


CTCCTGCAAA 


ACCTAATCCA 


TCCACAATAT 


CATTAATATT 


AGCGTATAGA 


TCAAATGTCA 


1380 


TCGTTTTTGA 


GTGGAAATCA 


TACTTGGTCG 


CTTTCATTTC 


TATAGATTTA 


TAGTTATTCC 


1440 


CATAATATAC 


CTTGGCATTT 


TTAGAAACAT 


TACTTATCTT 


TCCAAGAATT 


TCAAAGTGTC 


1500 


CATCTTTAGA 


CGGACTTAGA 


ACACCATAAA 


TTTTTGATTT 


GATTTCGTCA 


AGTTTCTCAG 


1560 


TTTCATATTC 


TAGATCAGTC 


CCATCATCGT 


AGGCTATTAT 


ATTTCCTTTA 


TCATCGTATT 


1620 


TATAATCGTA 


TTCCTCCATT 


CTCTTACCAG 


TTTCACTTGT 


AAAATCATCA 


ACTTCTCTAA 


1680 


ATTTCTTTTT 


AATGAGTTTC 


TTTAAGTCTT 


TATTTTCAAA 


GTCTCTAATT 


GTTGAAATAT 


1740 


TTCTATCAAT 


AGTAAAACTA 


GATTTTTCTT 


TAATAGACTC 


TTCATTTTCT 


TGATGATGAT 


1800 


GTTCTACCCC 


AGTTGTATCT 


TTTTTTAGAC 


TACCCTCTTT 


TCCATTTCCT 


AAATTTTTAA 


1860 


ATTl'AGATTC 


TGCAATCTCG 


CCAAGCTTTT 


GATATTTAGA 


TGAATCTTCA 


TCAGGATCTA 


1920 


CTAGATAATA 


GCAAATCATC 


CCCTTTTCAT 


CAGCCTGATT 


AGCAAATTTA 


ATTCTATGAA 


1980 


TCTTTGTGAA 


ATTGCTAGAA 


CCATCTAATG 


CAATGACTTC 


AATGATTTTT 


CCCCTTAAAT 


2040 


CTCCCGCACC 


TTTAATTTCA 


TAAATGGTAT 


TTCCGTCTTT 


ATCAAGTTTT 


CTATTTCTTC 


2100 


CTTGACCCTC 


ACCTGCGTAA 


GTTACTTCAA 


GATTTTTTTC 


AACCTCTCCA 


TCTTCATTAA 


2160 


CAAGAGCGGC 


GCCAGCATAC 


CAAACTTCGT 


TCGCAATCTC 


GTCAAATTTT 


TCAGGATGTT 


2220 


CTTTTTGATC 


TCTCGCAAAT 


AGCGTTTCAT 


TCTTATACTG 


ATCTTTTACC 


TTATGATAAC 


2280 


TATCCTTTGT 


AATCAACTTA 


ATTTTTTCAG 


GATTTGAAAA 


ATCAACCGAA 


ACAATCTTAG 


2340 


GGGCGGTGTP 


ATCAATTTTT 


ACAGGAATAT 


AGGAAACCTC 


CCATGGGTAA 


TCTTTAGTTA 


2400 


ATCTATATTT 


AAATTTATAG 


AAATATTGAC 


CTTCCGCAAT 


CGGTTCAAAT 


TGACCTCTTA 


2460 


TCTTAGTAGC 


AGGATCTTGA 


TTATCCTTAC 


TTTCTGGTGC 


ATTTTCTTCT 


CTACCTCTAG 


2520 


GATTATAGAT 


GAGTCCATCC 


CACTTCAAGT 


CACCCCAAAC 


TTTTAGTTTA 


GATGATTTGA 


2580 


TTCCCTTTGC 


ATCATTGCTT 


TTAGAATTTA 


AAATTCCTCT 


AATAAACTGT 


TCTCTCGAAA 


2640 


TGACTTTTAA 


GTCTCTTTGA 


TTTTCTCCCT 


CTTTATTTGT 


ATTTACTATT 


GAAATCAATC 


2700 


CTTCTTCTGC 


ACTTCTTAAT 


ACA 








2723 


{2 J INFORMATION FOR SEQ ID NO: 65: 
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<i> SEQUENCE CHARACTERISTICS: 
, (A) LENGTH : 11831 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

AAAAAAGTGG GAATGACTCA AATCTTCACT GAAGCTGGCG AATTGATCCC TGTAACAGTT 60 

ATTGAAGCAA CTCCAAACGT TGTTCTTCAA GTTAAAACTG TTGAAACAGA CGGATACAAC 120 

GCTATCCAAG TTGGTTTCGA TGACAAACGC GAAGTATTGA GCAACAAACC TGCTAAAGCA 180 

CATGTAGCGA AAGCTAACAC GGCTCCTAAG CGCTTCATTC GTGAATTCAA AAACGTTGAA 240 

GGCTTGGAAG TTGGTGCTGA AATPACAGTT GAAACATTCG CAGCTGGAGA CGTTGTTGAC 300 

GTAACGGGTA CTTCTAAAGG TAAAGGTTTC CAAGGTGTTA TCAAACGCCA CGGACAATCA 360 

CCTGGACCAA TGGCTCACGG TTCTCGTTAC CACCGTCGTC CAGGTTCTAT GCGGCCTGTT 420 

GCACCTAACC GCGTATTCAA AGGTAAAAAC CTTGCAGGAC GTATGGGTGG CGACCGCGTA 480 

ACAATTCAAA ACCTTGAAGT TGTACAAGTT GTTCCAGAAA AGAACGTTAT CCPTATCAAA 540 

GGTAACGTAC CACCTGCTAA GAAATCTCTT ATCACTATCA AATCAGCAGT TAAAGCTGGT 600 

AAATAATAAA GAAAGGGGAA ATCAGTCACA ATGGCAAACG TAACATTATT TGACCAAACT 660 

GGTAAAGAAG CTGGCCAAGT TGTTCTTAGC GATGCAGTAT TTGGTATCGA ACCAAATGAA 720 

TCAGTTGTGT TTGATGTAAT CATCAGCCAA CGCGCAAGCC TTCGTCAACG AACACACGCT 780 

GTTAAAAACC GCTCTGCAGT ATCAGGTGGT GGACGCAAAC CATGGCGTCA AAAAGGAACT 840 

GGACGTGCTC CTCAACCT7C TATCCCCTCA CCACAATGGC CTGGTCCTGG TGTTGTCTTC 900 

GGACCAACTC CACGTTCATA CGGCTACAAA CTTCCACAAA AACTTCGTCG CCTAGCTCTT 960 

AAATCAGTTT ACTCTGAAAA AGTTGCTGAA AACAAATTCG TAGCTGTAGA CGCTCTTTCA 1020 

TTTACAGCTC CAAAAACTGC TGAATTTCCA AAAGTTCTTG CAGCATTCAG CATCGATTCT 1080 

AAAGTTCTTG TTATCCTTGA AGAAGGAAAT GAATTCGCAG CTCTTTCAGC TCGTAACCTT 1140 

CCAAACGTGA AAGTTGCAAC TGCTACAACT GCAAGTGTTC TTGACATCGC AAATAGCGAC 1200 

AAACTTCTTG TCACACAAGC AGCTATCTCT AAAATCGAGG AGGTTCTTGC ATAATGAATT 1260 

TGTATCATGT TATCAAAAAA CCTGTCATCA CTGAAAGCTC AATGGCTCAA CTTGAAGCAG 1320 

GAAAATATGT ATTTGAAGTT GACACTCGTG CACACAAACT TTTGATCAAG CAAGCTGTTG 1380 

AAGCTGCTTT CGAAGGTGTT AAAGTTGCCA ATGTTAACAC AATCAACGTA AAACCAAAAG 1440 
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CTAAACGTGT TGGACGTTAC ACTGGTTTTA CTAACAAAAC TAAAAAAGCT ATCATCACAC 1500 

TTACAGCTGA TTCTAAAGCA ATCGAGTTGT TTGCTGCTGA AGCTGAATAA TCTAAGGAGG X560 

AAATATCGTG GGAATTCGTG TTTATAAACC AACAACAAAC GGTCGCCGTA ATATGACTTC 1620 

TTTGGATTTC GCTGAAA7CA CAACAAGCAC TCCTGAAAAA TCATTGCTTG TTGCATTGAA 1680 

GAGCAAGGCT GGTCGTAACA ACAACGGTCG TATCACAGTT CGTCACCAAC GTGGTGGACA 1740 

CAAACGTTTC TACCGTTTGG TTGACTTCAA ACGTAATAAA GACAACGTTG AAGCAGTTGT 1800 

TAAAACAATC GAGTACGATC CAAACCGTTC TGCAAACATC GCTCTTGTAC ACTACACTGA 1860 

CCGTGTGAAA GCATACATCA TCCCTCCAAA AGGTCTTGAA GTACGTCAAC GTATCGTTTC 1920 

AGGTCCAGAA GCAGATATCA AAGTCGGAAA CGCTCTTCCA CTTGCTAACA TCCCAGTTGG 1980 

TACTTTGATT CACAACATCG AGTTGAAACC AGGTCGTGGT GGTGAATTGG TACGTGCTGC 2040 

TGGTGCATCT GCTCAAGTAT TGGGTTCTGA AGGTAAATAT GTTCTTGTTC GTCTTCAATC 2100 

AGGTGAAGTT CGTATGATTC TTGGAACTTG CCGTGCTACA GTTGGTGTTG TCGGAAACGA 2160 

ACAACATGGA CTTGTAAACC TTGGTAAAGC AGGACGTAGC CGTTGGAAAG GTATCCGCCC 2220 

AACAGTTCGT CGTTCTGTAA TGAACCCTAA CGATCACCCA CACCGTCGTG GTGAAGGTAA 2280 

AGCACCAGTT GGTCGTAAAG CACCATCTAC TCCATGGGGC AAACCTGCTC TTGGTCTTAA 2340 

AACTCGTAAC AAGAAAGCGA AATCTGACAA ACTTATCGTT CGTCGTCGCA ACGAGAAATA 2400 

ATATTAAACT AGTCGCTTAA GCAACTAGTA AATCCGCCAG CTCGGTACCG CTCCATAGGA 2460 

GTGCAAGCCG CTGTGGTACA ACATTTAAAG GAGAAAATAT AAAAATGGGA CGCAGTCTTA 2520 

AAAAAGGACC TTTCGTCGAT GAGCATTTGA TGAAAAAAGT TGAACCTCAA GCTAACGACG 2580 

AAAAGAAAAA AGTTATTAAA ACTTGGTCAC GTCCTTCAAC GATCTTCCCA ACTTTCATTG 2640 

GTTACACTAT TGCAGTTTAT GACGGACGTA AACACGTACC TGTTTACATC CAACAAGACA 2700 

TGGTAGGCCA CAAACTTGGT GAATTTGCAC CAACTCGTAC TTACAAAGGT CACGCTGCAC 27 60 

ACGACAAGAA AACACGTAGA AAATAAGGAG AACATAAATG GCAGAAATTA CTTCAGCTAA 2820 

AGCAATGGCT CGTACAGTAC GTGTTTCACC TCGTAAATCA CGTCTTGTTC TTGATAACAT 2880 

CCGTGGTAAA AGCGTAGCCG ATGCAATCGC AATCTTGACA TTCACTCCAA ACAAAGCTGC 2940 

TGAAATCATC TTGAAAGTTT TGAACTCAGC TGTAGCTAAC GCTGAAAACA ACTTTGGTTT 3000 

GGATAAAGCT AACTTGGTAG TATCTGAAGC ATTCGCAAAC GAAGGACCAA CTATGAAACG 3060 

TTTCCGTCCA CGTGCGAAAG GTTCAGCTTC ACCAATCAAC AAACCTACAG CTCACATCAC 3120 

TGTAGCTGTT GCAGAAAAAT AAGGAGGTAA AATCGTGGGT CAAAAAGTAC ATCCAATTGG 3180 

TATGCGTGTC GGCATCATCC GTGATTGGGA TGCCAAATGG TATGCTGAAA AAGAATACGC 3240 
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GGATTACCTT CATGAAGATC TTGCAATCCG TAAATTCGTT CAAAAAGAAC TTGCTGACCC 3300 

AGCAGTTTCA ACTATTGAAA TCGAACGCGC AGTAAACAAA GTTAACGTTT CACTTCACAC 33 60 

TGCTAAACCA GGTATGGTTA TCGGTAAAGG TGGTGCTAAC GTTGATGCaC TCCCTGCAAA 34 20 

ACTTAACAAA TTGACTGGAA AACAAGTACA CATCAACATC ATCGAAATCA AACAACCTGA 34 80 

TTTGGATGCT CACCTTGTAG GTGAAGGAAT TGCTCGTCAA TTGGAGCAAC GTGTTGCTTT 3540 

CCGTCGTGCA CAAAAACAAG CAATCCAACG TGCAATGCGT GCTGGAGCTA AAGGAATCAA 3600 

AACTCAAGTA TCAGGTCGTT TGAACGGTGC AGATATCGCC CGTGCTGAAG GATACTCTGA 3 660 

AGGAACTGTT CCGCTTCACA CACTTCGTGC AGATATCGAT TACGCTTGGG AAGAAGCAGA 3720 

TACTACATAC GGTAAACTTG GTGTTAAAGT ATGCATCTAC CGTGGTGAAC TTCTTCCAGC 3780 

TCGTAAAAAC ACTAAAGGAG GTAAATAACC AATGTTAGTA CCTAAACGTG TTAAACACCG 3840 

TCGTGAGTTC CGTGGAAAAA TGCCCGGTGA AGCAAAAGGT GGAAAAGAAG TAGCATTCGG 3900 

TGAATACGGT CTTCAAGCTA CAACTAGCCA CTGGATCACT AACCGCCAAA TCGAAGCTGC 3 960 

TCGTATCGCC ATGACTCGTT ACATGAAACG TGGTGGTAAA GTTTGGATTA AAATCTTCCC 4020 

ACACAAATCA TACACTGCTA AAGCTATCGG TGTGCGTATG GGATCTGGTA AAGCGGCACC 4080 

TGAAGGTTGG GTAGCACCAG TTAAACGTGG TAAAGTGATG TTCGAAATCG CTGGTGTATC 4140 

TGAAGAGATT GCACGTGAAG CCCTTCGACT TGCTAGCCAC AAATTGCCAG TTAAATGTAA 4200 

ATTCGTAAAA CGTGAAGCAG AATAAGGAGA AGGCATGAAA CTTAATGAAG TAAAAGAATT 4260 

TGTTAAAGAA CTTCGTGGTC TTTCTCAAGA AGAACTCGCG AAGCGCGAAA ACGAATTGAA 4 320 

AAAAGAATTG TTTGAACTTC GTTTCCAAGC TGCTACTGGT CAATTGGAAC AAACAGCTCG 4380 

CTTGAAAGAA GTTAAAAAAC AAATCGCTCG CATCAAAACA GTTCAATCTG AAGCGAAATA 4 440 

ATAGACTAGG GAAGGAGAAA TTTCAATGGA ACGCAATAAT CGTAAAGTTC TTGTTGGACG 4500 

TGTTGTATCT GACAAAATGG ACAAGACAAT CACAGTTGTA GTTGAAACAA AACGTAACCA 4560 

CCCACTCTAT GGTAAACGTA TTAACTACTC TAAAAAATAC AAAGCTCATG ATGAAAACAA 4620 

TGTTGCCAAA GAAGGCGATA TCGTACGTAT CATGGAAACT CGCCCGCTTT CAGCTACAAA 4680 

ACGTTTCCGT CTTGTAGAAG TTGTTGAAGA AGCGCTCATC ATCTAATCAA ACCTGAAAGG 4740 

AGAAAACTGA AATGATTCAA ACAGAAACTC GTTTGAAAGT CGCAGACAAC AGCGGTGCTC 4800 

GCGAAATCTT GACTATCAAA GTTCTTGGTG GTTCAGGACG TAAATTTGCA AACATCGGTG 4860 

ATGTTATCGT GGCATCTGTA AAACAAGCTA CTCCTGCTGG TGCCGTTAAA AAAGGTGACG 4920 

TTGTTAAAGC AGTTATCGTT CGTACTAAAT CAGGTGCTCG TCGTGCTGAT GGTTCATACA 4980 
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TCAAATTTGA CGAAAACGCA GCAGTTATCA TCCGTGAAGA CAAAACTCCT CGCGGAACAC 5040 

GTATCTTTGG CCCAGTTGCA CGTGAATTGC GTGAAGGTCG CTTCATGAAG ATCGTGTCAC 5100 

TTCCTCCAGA AGTACTTTAA TTTTTAGGAA CAAACTAGTC CCCTAGCTTC AAGCTAGGGT 5160 

GCCCTTATGG GCGTAAGAAA AATCAAGGAG AAACCTAATG TTTGTAAAAA AAGGCGACAA 5220 

AGTTCGCGTA ATCGCTGGTA AAGATAAGGG AACAGAAGCT GTTGTCCTTA CTGCCCTTCC 5280 

AAAAGTAAAC AAAGTTATCG TTGAAGGTGT TAACATTGTT AAGAAACACC AACGTCCAAC 534 0 

TAACGAGCTT CCTCAAGGTG GTATCATCGA GAAAGAAGCA GCTATCCACG TATCAAACGT 5400 

TCAAGTTTTG GACAAAAATG GTGTAGCTGG TCGTGTTGGA TACAAATTTG TAGACGGTAA 5460 

AAAAGTTCGC TACAACAAAA AATCAGGCGA AGTGCTTGAT TAATCACGAA GGAAAGGAGA S520 

AGTATAATCG CAAATCGTTT AAAAGAAAAA TATCTTAATG AACTAGTTCC TGCTTTGACA 5580 

GAACAATTCA ACTACTCATC AGTGATGGCT GTGCCTAAAG TAGATAAGAT TGTTTTGAAC 5640 

ATGGGTGTTG GTGAAGCTGT ATCAAACGCT AAAAGCCTTG AAAAAGCTGC TGAAGAATTG 5700 

GCACTTATCT CAGGTCAAAA ACCACTTATC ACTAAAGCTA AAAAATCAAT CCCCGGCTTC 5760 

CGTCTTCGTG AAGGTGTTGC GATCGGTGCA AAAGTTACCC TTCGTGGTGA ACGTATGTAC 5820 

GAATTCTTGG ATAAATTGGT ATCAGTTTCA CTTCCACGTG TACGTGACTT CCACGGTGTC 5880 

CCAACAAAAT CATTTGATGG ACGCGGGAAC TACACACTTG GTGTGAAAGA ACAATTAATC 5940 

TTCCCAGAAA TCAACTTCGA TGACGTTGAC AAAACTCGTG GTCTTGACAT CGTTATCGTA 6000 

ACAACTGCTA ACACTGACGA AGAGTCACGT GCATTGCTTA CAGGCCTTGG AATGCCTTTT 6060 

GCAAAATAAT ATAGGAGGTA AATCTAATGG CTAAAAAATC AATGGTACCT AGAGAGGCTA 6120 

AACGCCAAAA AATTGTTGAC CGTTATGCTG AAAAACGTGC TGCATTAAAC CCGGCAGGGG 6180 

ACTACGAAGG TTTATCTAAA TTACCTCGCA ACGCCTCACC GACTCGTTTA CATAATCGTT 6240 

GTAGGGTTAC GGGGCGCCCA CATTCACTTT ACCGCAAATT TGGTCTGAGT CCTATCGCTT 6300 

TTCGCGAACT TGCGCATAAA CGTCAAATTC CTGGTGTAAC AAAAGCATCT TGGTAATTTA 6 360 

AGATATCAAG AGCGTCAAAA CTCCAAGTAA AAATAGGAAA CTTGACGAAG AAACTAAAGT 6420 

TTCTAGGAAA GTTTATCTTT TTCACACAGA GTTTACCCCG GGTTCAATTG GGCTTGCCAA 6480 

TTTGAACACG AG CT ACAGCT TTGGCAAAAA AGACCAATTT GCTTTGGAGC ATTGCTTCTG 6540 

CATTAAATTG TCTATTTTTG CTCGTGCTGT TACGCTCTTT GTATCATGTA TTAACTAGCA 6600 

AGTGCAACTT GCAAACTACT AGTAAGAGGA GAAAAACAAA ATGGTTATGA CTGACCCAAT 6660 

CGCAGACTTC CTAACTCGTA TTCGTAATGC TAACCAAGCT AAACACGAAG TACTTGAAGT 672 0 

ACCTGCATCA AACATCAAAA AAGGGATTGC TGAAATCCTT AAACGCGAAG GTTTTGTAAA 6780 
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AAACGTTGAA ATCATTGAAG ATGACAAACA AGGCGTCATC CGTGTATTTC TTAAATACGG 6840 

ACCAAATGGT GAGAAAGTTA TCACTAACTT GAAACGTGTT TCTAAACCAG GACTTCGTGT 6900 

CTACAAAAAA CGTGAAGACC TTCCAAAAGT TCTTAACGGA CTTGGAATTG CCATCCTTTC 6960 

AACTTCTGAA GGTTTGCTTA CTGATAAAGA AGCACGCCAA AAGAATGTTG GTGGTGAGGT 7020 

TATCGCTTAC GTTTGGTAAA ATCAAGATAC AAAGCTCCTA AAGAACAAAG CAAAATTAGG 7080 

AAGTTGGAGA AGTTTGTTTA CAAACAAGCC AACTTATCTA TTTTGCACAG TTCTTAGACC 7140 

GTGTTCAGTT CAGCTCTTGA ACTAAATAAG TATCTGAACC CCGTGAAAAC TGGCCGTTCT 7200 

GCCCTGACAA TTTAACAGGA GAAAATAAAC ATCTCACCTA TTGGTAATAA AGTTATCGTG 72 60 

TTCCCTGCTC GTGTTGAACT CGCTAACAAT GACAACGTTG TAACTGTAAA AGGATCTAAA 7320 

GGAGAACTTA CTCGTGAGTT CTCAAAAGAT ATTGAAATCC GTGTGGAAGG TACTGAAATA 7380 

ACTCTTCACC GTCCAAACGA TTCAAAAGAA ATGAAAACTA TCCACGGAAC TACTCGTGCC 7440 

CTTTTGAACA ACATGGTTGT TGGTGTATCA GAAGGATTCA AGAAAGAACT TGAAATGCGT 7500 

GGGGTTGGTT ACCGTGCACA GCTTCAAGGA TCTAAACTTG TTTTGGCTGT TGGTAAATCT 7560 

CATCCAGACG AAGTTGAAGC TCCAGAAGGA ATTACTTTTG AACTTCCAAA CCCAACAACA 7620 

ATCGTTGTTA GCGGAATTTC AAAAGAAGTA GTTGGTCAAA CAGCTGCTTA CCTACGTAGC 7680 

CTTCGTTCAC CAGAACCATA TAAAGGTAAA GGTATCCGTT ACGTTGGTGA ATTCGTTCGC 7740 

CGTAAAGAAG GTAAAACAGG TAAATAATGT TGAGTGCTTG ATCATCAACC ACCAACCTAT 7 800 

TTTCCAACTT TGTGCATAGC ACACGATTTA AAACTAAAGA GGTGAAAACT GTGATTTCAA 7860 

AACCAGATAA AAACAAACTC CCCCAAAAAC GCCACCGTCG CCTTCGCCGA AAACTCTCTG 7920 

GAACTGCTGA TCGCCCACGT TTGAACGTAT TCCGTTCTAA TACAGGCATC TACGCTCAAG 7980 

TGATTGATGA CGTAGCGGGT GTAACGCTCG CAAGTGCTTC AACTCTTGAT AAAGAAGTTT 8040 

CAAAAGGAAC TAAAACTGAA CAAGCCGTTG CTGTCGGTAA ACTCGTTGCA GAACCTGCAA 8100 

ACGCTAAAGG TATTTCAGAA GTGGTGTTCG ACCGCGGTGG ATATCTATAT CACGGACGTG 8160 

TGAAAGCTTT GGCTGATGCA GCTCGTGAAA ACGGATTGAA ATTCTAATAG GAGGACACTA 8220 

GAAAATGGCA TTTAAAGACA ATGCAGTTGA ATTACAAGAA CGCGTACTTG CTGTCAACCG 8280 

TGTTACAAAA GTTGTTAAAG GTGGACGTCG TCTTCGTTTC GCAGCTCTTG TTGTTGTTGG 8340 

TGACCACAAT GGTCGCGTAG GATTTGGTAC TGGTAAAGCT CAAGAAGTTC CAGAAGCAAT 8400 

CCGTAAAGCA GTAGATGATG CTAAGAAAAA CTTGATCGAA GTTCCTATGG TTGGAACAAC 8460 

AATCCCACAC GAAGTTCTTT CAGAATTCGG TGGAGCTAAA GTATTGTTGA AACCTGCTGT 8520 
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AGAAGGTTCT 


GGAGTTGCCG 


CTGGTGGTGC 


AGTTCGTGCC 


GTTGTGGAAT 


TGGCAGGTGT 


8580 


GGCAGATATT 


ACATCTAAAT 


CACTTGGTTC 


TAACACTCCA 


ATCAACATTG 


TTCGTGCAAC 


8640 


TGTTGAAGGT 


TTGAAACAAT 


TGAAACGCGC 


TGAAGAAATT 


GCTGCCCTTC 


GTGGTATTTC 


8700 


AGTTTCTGAT 


TTGGCATAAG 


AAAGGGGATA 


AAATGGCTCA 


AATTAAAATT 


ACTTTGACTA 


8760 


AGTCTCCAAT 


CGGACGCATT 


CCATCACAAC 


GTAAAACTGT 


TGTAGCACTT 


GGACTTGGCA 


8820 


AATTGAACAG 


CTCTGTTATT 


AAAGAAGATA 


ACGCTGCTAT 


CCGTGGTATG 


ATCACAGCAG 


8880 


TATCTCACTT 


AGTAACAGTT 


GAAGAAGTAA 


ACTAATGAaG 


TTTTAGGGGA 


TGTGCACTGT 


3940 


ACCATCCCCT 


AAAACTACAT 


ATAGTCATCT 


ATGATGACAT 


CGTATAGGCG 


AGTTGATGGG 


9000 


GGAGACAACC 


TTTTCTCCCT 


TATCGGCGCT 


AGCATTTTAC 


AAAAG AGG AG 


AAAATAAAAA 


9060 


TGAAACTTCA 


TGAATTGAAA 


CCTGCAGAAG 


GTTCTCGTAA 


AGTACGTAAC 


CGCGTTGGTC 


9120 


GTGGTACTTC 


ATCAGGTAAC 


GGTAAAACAT 


CTGGTCGTGG 


TCAAAAAGGT 


CAAAAAGCTC 


9180 


GTAGCGGTGG 


CGGAGTTCGC 


CTTGGTTTTG 


AAGGTGGACA 


AACTCCATTG 


TTCCGTCGTC 


9240 


TTCCAAAACG 


TGGATTCACT 


AACATCAACG 


CTAAAGAATA 


CGCAATTGTG 


AACCTTGACC 


9300 


AATTGAACGT 


CTTTGAAGAT 


GGTGCTGAAG 


TAACTCCAGT 


TGTTCTTATC 


GAAGCAGGAA 


9360 


TTGTTAAAGC 


TGAAAAGTCA 


GGTATTAAAA 


TTCTTGGTAA 


CGGTGAGTTC 


ACTAAGAAAT 


9420 


TGACTGTGAA 


AGCAGCTAAA 


TTCTCTAAAT 


CAGCTGAAGA 


AGCTATCACT 


GCTAAAGGTG 


9480 


GTTCAGTAGA 


AGTCATCTAA 


GAGAGGTGAC 


CTATGTTTTT 


TAAATTATTA 


AGAGAAGCTC 


9540 


TTAAAGTCAA 


GCAGGTTCGA 


TCAAAAATTT 


TATTTACAAT 


TTTTATCGTT 


TTCGTCTTTC 


9600 


GTATCGGAAC 


TAGCATTACA 


GTTCCTGGTG 


TGAATCCCAA 


TAGCTTGAAT 


G CTTT AAGTC 


9660 


GATTATCCTT 


CTTAAACATG 


TTGAGCTTGG 


TGTCGGGGAA 


TGCCCTAAAA 


AACTTTTCGA 


9720 


TTTTTGCCCT 


AGGAGTTAGT 


CCCTATATCA 


CCGCTTCTAT 


TGTTGTCCAA 


CTCTTGCAAA 


9780 


TGGATATTTT 


ACCCAAGTTT 


GTAGAGTGGG 


GTAAACAAGG 


GGAAGTAGGT 


CGAAGAAAAT 


9840 


TGAATCAAGC 


I AL I 1 Lni 


ATTflfTCTAG 
nil \_> i w i 


TTCTCGCTTT 


TGTGCAATCT 


ATCGGGATTA 


9900 


CACCTGGTTT 


TAATACCTTG 


GCTGGAGCTC 


AATTGATTAA 


AACTGCTTTA 


ACTCCACAAG 


9960 


TTTTTCTGAC 


GATTGGTATC 


ATCTTAACAG 


CTGGTAGTAT 


GATTGTCACT 


TGGTTGGGTG 


10020 


AGCAAATTAC 


AGATAAGGGA 


TACGGAAACG 


GTGTTTCCAT 


GATTATCTTT 


GCCGGGATTG 


10080 


TTTCCTCAAT 


TCCAGAGATG 


ATTCAGGGCA 


TCTATGTGGA 


CTACTTTGTG 


AACGTCCCAA 


10140 


GTAGCCGTAT 


CACTTCATCT 


ATCATTTTCG 


TAATCATTTT 


GATTATTACT 


GTATTGTTGA 


10200 


TTATTTACTT 


TACAACTTAT 


GTTCAACAAG 


CAGAATACAA 


AATTCCAATC 


CAATATACTA 


10260 


AGGTTGCACA 


AGGTGCTCCA 


TCTAGCTCTT 


ACCTTCCGTT 


AAAAGTAAAC 


CCTGCTGGAG 


10320 
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TTATCCCTGT TATCTTTGCC AGTTCGATTA CTGCAGCcTG CGGCTATTCT TCAGTTTTTG 10380 

AGTGCCACAG GTCATGATTG GGCTTGGGTA AGGGTAGCAC AAGACATGTT GGCAACTACT 10440 

TCTCCAACTG GTATTGCCAT CTATGCTTTG TTGATTATTC TCTTTACATT CTTCTATACG 10500 

TTTGTACAGA TTAATCCTGA AAAAGCAGCA GAGAJcCCTAC AAAAGAGTCG TGCCTATATC 10560 

CATGGAGTTC GTCCTGGTAA AGGTACAGAA GAATATATGT CTAAACTTCT TCGTCGTCTT 10620 

GCAACTGTTG GTTCCCTCTT CCTTGGTGTG ATTTCCATTT TACCGATTGC ACCTAAAGAT 10680 

GTATTTGGTC TTTCTGATGT TGTTGCCTTT GGTGGAACAA GTCTCTTGAT CATTATCTCT 10740 

ACAGGTATCG AAGGAATCAA GCAATTGGAA GGTTACCTAT TGAAACGTAA GTATGTTGGT 10800 

TTCATGGACA GAACAGAATA AAAGTATTTA CTGAATCACT AAATACTGAG GGAGTGGAGG 10860 

TTTAAACTCT GACATTTGTA AGAGTTGCAT CTCCCCTCTT CTATTTTGTT TTTAAATCGG 10920 

GGTGAAAAGA CTTTTTGCTT CTATTTAAAA ATAAAATAAC GAGATCAAAT CATGAATCTT 10980 

TTGATTATGG GCTTACCTGG TGCAGGTAAG GGAACTCAAG CAGCAAAAAT CGTAGAACAA 11040 

TTCCATGTTG CACATATCTC AACAGGTGAT ATGTTCCGCG CTGCAATGGC AAATCAAACT 11100 

GAAATGGGTG TTCTTGCTAA GTCATATATT GACAAGGGTG AATTGGTTCC TGACGAAGTT 11160 

ACAAATGGAA TCGTAAAAGA ACGCCTTTCA CAAGATGATA TTAAAGAAAC AGGATTCTTA 11220 

TTGGATGGTT ACCCACGTAC AATTGAACAA GCTCATGCCT TGCACAAAAC ATTGGCTGAA 11280 

CTTGGCATTG AACTAGAAGG TGTTATCAAT ATTGAAGTGA ACCCTGACAG CCTTTTGGAA 11340 

CGTTTGAGTG GGCGTATCAT CCACCGCGTA ACTGGAGAAA CTTTCCACAA GGTCTTTAAC 11400 

CCACCAGTTG ACTATAAAGA AGAAGATTAC TACCAACGTG AAGATGATAA GCCTGAGACA 11460 

GTAAAACGTC GTTTGGATCT TAATATTGCT CAAGGAGAAC CAATCATTCC 7CACTACCGT 11520 

GCCAAAGGTT TGGTTCATGA CATCGAAGGT AATCAAGATA TCAATCATGT CTTCTCAGAT 11580 

ATTGAAAAAG TATTCACAAA TTTGAAATAA AGCCTTTTTC ACACTTGCAA AAATCCGCTA 11640 

CAAATGTTAT ACTGAGATAG TCTGACTTAT AATTGTTGTC TCTGTCTCTA GAGGCATCGA 11700 

ATCGAAATTT ATGGAGGTGC TTTTGCGTGG CAAAAGACGA TGTGATTGAA GTTGAAGGCA 117 60 

AAGTAGTTGA TACAATGCCG AATCCAATGT TTACGGTTGA ACTTGAAAAT GGACATCAGA 11820 

TTTTAGCAGG G 11831 
(2) INFORMATION FOR SEQ ID NO: 66: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10726 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

CCCGGCATTT GAAAGCTATT CGTGAAGGAT TTATGATGGC AATGCCTTTG ATTTTAGTCG 60 

GCTCTTTATT TCTTATTCTA ATCAGTTGGC CTCAAGAGGC TTTTACAAAT TGGCTGAATA 120 

GTGTTGGATT GCTAAGTATC TTGACAACTA TGAATCAGTC AACAGTAGCG ATTATCTCCT 180 

TGGTCGCTTG TTTCGGTATT GCCTACAGGT TGTCGGAAGG ATATGGTACA GATGGTCCGT 24 0 

CGGCAGGGAT CATAGCCTTA TCCAGTTTTG TATTGATGGC ACCTCGTTTT TCGAGTATGG 300 

TTTATGATAA AAATGGGGAG CAGGTCAAGC AGTTATTTGG CGGCGCAATA CCATTTTCTA 360 

GCCTGAATGC ATCTTCTTTG TTTATGGCCA TTACTATTGG ATTGGTTACA GCAGAGATTT 420 

ATCGTATGTT TATCCAGCGC GGAATTACGA TAAAAATGCC AAGTGGTGTC CCAGATGTAG 4 80 

TAAGTAAATC ATTTTCAGCT CTTTTATCTG GTTTTACTAC TTTTGTTTTG TGCGCTTTGG 540 

TCTTAAAAGG TCTTGAAGCG GCAGGAGTTG CAGGAGGTCT CAACGGACTC CTAGGTGCAA 600 

TTGTTGGAAC ACCGCTTAAG TTAATTGCAG GAACGCTTCC AGGTATGATT CTATGTGTTA 660 

TTGTAAACTC ATTCTTTTGG TTCTGTGGAG TTAATGGGGG ACAAGTTTTA AATGCTTTTG 720 

TAGACCCAGT TTGGTTACAA TTTACTACAG AAAACCAAGA AGCTCTGCCT GCAGGACAAA 780 

CACTCCAACA CATTATTACA TTACCGTTTA AAGATTTATT TGTATTTATT GGTGGCGGTG 840 

GAGCGACTAT TGGTCTTGCG ATTTGTCTCT TCCTATTTAG TAAGAGTCGT GCGAATAAAA 900 

CATTAGGTAA GCTAGCTATT ATACCGTCTA TTTTTAATAT CAATACAGCT ATTCTATTTA 960 

CGTTTCCAAC AGTTTTAAAT CCGATTATGC TGATTCCGTT TATTGCTACT CCTACAATCA 102 3 

ATGCCTTGAT TACCTATGTA TCAATCCCTG TAGGATTAGT ACCCTATACA ACAGGTGTAA 1080 

TCCTTCCGTG GACAATGCCA CCGATTATAG GAGGCTTCCT TGCAACAGGG GCTAGTTGGC 114 0 

GAGGAGCTCT ATTACAAGTT GTTTTGATTT TGGTTTCTGT AGCAATTTAT TATCCATTCT 1200 

TCAAAATTGC AGATAAACGC AATCTTCAAA AAGAAAAAGC TACTGTTGGA GGGAAATAAG 1260 

ATGGTTATCA GAGTATTTGA TCAACAGAAA AATACTTATT CTAGCTTTGC CTTAGAGGAA 1320 

TTAAGTTACT ATATGAATCG GGTCTTTAAG ACTAACATAG AGCTTGTCGA GGAGAAGGAA 1380 

GCGGATATTT TTGTAGGATT AGTCAATAAA GAGGACAGAA AAGACCATGT TCTTATCTCA 1440 

TTAGACAAGG GTAAGGGGAG AATTGAGTCT AATACAATTG TAGGTTTACT TATTGGAATT 1500 

TACCGAATGT TTCATGAATT TGGGGTTGTG TATACTAGAC CAGGGCGCAG ACATGACTTT 1560 

GTTCCAGACT TACGATTTGA AGATTTTTTA GATAAACAGC TATCTATACA TGAAACAGCC 1620 
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AGTTACTATC 


ATAGGGGAGT 


ATGTATAGAG 


GGAGCGGATT 


CATTTGAAAA 


TATACTAGAT 


1680 


TTCATTGATT 


GCCTACCTAA 


GATTGGGATG 


AACAGTTTTT 


TCATCCAGTT 


TGAAAATCCT 


1740 


TACTCTTTTT 


TGAAACGTTG 


GTATGAACAT 


GAATTTAATC 


CATATCTAAA 


TAAAGAACAA 


1800 


TTTTCAAATG 


AATTAGTACA 


AGAATTGAGT 


GATAGGTTGG 


ATAAAGAATT 


GCAAAAAAGA 


1860 


GGTCTTATTC 


ATCATCGTGT 


TGGTCATGGA 


TGGACAGGTG 


AAGTTTTAGG 


TTACTCTTCA 


1920 


AAATTTGGCT 


GGGAATCAGG 


TCTTAGTATT 


TCAGAGGAGA 


AGAAACCCTA 


TGTCGCTGAA 


1980 


ATAAACGGGA 


AACGAGAATT 


GTTTAATACG 


GCTCCGATTT 


TAACCAGCCT 


GGATTTTTCA 


2040 


AATCCAGATG 


TAGCTGATAA 


GATGGTAGAA 


ATTATCAAGG 


ATTATGCCAA 


GAAAAGACCT 


2100 


GATGTTAACT 


ACTTACATGT 


ATGGTTGTCG 


GATGCTCGTA 


ATAATATTTG 


TGAATGCGAA 


2160 


AACTGTAGAC 


AAGAATTCGT 


TTCGGATCAG 


TATATTCGTA 


TTCTCAATCA 


ATTGGATAGG 


2220 


GCTTTAACGA 


GTGAGGGATT 


AGATACAAAG 


ATTTGTTTTC 


TGCTTTATCA 


TGAGTTGTTA 


2280 


TGGCCACCTC 


AGAAAGAAAA 


ATTAGATAAT 


CCTGAACCCT 


TTACCATGAT 


GTTTGCACCG 


2340 


ATTACAAGAA 


CATTTGAAAT 


GAGTTATGCA 


CATGTACATT 


TTGACAATTC 


CATACCTACG 


2400 


CCTAAACCTT 


ATATGCCTAA 


TAAAATTATA 


CTTCCGAATT 


CTCTTGAGCA 


AAATTTATCT 


2460 


TATCTTTTTG 


AGTGGCAAAA 


AGCATTTAAA 


GGAGATAGTT 


TCGTATATGA 


CTATCCTTTA 


2520 


GGGCGTGCTC 


ATTATGGCGA 


TTTAGGCTAT 


ATGAAAATTA 


GTCAAACTAT 


TTACAGAGAT 


2580 


GTATCTTATC 


TTTCCAACCT 


ACATTTGAAC 


GGGTACATTT 


CGTGTCAAGA 


ATTACGTGCC 


2640 


GGATTCCCTC 


ATAATTTTCC 


TAATTATGTC 


ATGGGGGAAA 


TGCTCTGGAA 


GAAGACAAGA 


2700 


AGTTATGAAG 


AATTGATTGA 


AGAATACTTT 


TCTGCTTTGT 


ATGGGGAAAA 


TTGGCAGTCT 


2760 


GTTGTTC AAT 


ATTTAGAAAA 


ATTATCCAT? 


^^^^^^ ^^^^^fl 


GTGATTATTT 


TAATGCAATT 


2820 


GGCAGCCCTC 


AAAGTGATGT 


TTTAGCGAAT 


CATTATTATA 


TAGCTTACAA 


TCTACCTGAT 


2880 


AATTTTTTAC 


CAATTATTGA 


GGAAAATATT 


TCTAAGTTAT 


TAAATAGTCA 


AAAGGATGAA 


2940 


TGGAAACAGC 


TCAGTTATCA 


TCGTGAATAT 


GTTGTTAAGA 


TCGCGAAGGC 


TTTATATCTT 


3000 


CAAGCAACTG 


GAAAAACAAG 


GCAAGCTCAA 


GATGAATGGA 


GAAATGTGTT 


GAATTATATC 


3060 


CGTGGGCACG 


AATTGCTATT 


TCAATCTAAT 


TTGGATGTTT 


ATCGTGTAAT 


TGAAGTAGCA 


3120 


AAAAATTACG 


CTGGTTTCCA 


CTTATAAATC 


ATAAGTATAG 


AAAATGAACT 


AAGGTATTCA 


3180 


GAGAAGATTG 


ATCCTAAATA 


TTATGAAATT 


TAACGATTTT 


TAAGATATTT 


AGGGTCAACT 


3240 


TTCTATTTAT 


ATCGTAGCGA 


AGTCATTTTA 


ATAATGATGT 


GTAAAAGATG 


GATCAAGATT 


3300 


GAGGAGGAAG 


AAAGATGAAA 


TCAAAAGAAG 


AAATAAATAT 


GCTTGGTTTT 


ACAATTGTCG 


3360 
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CTTACGCAGG 


AGATGCAAGG 


TCAGATTTGA 


TGGATGCTTT 


GGCGTTTGCG 


AGAGATGGAT 


3420 


ATTTTGAACA 


GGCAAGAGAA 


TTGGTTGAGT 


CTGCAAACGA 


CTCAATAGTG 


TCTGCCCATC 


3480 


GAGAACAGAC 


TAATTTATTA 


GCGGAGGAGG 


CATATGGAGA 


TAATTTTGAA 


GTGAGCTTTA 


3S40 


TTATGATTCA 


TGGTCAAGAT 


ACTTTGATGA 


CAACGATGCT 


ATTGTATGAT 


CAGGTAAAGT 


3600 


TTTTTATTGA 


TGAATATGAA 


CGAATTCGAA 


AGATTGAAGA 


ACRTATTGCT 


TTGCAATGAG 


3660 


GATTAGTCAT 


GGAAAATTTA 


CAGGTTAAAG 


CCTTACCGAA 


GGAGTTTTTA 


TTAGGAACTG 


3720 


CTACCGCTGC 


TTATCAAGTA 


GAGGGTGCAA 


CTAGGGTAGA 


TGGCAAAGGA 


ATAAATATGT 


3780 


GGGATGTTTA 


TTTGCAAGAA 


AATAGTCCGT 


TCTTACCAGA 


TCCAGCTAGT 


GATTTTTATT 


3840 


ATCGTTACGA 


AGAGGATATA 


GCTTTGGCGG 


CAGAACATGG 


TTTGCAGGCT 


TTGCGTTTAT 


3900 


CTATTTCTTG 


GGTTCGTATA 


TTTCCTGATA 


TAGATGGGGA 


TGCTAATGTA 


TTAGCTGTTC 


3960 


ATTATTACCA 


TAGAGTTTTT 


CAGTCTTGCT 


TAAAACATAA 


TGTGATTCCG 


TTTGTTTCTT 


4020 


TACATCATTT 


TGATTCGCCT 


CAGAAAATGT 


TACAAACAGG 


GGATTGGTTG 


AACAGAGAGA 


4080 


ATATTGATCG 


TTTCATACGA 


TATGCTCGCT 


TTTGTTTCCA 


AGAATTTACA 


GAAGTCAAGC 


4X40 


ATTGGTTTAC 


AATCAATGAA 


CTCATCTCTC 


TTGCTGCAGG 


TCAATATATA 


GGAGGTCAGT 


4200 


TTCCTCCAAA 


TCATCATTTT 


CAATTATCTG 


AAGCAATTCA 


AGCGAATCAT 


AATATGTTGT 


4260 


TGGCGCATGC 


TCTTGCACTC 


CTCGAATTTC 


ATCAATTACG 


GATTGAGGGA 


AAGGTAGGTT 


4320 


GTATTCATGC 


TTTAAAGCCA 


GGCTATCCTA 


TTGATGGGCA 


AAAAGAAAAT 


ATTTTGGCAG 


4380 


CTAAACGGTA 


TGATGTTTAT 


AATAATAAAT 


TTCTATTAGA 


TGGAACTTTT 


TTGGGCTACT 


4440 


ACAGTGAGGA 


CACGCTTTTT 


CACTTGAATC 


AAATATTGGA 


AGCTAATAAT 


TCTAGCTTTA 


4500 


TTATTGAAGA 


TGGTGATTTA 


GAAATTATGA AGAGACCTGC 


ACCTCTTAAT 


ACGATGTTTG 


4560 


CCATCAATTA 


TTATCG77CA 


GAATTTATTC 


GTGAATACAA 


AGGTGAAAAT 


ACACAAGAAT 


4620 


TTAATTCAAC 


AGGAATAAAA 


GGACAGTCTT 


CTTTTAAATT 


AAATGCTCTA 


GGTGAATTTG 


4680 


TAAAAAAAPC* 


TGGTATTCCG 


ACAACAGATT 


GGGATTGGAA 


TATTTATCCT 


CAAGGGTTAT 


4740 


TTGATATGTT 


GCTTCGTATC 


AAAGAAGAAT 


ATCCTCAACA 


TCCGGTCATT 


TATTTAACTG 


4800 


AAAATGGTAC 


AGCCCTTAAA 


GAAGTTAAGC 


CAGAGGGCGA 


GAATGATATT 


ATTGATGACA 


4860 


GTAAGAGAAT 


CCGTTATATT 


GAGCAACATT 


TACACAAAGT 


TTTAGAGGCT 


CGAGATAGAG 


4920 


GAGTCAATAT 


TCAAGGCTAT 


TTTATATGGT 


CTTTGCAAGA 


TCAATTTTCT 


TGGGCGAATG 


4980 


GCTACAATAA 

t 


GCGATATGGT 


CTTTTCTTTG 


TTGATTATGA 


AACACAGAAG 


AGATATATTA 


5040 


AGAAAAGTGC 


TCTTTGGGTA 


AAAGGGCTAA 


AACGGAATTA 


AGGTTAGCGA 


TTTGACTGAT 


5100 


GTTTAATATG 


TTTTAAATAT 


GAGGTTGAAT 


TTTTTATAGG 


AGGAGTTTTA 


TGGATAAGCT 


5160 
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AGTCCCTGCC 


ATTGAAAAGC 


AACAAGGGAA 


ATTTGAAAAA 


ATTTCTACTA 


ATAACTATAT 


5220 


GATGGCTATT 


AAAGATGGAT 


TCATTGCTAC 


TATGCCTTTA 


ATTATGTTTT 


CAAGCTTTTT 


5280 


GATGATTATT 


ATTATGATTC 


CTAAAAATTT 


CGGAGTAGAG 


TTACCGAGTC 


CAGCTATTGT 


5340 


CTGGATGAGA 


AAAGTGTATA 


TGTTAACCAT 


GGGAGTTTTG 


GGTATTATTG 


TTTCAGGGAC 


5400 


TGTTGGAAAG 


TCATTAGTTG 


GAAATGTTAA 


CAGAAAAATG 


CCTCACGGAA 


AGGTAATAAA 


5460 


TGATATTTCT 


GCAATGTTGG 


CAGCCATATG 


TAGTTATCTG 


GTATTAACTG 


TAACGCTTGT 


5520 


AGTTGATGAG 


AAGACGGGAT 


CTACAAGTTT 


GTCGACAAAC 


TATTTAGGAT 


CTCAAGGATT 


5580 


GATAACTTCG 


TTTGTCAGTG 


CCTTTATTAC 


TGTAAATGTT 


TACCGATTCT 


GTATTAAGCG 


5640 


AGACATTACT 


ATTCATTTAC 


CTAAGGAAGT 


TCCTGGGGCT 


ATATCACAAG 


CTTTTAGAGA 


5700 


TATTTTCCCT 


TTTTCTTTTG 


TTTTACTTAT 


TAGTGGTTTG 


TTAGATATTG 


TATCTCGGTT 


5760 


TAGTTTAGAT 


GTTCCTTTTG 


CCCAAGTATT 


TCAACAACTA 


TTGACTCCTA 


TTTTTAAGGG 


582Q 


GGCAGAATCA 


TATCCTGCTA 


TGATGTTGAT 


TTGGTTTATG 


TGTGCTTTGC 


TTTGGTTTGT 


5860 


TGGAATTCAT 


GGACCATCTA 


TTGTCTTACC 


TGCTGTTACA 


GCTTTGCAAC 


TGAGCAATAT 


5940 


GGAAGAGAAT 


GCTCAACTTC 


TTGCAAATGG 


GCAGTTCCCT 


TATCATTCTT 


TAACACCTAA 


600O 


TTTCGGGAAT 


TATATCGCTG 


CTATTGGACG 


AACGGGGGCT 


ACCTTTGTTG 


TACCATTTAT 


6060 


TTTGATTTTC 


TTTATGCGGT 


CTAAACAATT 


AAAATCGGTA 


GGTAAAGCTA 


CAATTACTCC 


6120 


TGTTTTATTT 


CCGGTAAATG 


AACCTCTTCT 


ATTTGGTATG 


CCTGTTATTT 


TGAATCCCTA 


6180 


TCTTTTTGTC 


CCTTTTTTGA 


TGACTCCACC 


AGTGAATGTA 


TTTCTAGGAA 


AGGTCTTTAT 


6240 


TGATTTCTTT 


GGAATGAATG 


GATTTTATAT 


CCAGTTACCT 


TGGACCTTTC 


CTGGTCCCTT 


6300 


CGGATTCTTA 


ATTGGAACGA 


ATTTTCAACT 


TATCTCCTTT 


GTATTTTTA? 


CTTTGATTTT 


6360 


AGTTGTCGAC 


ATATTGATTT 


ATTTGCCATT 


CTGTAGAGCG 


TATGATAGAC 


AGTTACTGGT 


6420 


GAAAGAAGAT 


ATTGCAAGCT 


CAAATGATAT 


TATTTTAGAG 


GAGGATACAA 


GTGAAATAAT 


6480 


TCCTGGTGAG 


ATAGATGAAA 


TAAAAAGTAA 


GGAGTTGAAA 


GTACTGGTTC 


TTTGTGCAGG 


6540 


GTCTGGAACA 


AGTGCGCAAT 


TAGCCAATGC 


AATTAACGAG 


GGGGCTAACT 


TAACAGAGCT 


6600 


TAGAGTGATT 


GCGAATTCAG 


GAGCGTACGG 


AGCTCATTAT 


GATATTATGG 


GTGTTTATGA 


6660 


TTTAATTATT 


CTGGCCCCAC 


AAGTTCGGAG 


TTATTATAGA 


GAGATGAAGG 


TGGATGCAGA 


6720 


AAGATTAGGT 


ATTCAGATAG 


TTGCTACCAG 


AGGAATGGAA 


TATATTCATT 


TAACAAAGAG 


6780 


TCCAAGTAAA 


GCCTTACAAT 


TTGTATTGGA 


GCATTACCAA 


GCTGTGTAGT 


AAGTTTTTCC 


6840 


ATCTTTTATT 


TGAGTAAAGA 


TTTTGTTTAC 


AGATAGGCTT 


GGATTTAAAA 


ACGTTCCCCC 


6900 
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TTTTTTAATA TAAGAATCCC TCTTTCACAA TTGTAAAAAG ACGGATTTTG TATTTTATCT 6960 

CTTAGACCAA GTTCTCTTCA TAAAGAGAAG GAGGATTGGG TAAATCTCCA AGCGCCCTGC 7020 

AATCATTGCA AAGGATAGGA GAATTTTTGA GATGCGACTA AAGATTGAGA AACTAGAAGT 7080 

GGTTCCTAGA ATAGGCCCGA TATTATTGAA ACAGCTAAAG ACAGCGCTGG TCACGACCAG 7140 

AAAATCATTG CTATCTAGGC TGACAATAAA GATAACCGCT AGCAAAATCA TAGCATAGAT 7200 

GACAAAGTAC TTGAGAATCT TATGCTGGGT ATCTTTGTCA ATCACCGTTT TATTAACATC 7260 

GAGGGTCAAA ACACGGTGGG GCGATAGGAT TGACAAAATT TGGTTTTTGG CAATTTTTGA 7320 

AAGGATGAGG CCTCGAATAA TCTTGAGTCC ACCTGCAGTT GATCCAGCAG AGCCACCGAT 7 380 

TGCCATGAGG AAAAGGAGGA TAAACTGGGA GAAGAGGGGC CAGTTGGTAA TATCTCCATA 7440 

TCCAAAACCA GTTGTTGTAA TGATGTTGGA AACCTGGAAG AAGGTCATTT CAAAGCTCTT 7500 

TGAAAACCCT GGGTAGAGGT AGAGGGTGTT GAGGCTAATC AAGCCTGTAG AAACCAGTAC 7560 

AATGACCAAG TAAGCCCTAA GCTCTTCATC TCCAAAGAAG GCCTTGATGC GACGGAGCAT 7 620 

GAGGTAGTAG TAGAGGTTGA AATTTACTCC AAAAACCAGA ACTCCCATAC TGACCAGATA 7680 

GCTAATCAGT GAGCTGCCAT AGTGGGCAAT TCCGTCGTTA TAGACGGTAA AGCCTCCACT 7740 

TCCCGCTGTC CCCATAGCAA TAACAAAACT ATCGTAGAGA GGCATACCGG CTAGATAATA 7800 

GATGATGACA AAGAGGGAGA AGAGAGCTAG ATAAAGGAGA TAGAGAATCT GGGCAGTGTT 7860 

TTTTAGTTTC GATACAACCT TGCCAAAAAC AGGACCTGGA ACCTCAGCCT TCATCACCTC 7920 

TAGGTGGCTA TTTTTGGCAT TGTCCATAAT AGCAAGTGCA AAAACAACCA CTCCCATCCC 7980 

TCCAATCAAG TGGGTAAAAC TTCGCCAGAA GAGGAGGGAA CGGCTGAGAA CCGAAACGTC 8040 

GTTCAAAATA CTTGCTCCAG TAGTTGTAAA TCCAGAACTA ATTTCAAAAA AGGCATCAAT 8100 

AAGGCTGGGG ATTTCCCCAG AAAAGACAAA GGGGAGACCA CCAAACAAAG ACCAAAGGAT 8160 

CCAACAGAGG GCAACGATCA AGACTCCCTC CTTGGCATAA ATCCGTTGAT TTTTTGGCTT 8220 

CTGTAAACTC CCTGAACCGC CTAACAATAC GAGAATCCCT ATCGTCCAAA AGAGGGCTGT 8280 

AAACACTTGG CTCGATTCAC GGTAATAGAC AGCAATCGCA ACAGGAACCA AAAGAAGAAC 8340 

AGCTTCAATC AAAAGTAATT TTGAAAGGAG GTAACGAATC ATACTTTTAT TCATTTCTTA 8400 

CCTCGCGATC AAGTCATAAA TCTTGGTGAT GTTTGGCAAC AAGGTTGTTA CTAGGAGCTT 8460 

GTCTCCAACT TCCAACATAT CCTCCCCAGT TGGGAAAATA CTCTTGCCCT TTCGAATAAT 8520 

GGCTGCAATA AGAACCCCTT TTTTCAATTT CAGTTGAGAA AGAGGTTTGG CAGTCATTTT 8580 

ATTGGCTTCC TTGATATGGA ATTGCAGGGT TTCGATTTGG CCATTGGCTA GATGGTGCAT 8640 

AGCTTGAAGG TCTGAATACT GGGCATTAAC TCGACCACGA ATAAAGTGCA TAATCGTATC 8700 
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TACAGCGATG CTTTTAGGTG TGATGATACT TGAAAAATCA GGCGC ATTG A TAATCTCGAG 8760 

GAGACTGGTA CGATTGACCT TAGTAATATT TTTCTGTACA CCTACCCTGT CAAGGAACAT 8820 

AGATGTAATC AGATTTTCCT CATCGACTCC TGTTAGAGTC GCAACGGCAT CATAGTGTTG 8880 

AGCACTTTCT TCCAGCAGGA TATCTTTTGC GGTTCCATCT CCTTGAACGA TGTAGAGATT 8940 

TGGGAATTTC TCGCTAAAGA AGCTGGCGAT TTCAGGATTG ATTTCAATGA CTTTTGTATC 9000 

GATACGACTA TCTTTGAGAA TACCAAGTAG ATAATAGGCA ATTCTACCTG CCCCAACGAT 9060 

GAGAAGGCTC TTCACGGCGC GTGATTTAAA ATAATTATGG AAGAGTATCA TATCGACACG 9120 

GTTACCAGTG ACAAAGATTC TATCTTTATC CTGTACAGTC ATGTCACCGC TTGGAATGAT 9180 

AATTTGATGA TCCCTCTCTA TCGCACAGAC AATGACATTA CCAAATTTTT TACGAAAATC 9240 

AGAAATCGGC ATTTGGCAAA GACCGCTGGT GGACTTGACG ACAAATTCCA TCAGGCTAAC 9300 

GCGTCCACCA GCAAAGCGTT CGACAGACAG GGCGTTGGGG AAGTCAATGA TATTCGCGAT 9360 

ACCGCGGGCA GCCAAGAGCT CAGGATTAAC GATAAGAGAA AAACCGAGAA TATTCTTTTC 9420 

CTTGAAATAA GAGTTAGAAT ATTCAGGGTT CCGCACCCGA ACGATAGTTT CTTTAGCTCC 9480 

CATTTTCTTG GCTAGAACTG CTGCAATCAT CTTGACTTCA TCGTGCTCAG TCAGGGCGAT 9540 

AAAGATATCA CAATCTTGGA CGCTGGCTTG CTCAAGAATG GCAAAATCGG CCCCGTTACC 9600 

AAGGATACCA ATGATATCAA AGCGACTGAC AATATGATTG AGAACAGCTT CCTCTTGCTC 9660 

AATCAGCAAA ACATCATGCT TTTCTGCAAC CAAGGAGCGA CAGAGGGCAA AACCAACTTT 9720 

TCCCCCTCCG ACAAGGATAA TTTTCATAAT AAAACCTACT TTTTCATGAT GTAACTATCA 9780 

TACCCTTTTT CAAGAAAAAA TGCACCTACT AGCTAATAAC AAGAGTTTTT AGTGAAAATT 9840 

CGCTATAAGG TAAAACTATA CCCTAACCAA TTGAAATAGC TATTAGCGAC TTTCTCTGAA 9900 

ATATGGTATG ATAAAGGATA TACAAGGAGA TAAAATGAAT AATAATTTAC TGGTATTACA 9960 

ATCACACTTT GGTCTGGTTG ATGGTGCGGT ATCGGCTATG ATTGGAGTGG CTTTAGAAGA 10020 

GTCTCCAACC TTAAAAATAC ATCACTTGAC GCACGATATC ACCCCTTATA ATATTTTTGA 10080 

GGGGAGCTAT CGTCTCTTTC AGACGGTGGA TTACTGGCCT GAGGGAACGA CGTTTGTATC 10140 

GGTTGTCGAT CCAGCTGTCG GTTCGAAACG TAAGAGTGTA GTTGCCAAGA CTGCAAAAAA 10200 

TCAATACATT GTCACGCCAG ATAATGGGAC GCTTTCCTTT ATCAAGAAAC ACGTTGGCAT 10260 

TGTAGCCATT CGTGAGATTT CTGAGGTGGC CAATAGGCGT CAAAACACAG AGCATTCTTA 10320 

TACCTTCCAC GGTCGTGATG TCTATGCCTA TACTGGTCCT AAACTGGCCA GTGGTCACAT 10380 

TACTTTTGAG GAAGTAGGGC CAGAGCTCAG TGTGGAACAG ATTGTAGAGC TTCCAGTCGT 10440 
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AGCGACCATC ATAGAAGATC ATCTGGTGAA GGGAGCCATT GATATTCTCG ATGTGCGTTT 10500 

CGGTTCGCTT TGGACCTCTA TCACACGGGA AGAATTTTAC AAGCTGGAAC CAGAATTTGG 10560 

TGATCGTTTT GAAGTGACCA TCTATCATGC TGATATGCTG GTCTATCAAA ATCAGGTTGT 10620 

CTATGGCAAA TCATTTGCAG ATGTGAGAAT TGGGCAACCs ATcTTTACrc TCACCaTCTt 10680 

CGATTAGCTG GGCAATTCGT TCTAGTTGGA TTTCGTCAAT CAAGGT 10726 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7163 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

TTATCTTTAA CGATATCAAT CAAGATCTGG TCAATAAAGG GATTGGGGCT TATCGTGAAG 60 

TTGGCATCCA AGCCCATGGA TATCTCTGTG ACGTGACAGA CGAGGACGGT ATCCAAGCCA 120 

TGGTCAAGCA AATCGAACAA GAGGTTGGTC TCATTGACAT CCTCGTTAAT AACGCTGGTA 180 

TTATCCGCCG AGTTCCAATG TGCGAAATGA GCGCCGCTGA TTTCCGTAAG GTCATCGATA 240 

TTGACTTAAA CGCACCATTT ATCGTTTCAA AGGCAGTTAT TCCTTCTATG ATAAAGAAAG 300 

GGCATGGAAA GATTATCAAT ATTTGTTCGA TGATGAGCGA ACTGGGACGT GAAACAGTTA 3 60 

GCGCTTATGC TGCTGCTAAA GGGGGCTTGA AAATGTTGAC CCGCAACATT GCGTCTGAAT 420 

ACGCTGGAGC CAATATCCAA TGTAACGGAA TTGGACCGGG TTATATTGCC ACTCCTCAAA 4 80 

CAGCACCTCT TCGTGAATTG CAAGAAGATG GTTCTCGCCA CCCATTTGAC CAGTTCATCA 540 

7TGCAAAAAC ACCTGCTGCA CGTTGGGGAA ATACTGAAGA TTTGATCGCC CCTCCTGTC7 600 

TTCTCGCTAG TGATGCCAGC AATTTTGTCA ATGGCCACAT CCTATATGTA GATGGCGGTA 660 

TCTTAGCCTA CATCGGAAAA CAACCTGAGT AAAAATAGAA AGAAGATCTT ATGAAAATCG 720 

CATTAATCAA TGAAAATAGT CAAGCTAGCA AGAATCACAT TATTTACGAT AGTCTAAAAG 780 

AAGCGACAGA TAAAAAAGGC TACCAATTAT TTAACTATGG TATGCGTGGA GAAGAAGGAG 840 

AAAGTCAATT AACTTATGTG CAGAACGGAC TAATGGCTGC CATCCTTTTA AATACAAACG 900 

CAGTTGACTT TGTTGTTACC GGCTGTGGTA CCCGTGTACC GCCTATGCTT GCTTTAAACA 960 

GCTTCCCTGG TGTTGTCTGT GGTCTAGCAG TGGACCCAAC TGACGCTTAC CTTTATTCTC 1020 

AAATCAATGG TGGTAACGCC TTGTCTATCC CTTATGCCAA AGGATTTGGC TGGGGGGCAG 1080 

AACTGACCCT CAAATTGATG TTTGAACGCT TATTTGCTGA AGAAATGGGC GGTGGCTACC 1140 
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CAAGAGAACG 


TGTAATCCCT 


GAACAACGCA 


ACGCTCGTAT 


CTTAAACGAG 


GTGAAACAAA 


1200 


TCACCCACAA 


TGATTTGATG 


ACCATCCTTA 


AAATAATCGA 


CCAAGACTTC 


CTCAAAGACA 


1260 


CCATCTCTGG 


CAAATACTTC 


CAAGAATACT 


TCTTTGAAAA 


CTGCCAAGAT 


GATGAAGTTG 


1320 


CTGCTTATTT 


GAAAGAAGTA 


TTAGCCAAGT 


AAAGCTATTC 


TAAACCAGAA 


AGGAACTAAT 


1380 


GGATGACGAA 


AATATTACTG 


TTTGGCGAAC 


CATTAATTCG 


AATTTCACCA 


TTAGATGCCA 


1440 


CCAGTATCGG 


CGATCATGTT 


GCCAGTTCGA 


CTTATTTTGG 


CGGATCAGAA 


ATTAACATCG 


1SOO 


CTTGTAATTT 


GCAAGCCCTG 


GGTATCTCAA 


CGAAAGTTTT 


TACCGCACTC 


CCTGCCAACG 


1560 


AGATTGGAGA 


TCGTTTTCTC 


ACATTCTTGA 


AACAGCACCA 


AATCGATACC 


AGTTCAATCT 


1620 


GTCGGCTTGG 


CGATCGAATC 


GCCCTCTACT 


ATTTGGAGAA 


CGGCTTTGGT 


TGTCGTCAAA 


1660 


GTGAAGTTTT 


CTACGATCGT 


AAGCATACGA 


GTATCAGCCA 


CATTCGGCCA 


AACATGCTAG 


1740 


ATATGGATTC 


TCTCTTTCAG 


GGGATTAGCC 


ATTTTCATTT 


TAGTGGAATC 


ACCGTAGCTA 


1800 


TCCGTCAAGA 


GGTCCCTGCG 


ATCCTTCTCC 


TACTCTTGGA 


AGAAGCCAAG 


CGCCGAGGAA 


1860 


TTGTCGTTTC 


AATGGATCTC 


AATCTGAGAA 


CAAAGATGAT 


TTCAGTCCTA 


GAAGCCAAGT 


1920 


ATGAATTTTC 


TAAGTTTGCA 


CGTTTTACTG 


ACTATTCCTT 


CGGTATTGAT 


CCTCTCATGA 


1980 


TTGATGACCA 


AAATCTAGAG 


ATGTTTCCAA 


GAGACAGTGC 


TAGCCTAGAA 


GAGGTGGAAA 


2040 


ATCGCATGCG 


ACTTTTAAAA 


GAAGCCTATG 


GTTTCAAGGC 


CATTTTCCAT 


ACCCTCCGCT 


2100 


CTAGTGATGA 


GCAAGACAAA 


AATGTCTATC 


AAGCCTATGC 


TCTAGAAGAA 


CTATTTGAAG 


2160 


AGTCTGTCCA 


ACTAAAAACT 


GCAGTCTATC 


AACGAATTGG 


TAGCCGGGAT 


CCCTTT ATA T 


2220 


CTGGTGCCCT 


TTACCAACTA 


CTCCATCATT 


CCTCCCTAAA 


AACTACCATT 


GACTTTGCAG 


2280 


TTGCGAGCGC 


AACTCTCAAA 


TGCACTCTTC 


CAGGAGACCA 


TCTCTCCACT 


TCCTCAACTA 


2340 


GTATTGAAAA 


TTTACTGGCA 


AATCCACAAG 


ATATCATTCG 


TTAGGAGAAT 


TACATGACCA 


2400 


AATCAGATAC 


GATTATTGAA 


CTAAAAAAAC 


AAAAAATTGT 


CGCTGTTATT 


CGAGGAAATA 


2460 


CAAAGGAAGA 


AGGACTACAA 


GCCTCCATTG 


CTTGTATCAA 


GGGCGGTATC 


AAAGCTATTG 


2520 


AAATCGCCTA 


TACCAATCAG 


TATGCAGGAC 


AAATCATCAA 


GGAACTTGTA 


GACTTGTATC 


2580 


AGGACGATCA 


GAGTGTTTGT 


ATCGGTGCAG 


GTACTGTGCT 


TGATGCCGTA 


ACTGCTACAG 


2640 


ATGCCATTCT 


AGCTGGAGCA 


AATTACGTTG 


TTTCTCCATC 


TTTCCATGCT 


GAAACTGCGA 


2700 


AAATGTGCAA 


TCTCTACAGC 


ACACCGTACA 


TTCCAGGCTG 


TATTACCCTC 


ACAGAGATCA 


2760 


CGACTGCACT 


TGAAGCCGGT 


AGTGAAATCA 


TCAAACTCTT 


CCCAGGTAGT 


ACTCTCAGTC 


2820 


CAGCATATAT 


CTCTGCAGTC 


AAGGCACCGA 


TCCCACAAGT 


TTCCGTAATG 


GTAACCGGAG 


2880 
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GAGTCGGCCT AAACAACATC CCTCAATGGT TCGCTGCTGG TGCAGATGCC GTTGGAATTG 2940 

GTGGCGAACT CAATAAACTC GCTTCCCAAG GCAACTTTGA CCGCATCAGC GAGATTGCCC 3000 

AACAGTATAT TACACTCAGA TAAAATCATA ACTACCCGTC TAACGGGTGG TTTATCTCAG 3060 

AGCTATAAGC CCAAATCATC AGCCAGCGCC TAAAGACGCT GGCTTTCACG TTGTTCAAGC 3120 

CTTATTGCTC TTGACTCGTC ACTTGCCTCT TTAAGAGACT TTGGTATTAC TTACCACTAT 3180 

CCCTAAAGGG ATCCTCATAT TCTTTTACAC TCAATTTATC TAGTGCTATA GTACATTGAA 3240 

ACTGGAATAG TACACCTCTG CTTCTAAAAC ATTGTTAAAA ATCGATTTGA CTGTCCTGAT 3 300 

CGATTTTGTC CTGTTCTTAT TTCATTTTAC TATATATCAT ACTTTACTCG TTCTCAAATT 3360 

TTCATACTCA TGAAGAAATC ATCCACTCGA TAATTTCTTT AATCTTGACT ATATTTCTTA 3420 

ATTGTGGCTT CATTAAGCCC TACTGGACTT ACATAATAAC CTTCCTCCCA GAAATGCCGA 3 4 80 

TTCCCAAACT TGTACTTGAG ATTGGCGTGT TTGTCAAACA TCATGAGTGC ACTTTTGCCT 3540 

TTTAAATACC CCATAAAACT TGAAACACTT AGCCTCGACG GAATACTGAC TAACATGTGT 3600 

ACATGGTCTG GCATTAAGTG ACCCTCGATC ATTTCAACAC CTTTATAACT ACACAAGCGA 3660 

TGAAATATTT CGTCTAAACT ACTTCTATAT TGATTATAGA TGACTTTTCG TCTATACTTA 372 0 

GGGGTGAACA CAATATGATA GAACACCTCC ACTTTCTGTA TGATAAACTA TGAGTCTTTT 3780 

GTGCCATATT TTTTCTCCTT TCGCTTTACA ATTGGATTGA ACACCTTTAT TGTATCGCGT 3840 

TTGGAGTTTT TTTGGTATAA CCTTCGACGC GCACCCGTAT AGCGGGTCGT TCTTTTGTCT 3900 

CGCACCTCAC GGAGCGAGAC GGACTAATAT AGTGGAGTCA AATAGGATAC GAACAAATTG 3960 

ATTAGGAAAA TCAAATGAAT TTATAGAAAT CTTTTAGCAG TTATAACGTT CTATTCTAGT 4020 

TTCAAAACGC TATAGTCACA TAATAATGAA GTAAAAAAGG ATAAGTATCA ACTT ATCCTT 4080 

TTTTAAAAGA AAAA7CCGAA GATATTTGCC CTTCTTCGGA TTT7TTCTA7 TTTCCACAGT 4140 

TTCATGTAAT TCATCTAGAT GATGAACAAA TTAGTTGTTC TTTCCTCTAC GGAATAGATA 4 200 

AAATGCCCCA AGTAGCAAGA ACCCTAGACT TGCCAAGATT GACTGACCTT CTCCTGTCTG 4 260 

AGGGAGATTC TTTTGATCCG AATCGTTCTT TTCCTCTTCA GATTTTTCCT TTTCTTTTGA 4320 

ATTCTGTACT TGTGGCTGAG CTGCTTGCTC TAGCTTTTTA AAGACTTCCT GATCTGGAGC 4 380 

TGATTCCTGG GTTTCAGGAT TATAGTAGGC AATCTTATAT TCATCCCCTT CTTTTCGAAT 4440 

GGTATAGACT CCACGTTTCA AAACTTGGAA TTGGTTGGAA ATAGTAGAGA CAGAATCATC 4 500 

ATATTTCACA ATGCCCCAAA CTCCTTGTTT AGCATCATAA ACAGACTGAA GGGTTTCGTT 4 560 

ATTTTCGATG AGGCTACTTT CTAACTCTTT TATCATTTGA TTGAAGGTGG CACGATCCAC 4 620 

4 

GTTAGGAATG AGCATATAGC CATAAGAATC TCTATTTTGC TTATGAGCCT GACTAATCGT 4680 
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AAGAAATTCA TTTTCAACTT CCTTGTCTGA CTGTCCTTCA TTGATATCCT TCCAGGCTCC 4740 
CTTTTGCAAA GCCTTACTCA TACTGATTGA ACTCTTCTTA AACAAAAAGT AACCAATATT 4 800 

CTTTTTCGAA TCGAACGATT CTAAAAAGAC ACTTTGGGTT TCAGGATAAT CCTTTTCTTG 4 860 

TTCTGTAAGG GAGGCTTCTT TATCATTGAC ATAGACTTTA TATGGATTAC CTGATTCCAG 4 920 

TTTTCTCTGG TCAATTGTAG TTGCAGCAGT ATCTGTTGAA GTGTTTTGGA TATTGCTTCC 4980 
TAAAAAGGCG ATCTTATCCT TTAGCATAAA CCAGCTCTTA TGAGCAGTCA ATGTTTGATT 5040 
CCAGTTCGTG AAATCCATGG TTGCTGTCGC ATTGGCATCA TCTAGTTTGC TCGTTCCAAC 5100 
GAAAGCAGAC GGTAAAACTT TACCTGTATC GCTATCCGCT CTCTTAGCAT CCGTCTCTGT 5160 
TGTACCAGGC ATCTTATATG GATTAACTGT TGGCCAGTAG CCATCGCTAT AGTGACTCAA 5220 

ATCGCCATTG TAAAGATAGA ACATCCCATC ACTCGTATAC CAACCACGTT TATTTTCCTT 52 80 

GTTCATGTCT TCGTAATTCA AGGTACCACT GGAAAAGAGT GACAAGCCAA ATCCAAACCC 5340 

TTTCTCTGCA TTGTACATGC CTGTTTTATC CATCTTGTTA AAGGCAGATA GGTAACTTGC 5400 

TCTTGGAACA CTTGCGACTC CTGCATCACT TAACAAGGAT TGCATCAAAC TGATATCCTT 5460 

ATAAGTCTTC AAATTCTTAA AGACATCATA ATAACTATCC GATTGAACAA TGGTCTTCAC 5520 

AAGACTCTGC AAACATTGTT TGGTTTCTCC TTCACACATA TCCGCTATTC GGTGAATCCC 5580 

TCTTAGTACT TCTACTGCGG CCACGTGCCC CTCGCTATTT GCACGACTGA TCGAGCGTCC 5640 

ACGACTCATA TCCATCAACT CTCCATTCAC CAGCAAAGGA GCAAACGATT TATCAATCCA 5700 

GTGGTACATG GTTTGCATTT TATCTTTATC GATTGGATTC TTGGTCTTTT GAATCACTGG 5760 

CAACAGTTGA GACAGGCCAT CAATCAAAAC ATTCCCATAA GCACCCGTAT AGGCAACATT 5820 

GGTGTCGTCG ATATAGGATC CATCTTGATA AAAACCTTCA CCTTGCTCTA CCAACTTGAA 5880 

CACTTGCTCA ATCGAGCGAA TGGTACAACA AATTTCTTGA TCATCCTTAC GCAGTAAACC 5940 

AGCTATTACT TTTACCCTTC CCATATCAAC TAAGTTTCCA CCTAGAGCCT TGAATGGGTT 6000 

ATCAGTCGTC TTTCGGAAAT GTTCGGGATC TGGTACAAAT TTTTCAATCA CATCTGTATA 6060 

TTTTTTAATT TCCTCATCAG AGAAGTATTC TTTCATCAGA GACAAGGTAT TGTTGATGGC 6120 

ACGAGGTGTA CCGATTTCAT AATCCCACCA GTTCCCAACA ATGCTCTTTT CACTATTCTA 6180 

GACATGTTTA TGCATCCATT CCATGGAATC CCTGACTGTT CGAACGACAG TTTCATCTTG 6240 

ATAATAACGA GAAGAAGGAT TGCTCACTTG CTTGGCCATC TCCTCCAATT TCCGATAAGT 6300 

GGCAGTCAGA TTTGCAGACG TTTTATAATT TGAAAATTTT TCCCACAAAT AGGTGCGGTC 6360 

CGCCTGACTT GAAATACTGG ATAGGCTATC AGCTACCTTT CCTTCCAATT CCTGGTTTAA 6420 
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TTTGGCCATC TGTTCATTTT TAGAATCATA GTATTGATTC CCAGCGATGA TGCCATTCCA 64 80 

GTCATCCAAA CGGTCTGTGT ATGCATCCTT AACAGAGGCC AGAATCTTCA AACGAATCTT 6540 

TTTCACTTCC TTGCCATCTT TACTGACAAT GACATTGGTT CTCCCTTCCT TAAGAGGTTC 6600 

TAAAATTCCA TTTTTGACTG AAGCAACGTC AGGATTTTCT ACCTTATAAG TATAGTCCGC 6660 

AAGAGAAAAA ACATGTTTTT TTCCAATTGG TAAATCAATC TTTTCCTCAA GCTGTTTATC 6720 

TGTTTGAGAA TCCTCAGAAA GCTGGTCTGC TACCTCTACC ACCTCAATAT CCTTAAAGGA 6780 

AACAGTCCCA GTTCCTGTTT CATAGAATAA CTCCAGCTTG ATTTTATCAA CATCTAAAGT 6840 

CGGGCTATAG TCTCCTTCAA TGGTCTGCCA GTCCTTTGTT CCTGACGTCG TTGCAGAATT 6900 

CCACAATCGC TTGTCCTTAC CACTTTCCTC AATGATACGA ACTTTGGCAA TCCCGATTTT 6960 

ATTATCTGTT TTAATCTTGA AACGCAGTTT ATACTTTTTC TT AG CTTCAA TAGGAACCAT 7 020 

ACGGTGAAGC GCTGCCCTTA ATTTCTCATG GCTTGAGATA GTGATAGCCC CATCCTTAGC 7080 

CTCAATGACT CGAGTTGAGG CATCTGCACT ATTCTTCTGG TCTACCCAAG CTGACCACCC 7140 

CCTGACCTTT GCTTCCTCTC CGG 7163 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9244 base pairs 
(BJ TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(D> TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

CGTTATAACA TACATGTAAG CGGTACCCAA AATGGTGCCA ACTCAAAATT TTTAAGGACG 60 

AAAATACATG TCTTCACATC CAATTCACGT CTTCTCAGAA ATTGGGAAAC TGAAAAAAGT 120 

TATGTTGCAC CGTCCAGGCA AGGAGTTAGA AAACTTCTTG CCGGACTATC TTGAAAGGCT 180 

TCTTTTTGAT GATATTCCTT TCTTGGAAGA TGCTCAAAAA GAACATGATG CATTTGCCCA 240 

AGCTCTTCGC GATGAAGGAA TTGAGGTTCT CTACCTAGAA CAACTCGCTG CTGAATCATT 300 

GACCTCTCCA GAAATCCGCG ATCAATTTAT CGAGGAATAC TTAGACGAAG CCAACATCCG 3 60 

TGATCGTCAA ACCAAGGTTG CTATTCCTGA ATTGCTTCAC GGCATCAAGG ACAACCAAGA 420 
ATTGGTTGAA AAAACAATGG CTGGGATTCA AAAAGTTGAA TTGCCAGAAA TTCCTGACGA , 480 

AGCTAAAGAT CTAACTGACT TAGTTGAATC AGAGTATCCA TTTGCAATTG ACCCGATGCC 540 

AAACCTCTAT TTCACTCGCG ACCCATTTGC AACAATTGGA AACGCCGTAT CGCTTAACCA 600 

CATGTTTGCA GACACTCGTA ACCGTGAAAC ACTCTACGGT AAGTATATCT TCAAATACCA 660 
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CCCAATCTAT GGCGGAAAAG TGGATTTGGT CTACAACCGT GAAGAAGATA CGCGTATCGA 720 

AGGTGGAGAC GAGTTAGTTC TTTCTAAAGA CCTCCTTGCA GTACGTATCT CTCAACGTAC 780 

AGACGCAGCT TCTATCGAAA AACTTTTCGT CAACATCTTC AAGAAAAATG TTGGCTTCAA 840 

GAAAGTTTTG GCCTTTGAAT TTGCTAACAA CCGTAAATTC ATGCACTTGG ATACTGTCTT 900 

CACTATGGTA GACTATGACA AGTTCACTAT TCACCCAGAA ATCGAAGGCG ACCTTCACGT 960 

TTACTCAGTT ACTTACGAAA ACGAAAAACT TAAAATCGTT GAAGAGAAAG GTGACTTAGC 1020 

TGAACTTCTT GCTCAAAACC TTGGTGTAGA AAAAGTTCAT TTGATTCGTT GCGGTGGTGG 1080 

CAATATCGTA GCAGCTGCGC GTGAACAATG GAACGACGGT TCTAACACTT TGACCATCGC 1140 

ACCTGGTGTG GTAGTTGTTT ATGACCGCAA TACCGTGACC AATAAGATTT TGGAAGAATA 1200 

CGGGCTTCGC TTGATTAAGA TTCGCGGAAG TGAATTCGTT CGGGGCCGTG GTGGACCTCG 1260 

TTGTATGTCT ATGCCATTTG AACGTGAAGA AGTGTAATCG CTGTTCGATA TTCGTCAATA 1320 

GAAAATGTAA AAAATAGAAA GAGGAAATAA TAAAATGACA AATTCAGTAT TCCAAGGACG 1380 

CAGCTTCTTA GCAGAAAAAG ACTTTACCCG TGCAGAGTTA GAATACCTTA TTGGTCTTTC 1440 

AGCTCACTTG AAAGATTTGA AAAAACGCAA TATTCAACAC CACTACCTTG CTGGCAAGAA 1500 

TATCGCTCTC CTATTTGAAA AAACATCTAC TCGTACTCGT GCAGCCTTTA CAACTGCGGC 1560 

TATCGACCTT GGTGCTCACC CAGAATACCT CGGAGCAAAT GATATTCAGT TGGGTAAAAA 1620 

AGAATCTACT GAAGATACTG CTAAAGTATT GGGACGTATG TTTGACGGGA TTGAATTCCG 1680 

CGGATTCACC CAACGTATGG TTGAAGAATT GGCAGAATTC TCAGGCGTTC CAGTATGGAA 1740 

CGGTCTAACT GACGAATGGC ACCCAACTCA AATGCTCGCT GACTACTTGA CTGTTCAAGA 1800 

AAACTTCGGT CGCTTGGAAG GCTTGACATT GGTATACTCT GGTGATGGAC GTAACAACGT 1860 

TGCCAACAGC TTGCTCGTAA CAGGTGCTAT CCTTGGTGTC AATGTTCACA TCTTCTCACC 1920 

AAAAGAACTC TTCCCAGAAA AACAAATCGT TGAATTCGCA GAAGGATTTG CTAAAGAAAG 1980 

TGGCGCACAT GTTCTCATCA CTGAAGATGC TGATGAAGCA GTTAAAGATG CAGACGTTCT 2040 

TTACACAGAC GTTTGGGTAT CAATGGGTGA AGAAGACAAA TTCGCAGAAC GTGTAGCTCT 2100 

TCTTAAACCT TACCAAGTCA ATATGGACTT AGTTAAAAAA GCAGGCAATG AAAACTTGAT 2160 

CTTCCTACAC TGCTTGCCAG CATTCCACGA TACTCACACT GTTTATGGTA AAGACGTTGC 2220 

TGAAAAATTT GGTGTAGAAG AAATGGAAGT AACAGACGAA GTCTTCCGCA GCAAGTACGC 2280 

TCGCCACTTC GATCAAGCAG AAAACCGTAT GCACACTATC AAAGCTGTTA TGGCTGCTAC 2340 

ACTTGGTAAC CTTTATATTC CTAAAGTATA ATTTTAGATA ATAAACCGTC TACCAACAGC 2400 
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TATGAGGGCT GCGACTAATA GCTTTAGTCC GGTCCTCTTT TATGTAATGG TAATCTATTA 2460 

TTTCTTATAA AATATGTGAA AAATCATTAA ATTGAAATCT AAACGCATTC TATTGAGTGT 2520 

GATAAAGGAG AATTTATGGC AAATCGTAAA ATTGTAGTAG CTTTGGGAGG AAATGCGATT 2580 

CTTTCTTCTG ACCCATCAGC AAAGGCTCAA CAAGAAGCTT TAGTTGAAAC AGCTAAGCAT 2640 

CTTGTAAAAT TGATTAAAAA TGGAGATGAT CTGATTATCA CTCACGGTAA TGGACCTCAA 2700 

GTTGGGAATC TCTTGCTCCA ACATTTGGCA TCAGACTCTC AAAAGAACCC TGCCTTCCCA 27 60 

CTCGACTCAC TTGTCGCTAT GACAGAAGGT AGCATCCCTT TCTGCTTGAA AAATGCTTTG 2820 

CAAAATGCTC TCTTGGATGA AGGCATCGAA AAAAATGTTG CCTCTGTTGT AACGCAAGTT 2880 

GTCGTAGATA AAAATGATCC AGCTTTTGTT AACTTGAGTA AACCAATCGG TCCTTTCTAT 2940 

TCAGAAGAAG AAGCAAAAGC AGAAGCCGAA AAAAGCGGAG CGACTTTCAA GGAAGATGCT 3000 

GGCCGTGGCT GGCGTAAGGT CGTTGCCTCA CCAAAACCTG TTGACATCAA AGAAATTGAA 3060 

ACCATCCGTA CTCTTTTAAA TAATGCTCAA GTCGTCGTAG CTCCACCTGG TGGCGGTATT 3120 

CCCGTCGTCA AAGAAAACAA TGGACATTTG ACTGGTGTCG AAGCGGTTAT TGATAAAGAC 3180 

TTCGCTTCCC AACGTTTGGC AGAATTGGTT GATGCAGACC TCTTCATCGT TTTGACAGGT 3240 

GTAGATTATG TATTTGTTAA CTACAACAAG CCAAACCAGG AAAAATTGGA ACATGTGAAT 3300 

GTTCCCCAGC TGGAAGAATA TATCAAACAA GATCAGTTTG CACCAGGTAG CATGCTTCCA 3360 

AAACTAGAAG CAGCTATCGC TTTTGTCAAT GGTCCTCCAG AAGGAAAAGC AGTTATTACT 3420 

TCCCTTGAAA ATCTAGGCGC CTTGATTGAA TCTGAAAGCG GAACAATTAT TGAAAAAGGA 3480 

TAAGTTGTTT TACTAATAAG ATGTATTCTA TTTCTAGTAT CTTTATATCA AATTAGAAAT 3540 

TATTCTTGAA AACATGTACA ATATTTCAAA AGATACTAGT TTTAGACTTT AATATGGTAA 3 600 

AACAAATATA AATAGAAAGC GTTTTCTTGA ATGTTTATTT AAGAAACTAG T7GCTTTTTT 3 660 

ACACTTTGTT AGACATCAGG AGGAAAAACA AATGAGTGAA AAAGCTAAAA AAGGGTTTAA 3720 

GATGCCTTCA TCTTACACCG TATTATTGAT AATCATTGCT ATTATGGCAG TGCTAACTTG 3780 

GTTTATCCCT GCGGGGGCCT TTATAGAAGG TATTTACCAG ACTCAGCCTC AAAATCCACA 3840 

AGGGATTTGG GATGTCCTCA TGGCACCGAT TCGGGCTATG CTAGGTACTC ATCCAGAGGA 39Q0 

AGGTTCGCTC ATTAAAGAAA CGAGCGCAGC GATTGATGTA CCCTTCTTCA TCCTTATGGT 3960 

TGGTGGTTTC CTTGGCATTG TCAACAAAAC TGGTGCTCTT GACGTAGGGA TTGCCTCTAT 4020 

CGTGAAGAAG TATAAGGGCC GCGAAAAAAT GTTAATTTTG GTACTGATGC CTTTCTTTGC 4080 

CCTCGGTGGT ACAACTTATG GTATGGGTGA AGAAACAATG GCCTTCTATC CACTCCTTGT 4140 

GCCAGTTATG ATGGCCGTTG GTTTTGATAG CCTGACTGGT GTTGCAATTA TTTTGCTCGG 4200 
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TTCTCAAATC GGCTGTTTGG CATCTACTCT GAATCCATTT GCGACAGGTA TTGCTTCAGC 4260 

GACTGCGGGA GTTGGTACAG GGGACGGTAT CGTACTTCGT CTGATCTTCT GGGTTACCTT 4320 

GACTGCTCTT ACTACTTGGT TTGTTTACCG TTATGCGGAT AAGATTCAAA AAGATCCGAC 4380 

TAAGTCACTG GTTTATAGTA CTCGCAAAGA AGATTTGAAA CACTTTAACG TAGAAGAATC 4440 

TTCATCTGTA GAATCTACAC TTAGCAGCAA ACAAAAATCA GTTCTCTTCT TATTTGTGTT 4500 

GACATTCATC TTGATGGTAT TGAGCTTCAT TCCATGGACA GACCTTGGCG TTACCATTTT 4560 

TGATGACTTT AATACTTGGT TGACTGGTCT TCCAGTTATT CCTAATATTC TCGGTTCATC 4 620 

TACTTCTGCA CTAGGTACTT GGTACTTCCC AGAAGGCGCA ATGCTCTTTG CCTTTATGGG 4680 

TATCCTGATT GGTGTTATTT ATGGTCTTAA AGAAGATAAG ATTATCTCTT CCTTCATGAA 4740 

TGGTGCTGCT GACTTGCTCA GTGTTGCCTT GATCGTAGCG ATTGCTCGTG GTATTCAAGT 4 800 

TATCATGAAC GACGCTATCA TTACCGATAC AATCCTCAAC TGGGGTAAAG AAGGCTTGAG 4860 

CGGTCTATCT TCACAAGTCT TTATCGTTGT AACTTATATC TTCTATCTAC CTATGTCATT 4920 

CTTGATCCCA TCTTCATCTG GTCTTGCCAG CGCAACTATG GGTATCATGG CTCCACTTGG 4980 

AGAATTTGTA AATGTCCGTC CTAGCTTGAT TATCACTGCT TACCAATCTG CTTCAGGTGT 5040 

CTTGAACTTG ATTGCACCAA CATCTGGTAT TGTGATGGGA GCTCTTGCAC TTGGACGTAT 5100 

CAACATTGGT ACTTGGTGGA AATTCATGGG CAAACTCGTA GTCGCTATTA TTGTAGTCAC 5160 

CATCGCCCTT CTTCTCCTTG GAACCTTCCT TCCATTCCTA TAAAATAGTG AGTGAGGTGA 5220 

TTCCATGAAA ATAGATATAA CAAATCAAGT TAAAGATGAA TTTCTTATAT CATTAAAAAC 5280 

CTTGATTTCC TATCCTTCAG TACTCAATGA AGGAGAAAAT GGAACACCTT TTGGACAAGC 5340 

AATCCAAGAT GTCCTAGAAA AAACTTTAGA GATTTGTCCA GACATAGGTT TCACTACCTA 5400 

TCTTGACCCT AAAGGTTATT ACGGATATGC AGAAATCGGT CAGGGAGCAG AGCTTCTGGC 5460 

CATTCTCTGT CATTTGGATG TTGTTCCATC AGGTGATGAA GCAGATTGGC AGACACCGCC 5520 

ATTTGAAGCA ACTATCAAAG ACGGCTGGGT ATTCGGACGT GGTGTCCAAG ATGATAAAGG 5580 

CCCTTCGCTC GCAGCTCTCT ATGCAGTAAA AAGCTTGCTG GACCAAGGTA TTCAGTTCAA 5640 

AAAGCGCGTA CGCTTTATCT TTGGTACCGA TGAGGAAACC CTCTGGCGCT GCATGGCACG 5700 

CTACAATACC ATCGAAGAAC AGGCCAGTAT GGGCTTTGCA CCTGACTCAT CTTTTCCTCT 5760 

GACCTATGCT GAAAAACGGC TTCTACAGGT CAAACTTCAT GGCCCTGGAT CGGATCAACT 5820 

AGAGCTTGAA GTAGGAGGCG CCTTTAACGT TGTACCAGAC AAGGCCAACT ACCAAGGTCT 5880 

CCTCTATGAA CAGGTTTGTA ACGGTCTCAA AGAAGCTGGT TATGATTACC AAACCACTGA 5940 
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ACAAACCGTA ACGGTTCTCG GAGTGCCAAA GCATGCTAAG GATGCTAGTC AAGGTATCAA 6000 

TGCTGTCATC CGACTAGCTA CCATTCTTGC TCCTCTCCAA GAACACCCTG CTCTCAGTTT 6060 

TCTTGCAACA CAAGCAGGTC AAGACGGCAC AGGAAGACAA ATCTTTGGTG ATATAGCAGA 6120 

TGAACCTTCT GGTCACCTAT CCTTTAATGT CGCAGGTCTC ATGATCAATC ATGAACGTTC 6180 

TGAAATCCGT ATTGACATTC GGACTCCTGT CTTAGCTGAC AAGGAAGAAC TAGTAGAGTT 6240 

GCTTACAAGA TGTGCACAAA ACTACCAACT CCGCTACGAA GAGTTTGACT ATCTAGCGCC 6300 

TCTATACGTC GCAGAAGACA GTAAACTCGT TAGCACACTG ATGCAAATCT ACCAAGAAAA 6360 

GACTGGCGAT AACAGTCCTG CTATTTCATC CGGTGGTGCC ACTTTTGCTC GCACCATGCC 6420 

AAATTGTGTA GCCTTCGGCG CCTTATTCCC AGGAGCGAAG CAGACAGAAC ATCAGGCAAA 6480 

TGAATCTGCC GTTCTAGAAG ATTTGTACCG TGCTATGGAT ATTTATGCCG AAGCCGTCTA 6540 

TCGACTTGCA ACTTAATCAG GCAACTGTTT CTACCAAAAA AAATCGACCG ATTAATGAAC 6600 

TGCACCCCAA AAGTTAGACA GAATAAATCT AACTTTTGGG GTGTTTTATT ATGAAATTGA 6660 

GTTATGAAGA TAAAGTTCAG ATCTATGAAC TAAGAAAGCA AGGACAAAGC TTCAAACAGC 6720 

TTTCAAAAAG ATTTGGTGTG GATGTTTCTG GTCTAAAGTC ATCTGAATCT TTGAGATGAG 6780 

CTTTATAAAT CGCTTTTTTC AGTTTTTGCA CTGGTGTTTC GATAAACTCA AACTTTTTAG 6840 

CCGTGGTATT GCCTGATTTT ATAGTATATT GAAACTAGAA TAGTACACCT CTCCTTCTAA 6900 

AACATTTTTA GAAATCGATT TGACTGTCCT GATCGATTTG TCCTGTTCTT ATTTCATTTT 6960 

ACTATATTTG AGCCACTTCG TCTTTAACGG CTTTATTCAT AAGCTCTTGT AATTTTTCTT 7020 

TACTATCAAT TACTTCTGAT TTTCCGTTGT AATTTATTGT AATAGGTTTT AACTTACCTA 7 080 

ATTTCTCGAC ACGCTCATTA ATTTGATCTT TTTTGAAGGC TGCTTATGTT TTTCCTAAGA 7140 

TTTTTTCAAA AATATATTTA TCAGATAGCG GTTTGTCTTC TTCTTCAGCT TGGTTTTTCT 7200 

ATTAATTTGA AACATAAGGA ACAAATCCTT CATAGTAACC TAATGCTCCC ATAAGTTCAA 7260 

AAGCTTGTTT TCTAATTCAA ACCATTGCAA CTCAGATTTC AGCTTTTCAG ATAAATCCTG 7320 

CTCATCCAAA TAATGACTTG AAATTAGTGC TGAACTCGTT TCTCTATCCT GTACAGGCTG 7 380 

AGCACCCATA CCAGCAAAAA ATAAACTCGT TCCTAGCAAG ACCGAACAAG CTCCTATTGC 7440 

ATATGGCCTC AAAGAAAAAC GCTGCTTTCT CTCAAATTGA AATTCTTTCA TCCCATCTCC 7500 

CATCATTCAT TATTACTGTA TATTTTGTAT ATCAGAAATA GTTTGTATTC ACAAATCTTT 7560 

CTAGTTATTC CCTTATCATT CCTAATTAAG GGAGATAACA TACAATAATT TTTAGTTAAA 7620 

TGTATATCGA TGTTTTTTGT TTTTCTTAAT AAACGCAATA CAAAAAGAGC CTGTTACCAA 7680 

CCTCTTTGTA CTCAATGAAA ATCAAAGAGC AAATTAGGAA ACTAGCCACA GGTTGCTCAA 7740 
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AACACCGTTT 


TGAGGTTGCA 


GATAGAACTG 


ACGAAgTCAG 


CTCAAAACAC 


TGTTTTGAGG 


7800 


TTGCAGATAG 


AACTGACGAA 


GTCAGTAACA 


TCTATACGGC 


AAGGCGACGC 


TGACGTGGTT 


7860 


TGAAGAGATT 


TTCGAAGAGT 


ATTAGTCTAT 


TATTTCTTCT 


CAGCGCGAAG 


GGCTGACAAG 


7920 


ATTTGTGTTC 


GGATATCATC 


CACACCATTT 


GGAGTATTTG 


GTAAAAAGAT 


AGTTTGATTT 


7980 


CCTTTAGAGG 


CAAAGGTATT 


CAAGGTATCC 


AAATACTGGT 


TGGTCAAGAG 


GATAGACATG 


8040 


ATTTGTTCTT 


CTGTCATGCC 


AACATTGGCT 


TCCTTGAGTT 


CGGTGATAGA 


CTCTGCCAAT 


8100 


CCATCCACAA 


TCGCCTTACC 


TTGTTGGGCA 


ATCCCCACAC 


CATGAAGGCG 


GTCTTTTTCT 


8160 


GCTTCTGCTT 


CAGCTGCAGT 


GACAATTTTA 


ATCTTGTCAG 


CTTCCGCCAA 


TTCTTGTGCT 


8220 


GCGACCCGCT 


TACGTTGCGC 


CGCATTGATT 


TCATTCATGG 


ATTGCTTAAC 


TTCTGCATCT 


8280 


GGTTCGACCT 


TGGTAATCAA GGTTTTCACG 


ATAATGTAGC 


CGTAAGTGGT 


CATTTCTTCT 


8340 


GCTACTTGGT 


GTTGAACTTC 


AAGGGCAATC 


TCATCTTTTT 


TCTCAAACAA 


TTCATCCAAG 


8400 


GTTAATTTTG 


GAACAGAAGA 


GCGAAGAGCA 


TCTTCGATAT 


AAGATTTAAT 


CTGAGATTCT 


3460 


GGACGTATGA 


GTTTATAGTA 


AGCATCTGTC 


ACGCTCTGCT 


CGTTGACACG 


GTACTCAGTC 


8520 


GCTACATTCA 


TCATAACGAA 


CACATTGTCC 


TTGGTCTTAG 


TCTCAACCAC 


AATATCACTT 


8580 


TGCAACAAGC 


GCAACTGAAT 


CCGTCCTGCA 


ATCGAGTCAA 


TCCCAAAAGG 


CAAGCGAATA 


8640 


TGAATACCGC 


TATTAGCAAC 


CTTTTGGTAT 


TTCCCAAAGC 


GTTCAATAAT 


CCCCACCGAC 


8700 


TGCTGACGAA 


CCACATAAAC 


TGTACTCAGT 


GTGACTATCA 


CCAATAGGAG 


CACACAAACA 


8760 


ATCAGAAAAA 


TCATGAAAAA 


TATTGCCATA 


ATGGAACCTC 


CACAAGTATT 


TTTCTAGTAT 


6820 


TATAGCACAT 


TTAAAGAAGG 


CTGTGCCGTT 


TTTACTGCGA 


TTTTTCCTGA 


AATGTCAATA 


8880 


ATTAGAGGTG 


AATTGTCCTA 


TTGTCGTCCA 


ATCTCTTGCT 


AAAATAACTC 


TTTATAAAAG 


S940 


GCAATCGTTT 


CTTCTAAGGT 


TGGCATAAAT 


GGATTTCCTG 


GTGCGCAGGC 


ATCAATCAAG 


9000 


GCATTCTTAG 


AAAGGTATTC 


AAAGTCGAAA 


TCTTTTTCTT 


CAATACCAAG 


TTCAGTCAGT 


9060 


TTCTTAGGAA 


TACCTACTGT 


CTCAGAAAGC 


TTCTCAATCT 


CAGCAATCGC 


ATAATCGGCA 


9120 


CATTCTTGAT 


CTGATTTACC 


TTCTACATGA 


AGTCCCAAGG 


CTTTGGCAAC 


ATTGCGGAAA 


9180 


GCTTCTGGTA 


CACGTTTAGC 


ATTTTCACGT 


TCTATAACTG 


GTAGCAACAT 


GGCACAGCAC 


9240 


ACGG 












9244 



(2) INFORMATION FOR SEQ ID NO: 69: 

ti) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8898 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

GATCTGAACT TTATCATCAT AACTTAATTT CATAATAAAA ACACCCCAAA AGTTAGATTT 60 

TTTCTGTCTA ACTTTTGGGG TGTAGTTCAG TCATTGGACT GACGTTTTTT TGTATGCTTA 120 

TTTTGATTTG ATGTAGTTGA TACCATCTGC TTTTGGTGCG ACTGCTTTTC CAAAGAAGGC 180 

TGCTAAGACA AGAATTGTCA AAACATAAGG TGCAATTTGA AGATAAACCG CTGGCACTCC 240 

TTGTAGGAAC GCCAATTGAG AACCGATAAC AGCCAAACTT TGTGAAAGTC CAAAGAAGAG 300 

ACTAGAAAGC ATAGCACCGA TTGGATTCCA TTTCCCAAAG ATCATCCCAG CAAGGGCGAT 360 

AAATCCAGGT CCAACAATAG TTGTCACTGA GAAGTTAACT GAGATTGATT GCGCATAAAT 420 

CGCTCCGCCA ATTCCACCTA GAAAACCTGA AATAATAACC CCTAAATATC TCATCTTGTA 480 

GACGTTGATT CCCAAGGTAT CCGCTGCTTG AGGATGTTCA CCGACAGAGC GGAGACGAAG 540 

ACCAAATTGA GTCTTAAAGA GAATAAACCA AGCAAGGAAT GAGAAGGCAA TCGCCAGATA 600 

ACCAAGTAGA CTAGTTGACT TGAAGAAGAT ATCACCAATC ACTGGGATAT TTGCCAAGAC 660 

TGGGAAATCA AAGCGTCCAA AAGTTTGACT TAGGTTGTCG CTTTGTCCTT TGTTATAAAG 720 

AACTTTAACT AAGAAAACAG CCAAGGCAGG CGCCATCAAG TTCAATACCG TACCGCTGAC 780 

AACATGGTCT GCACGGAAAT GAACCGTCCC TGCTGCGTGG ATGATAGAGA AAACACTACC 840 

AACCAATCCT GCTACAAGCA AGGATAGCCA TGCAGTTGCT GCTCCAAATT GTTCTGCAAA 900 

TTCAAGGTTA AAGACAACTC CAGAAAAGGC ACCCATAACC ATAATTCCTT CAAGGCCAAC 960 

GTTTACCACA CCACCACGTT CAGAGAAAAC ACCACCGATA CTTGTAAACA TGAGAGGTGC 1020 

TGAGTAAATC AGCA7AGAAG AC AC CAAC AG GGGGAGCAAC GTTATAATAG ACATCTTTAC 1080 

TTACCTCCTT TAACTTGTTT TTTCGGTTTG ACAAAGCGTT CGATAAGCTA ATGAACACTG 1140 

ACAAAGAAGA TAATAGACGC TGTTACAATG CTGACAAGCT CAGATGGTAC CTGCGCCGCA 1200 

TTCATACCAG GAGCCCCAAC TTGGAGAACG CCAAATAGGA AGGCTGCAAA CAGTATACCA 1260 

ATTGGTGAGT TGGCCGCAAG CAAACTAACC GCCATTCCGT TAAATCCGAT AGCTAATGAC 1320 

GAACCTTGAA CATAGACGTT CTGGAAGGTT CCCAAACCTT CAACAGCTCC ACCAAGACCT 1380 

GCCAAGGCAC CTGAAATAAT CATAGATAGG ATAATAGTCC CCTTGGCAGA AATACCAGCA 1440 

TATTCTGAAG CATGTGGATT AAGACCAACT GCACGGATTT CAAAACCAAG AGTTGTTTTC 1500 

TTGAGCATGA ACCAAATAAC TGCAACGGCA ATGATGGCAA AGAAAATACC AATATTCATC 1560 

CGTGAGTTAC CAGTCAACTC AGCCAACCAA GGTGTCTGAT AGGTTGCATT AGCCCCAACA 1620 
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CGAATGGTCG AATCTGTACT TTGCATGAAG TCTTTAGGGA AAGCATGGAT AAAGGCATTC 1680 

CCTACATACA AGACAATGTA GTTCATCATG ATGGTTACAA TAACCTCTGA CGTCCCTAGA 1740 

TAGGCCCTAA GAATACCTGG AATCGCTCCG ACAATCCCAC CAGCAATCAA GGCAATCACG 1800 

ATGGTTGCTA GAATCATCAA GGGACGGGGC ATATCTGGAT GCGACAGGGC AAACCAACCA 1860 

CTGAGAATCC AACCTGCCAA AGCCTGACCA GGAAGTCCGA CGTTAAAGAA ACCAGCTCGA 1920 

CTGGCAACGG CAAAACCAAG ACCAATCAAG ACCAGAGGAC CCATACCACG GAAGATTTCT 1980 

CCAATCCCAC GCAGACTGCC AAAGGCTGTA TAGAACAATT CTTCGTAGCC CCAAATAGCA 2040 

TCATAACCGA AGATCCACAT GACAATGGCT CCGAGTAAAA TTCCTAGCAA TACAGAAATC 2100 

AAGGGAACCG AAATTTGTTG TAATTTTTTA GACATCACTC TTCTCCTTTC CCAAGTTTCC 2160 

ACCAGCCATC AAGACACCAA GTTCTTGTTT ATTGGTTGTT TCTGGTGATA CAATACCTTG 2220 

AATCTTACCA TCGTGGATAA CGGCAATACG GTCTGAGACG TTTAAAATCT CATCCAATTC 2280 

AAAGCTGACA ACAAGGACAG CCTTGCCATT ATCACGCTCT TCAATCAAGC GTTTGTGGAT 234 0 

ATACTCAATG GCACCGACAT CCAACCCACG AGTTGGCTGC CTAACGATAA GGAGATCAGG 2400 

ATCTCGATCA ATTTCACGAG CAATAATTGC TTTTTGTTGA TTTCCTCCTG AGAGTGCAGC 24 60 

TGCAGGAACT AATTCACTGG CAGCGCGAAC ATCAAACTCT TCCATCAGCT TTTTAGCATA 2520 

AGAAGTAATA TTTGAATAAT TCAAAATTCC ATTTTTACTA TGTGGTTCTT TATAGTAGGT 2580 

TTGAAGGGCA ATATTTTCAG ATATCATCAT TTCCAAAATC AAGCCATCAC GCTGACGGTC 2640 

TTCTGGAACG TGCCCAACAC TTAGTTCTCT AATCTGACGT GGGTGCAAGC CTACAATTCA 2700 

ATCTCCTTTT AGCTCAATGC TACCAGATTC AACCTTACGA AGACCTGTAA TCGCTTGAAT 2760 

CAGTTCAGAC TGACCATTTC CATCAATCCC CGCAATACCA ACAATCTCTC CAGCACGAAC 2820 

ATCCAAGGAC AGATTTTTAA CAGCTGGAAC ACCACGGTTT TCATTGACCA CCAAATCTTT 2880 

GATAGACAAA ACCACTTCTT TTGGTTTAGA GGCTTGCTTC TCTCTTTTAA AGGAAACAGA 2940 

ACGTCCTACC ATCATTTCCG CCAAATCAGC ATTGGTAGCC CCTGCAATTT CAACGGTTTC 3000 

AATTGATTTC CCACGACGGA TAACTGTAAC ACGGTCAGAA ACTGCTCGAA TTTCATCCAA 3060 

TTTGTGGGTA ATCAAGATAA TTGATTTTCC TTCTTTGACA AGATTTTTCA TAATAGCCAT 3120 

CAACTCATCA ATTTCTGATG GAGTCAAAAC AGCCGTTGCT TCGTCAAAGA TAAGGATATC 3180 

AGCCCCCCGA TAAAGTGTTT TTAAAATTTC TACACGTTGT TGGGCTCCAA CTGAGATATC 3240 

TGCTACCTTG GCAGAAGGGT CAACAGCTAA GCCATAACGT TCAGAAAGAG CCTTGATTTC 3300 

TTTGCTAGCT CCAGCGATAT CTAGCACACC ATTTTTAGTC AATTCACTAC CTAAAATGAT 3360 
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CTTTTCAGCC ACTGTGAAGG CTTCAACCAA CATAAAGTGC TGGTGAACCA TCCCGATTCC 3420 

CAAGCTAGCT GCTTTAGATG GGGAGTCGAG ATTGACAACT TGACCGTTGA CCGCGATTTC 3480 

ACCACTAGTT GGTTCAAGAA GCCCTGCTAA CATGTTCATT AGCGTGGACT TACCAGCCCC 3540 

ATTTTCTCCT AAAAGTGCAT GAATTTCACC TTTTCGTAGG TGCAAGTTGA TTTTGTCGTT 3600 

GGCAACAAAT CCACCAAACA CCTTGGTAAT ATCACGCATC TCAATGACAT TTTCGTGTGC 3660 

CATGTGCTCT TCCTTTCAGA GTCTTATTTT ATTTCAATAA AACTTGCTAG TTTGTCTAGT 3720 

AGCAAGCTTT ACTTAGACAA AATGACTTTG TCTCAACTCT TAAAAAAGCG GCCCTTGGCC 3780 

GCTTCCTAAG AAATGACTTC CATCCATTAT TTTTCAGGAA CTTTTACGCT TCCATCAAGG 3840 

ATTTTAGCTT TTGCATCTTC GACAGCTTTT TTACCTTCTT CTGAAAGGTT TGTTACTGCC 3900 

AAGTCAACCC CTTTATCCTT CAATGAGTAA ACGATCACTT GACCGCCAGG GAATTCTCCT 3960 

CTTTCTGCCT TGTTAGAAAT ATCTTTTACA GTTGTACCAA CTTGTTTCAA ACT AG AT AC A 4020 

AGAACAAAGT TTGATTCTTT GCCATCTTTA GAAGTGTATT TACCTTCTGC TTCTTGGTCA 4080 

CCATCAACAC CGATAACCCA AACTTTTTCA TTTTCAGGAC GGCTTTCGTT GAGAGATTTT 4140 

GCCTCTGCAA AGACACCTGC ACCTGTACCA CCAGCTACTT GGTAAACAAT ATCTGCACCG 4200 

GCTGCGTATT GTGCGGCTGC AATTGTTTTA CCTTTAGCCG CATCACCAAA TGAACCAGCG 4 260 

TAGTCAACTT GGACTTTGAT AGATGGGTCT ACTGACGCAA CACCAGCCTT GAATCCTGCT 4320 

TCAAAACGAG AGATAACTTC AGATTCGATA CCACCTACAA AACCAACTTG TTTTCTCTTA 4380 

GTTGTTTTTG CTGCAGCCAC ACCTGCAAGg TAACCTGACT CATTATCAGC GAAAGTTACG 4440 

CTCGCAACAT TCTTTTGGTC TTTAATCACA TCATCAATCA AGACATAGTT CAAGTCAGTG 4 500 

TGTTCTTTTG CTGCATCTTT AACTGCATTA TTAAGGGCAA AACCAACACC GAAGATTAGG 4 560 

TTGTAACTTC CAGCCCCTTG TTGCAAGTTG TTAGCGTAGT CACCTTCACT TGTTGATTCG 4620 

AAGTAAGTGA AACCGTTATC TTTTGAAAGA TTGTGTTCTT TACCCCAAGC CTCCAAACCT 4 680 

TCCCAAGCTG ATTGGTTGAA TGATTTGTCA TCAACACCAC CAGTATCAGT GACGATTGCT 4740 

GCTTTTGTCT TCACATCAGA AGATGAAGCT GCGTTACGAG AAGAGCGGTT ACCACATGCA 4 900 

GCAAGTCCAA CTGCTGCCAC TGCAACTAGG CCAAGACCTA GCCATTGTTT CTTGTTCATT 48 60 

ACTGAACCTC CTAAATAAGA TGTGCAACGA TGTTGCAAGT ATGGATTGGT TGGCCACAAG 4920 

GACCGTGCCA CTCAGAGAGC GACTCAGACT AGTTTAAGTC TGTAAAAGAG TATGGAAGTA 4 980 

ATTCCCCGAC CGTCATCTCG ACCGTCGATT TATCTTTTGC GACTAAGGTC ACTTTTAGAT 5040 

CTTGTTCAAA AAATTCAGCC ATCACTTGGC GACAAGCACC ACATGGCGAG ATCGGTTTTT 5100 

CAGTTTGACC ATAGACAATC AATTCTGAAA ATTCTCTTTG GCCTTCAGAT ATAGCCTTAA 5160 
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AAATAGCTGT 


TCTCTCACCG 


CAATTGGTCA 


AAGGATAGCT 


AGCATTTTCA 


ATATTCACTC 


S220 


CCGTGTAAAC 


ACTTCCGTCT 


TTAGCTACTA 


AAACTGCTCC 


GATAGGAAAG 


TGAGAATAGG 


5280 


GGACATAGGC 


ATGTTTGCTG 


GTTTCAATTG 


CCAGTTCAAT 


CAACTCAGTA 


GTCGCCATCT 


5340 


GCCAATTCTC 


CTTTTAAAAT 


AGCTACCCCA 


GCTGACGTTC 


CGATACGGGT 


CGCACCTGCT 


5400 


TCGACAAAGG 


CAAGAGCATC 


TGCATAAGAA 


CGAGCTCCAC 


CGGCGGCCTT 


GACACCCATA 


S460 


TCAGATCCAA 


CTGTTTCACG 


CATTAATGTA 


ACATCTGCTA 


TCGTAGCACC 


ACCAGTTGAA 


5520 


AAGCCAGTAG 


ATGTTTTGAC 


AAAGTCAGCC 


CCAGCTTTTT 


GGGCCAATTG 


GCAAACAACA 


5530 


ACTTTTTCTT 


GGTCTGTCAG 


AAGGCAAGCT 


TCAATAATGA 


CTTTCACTAA 


CTTATCACCA 


5640 


CTTGCTTCCA 


CTACTGCGCG 


AATATCTGAC 


TCAACCAAGG 


CTAAATTACC 


TGATTTGAGA 


5700 


GCTCCAACAT 


TGATCACCAT 


ATCAATCTCA 


TCTGCACCAT 


TTTGCATAGC 


TTCTTTTGTC 


5760 


TCAAATGCTT 


TCACGGCTGA 


AGTTGTTGCT 


CCCAAAGGGA 


AACCTACTAC 


TGTGCAAACC 


5820 


TTAACATCTG 


TGCCTTCAAG 


TCCTTTTTTA 


GCATGTTCAA 


CCCAGGTCGG 


ATTAACGCAA 


5880 


ACACTGGCAA 


AGTCATACTC 


TCTAGCCTCA 


GACAACAAAC 


TATCAATTTG 


TTTTTTCTTT 


5940 


GCATCTTGTT 


TTAAAAGCGT 


ATGATCTATA 


TATTTATTTA 


ATTTCATTTC 


GGTTTTCCCT 


6000 


CCATTTAGGA 


GATGATTTCT 


ACAATTTCAC 


GGATTTTTTT 


CACTTCATCA 


CTTATTTTAA 


6060 


CACATTTTTG 


GAAATCTGTA 


ACTAGTTGAG 


GTGGAATTTT 


TTCATTTGTG 


TATACTTTTG 


6120 


CAACAATTTC 


ACCCTTTTGA 


ACCGAGTCTC 


CAATCTTCTT 


TTCAAAAACA 


ATTCCTGTTT 


6180 


CATAGTCCAA 


GGCATCAGAC 


TTAACTGCAC 


GACCAGCACC 


CAGCCTCATG 


GCATAAAGAC 


6240 


CAAAGTCCAT 


AGCTGGAAGA 


GCTGAAATGA 


CACCCGTTTC 


CTGAGCAGGG 


ATTTCCACCA 


6300 


CATGAGCTAC 


ATTTACAGGA 


CGATAGAGGT 


CTTCCAAGTC 


TCCACCTTGG 


GCTTGCACCA 


6360 


TTTCCTCAAA 


CTTAGCCAGT 


GCTTGACCAT 


TCTCAAGATG 


TTGGTGAACT 


TCTTCAACAG 


6420 


TTTTGTTAAC 


ATTTGCCAAA 


CCAAGCATAA 


TTTGAGCCAA 


TTCACAAATA 


AAGTGGGTAA 


6480 


TATCCTGACG 


TCCTTGACCT 


TGCAAAATCT 


CCAATGCTTC 


AAGGATTTCC 


AGACGATTTC 


6540 


CAATCGCTCG 


TCCCAAAGGC 


TGGCTCATAT 


CCGTAATCAC 


TGCTACTGTC 


TTCCGTCCAA 


6600 


CAACCTTACC 


AAGATCTACC 


ATAGTTTGAG 


CCAACTCACG 


CGCCTCATCA 


ACCGTCTTCA 


6660 


TGAAGGCACC 


CTCACCGACA 


GTCACGTCTA 


GCAAAATAGC 


ATCCGCCCCT 


GCCGCAATTT 


6720 


TCTTGCTCAT 


CACCGAACTC 


GCAATCAAAG 


GAATCGTGTC 


GACAGTTGCG 


GTCACATCAC 


6780 


GAAGGGCATA 


GAGAAGCTTA 


TCTGCTTTGA 


CCAGCTGGTC 


TGATTGCCCA 


ATGACAGATA 


6840 


CTCCAATATC 


CTGAACCTGA 


CGAATAAAAT 


CCTCTTGACT 


ACGTTCTACT 


TGATAGCCCT 


6900 
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TAATGGACTC CAATTTATCA ATTCTTCCGC CTGTATGGCC AAGACCACGA CCACTCATTT 6960 

TTGCTACAGG CACACCCAAG CTAGCAACAA GAGGAGCTAA AATCAAGCTT ACCTTATCGC 7020 

CGACACCACC AGTAGAATGC TTGTCAACTT TCACACCATC AATGGCTGAC AGGTCAAACT 70 BO 

CTTGCCCAGT CTTAACCATA TTCATCGTTA AATCAGAGAT TTCTCGAGTC GTCATTCCTT 7140 

TAAAATAAAC AGCCATAGCA AAGGCAGACA TCTGATAATC AGGAACAGTT CCTGATACAT 7200 

AGCCTTCTAT CAGCCATTCA ATTTCACTTG AAGTCAGTTC TTGACCGTCT CGTTTTTTTT 72 60 

GGATTAAATC AACTGCTCTC ATTCTTTCAC ACTTCTAAGG ATATAGTATC CCTTGTCTTT 7320 

TTTAAGGATT TCACAATTGC CAAACACATC TTCCATCTTA GACTTGGCAC TTGGAGCTCC 7380 

TTGTTTTTTC TGGATGACGA TGGTCAAATC TCCACCAATT TCCAAGAAAT CTTTACTTTT 7440 

CTCGATGATT TCATGAACGA CTTGCTTGCC CGCACGGATA GCAGGATTGG AAATGACATG 7500 

GTCAAATCGC CCTTGAACTC TTGCATAAAT ATTAGATTGA AATATCGTCG CTTTTGCATT 7560 

ATTTTTTTCA GCATTTCTCT GAGCTAAATC CAGGGCACGA GTGTTAATAT CAACCATGGT 7 620 

CGCCTGAACT CCGTAAACCT TGACCAAGGA CAAACCTAAT GGACCATAAC CACAGCCTAC 7 680 

ATCTACGACT GTCTCTCCTT GGTTGACATC CAGACACTTG AGCAAGAGTT CACTTCCAAA 7740 

GTCAACCATT TTCTTGCTAA AAACACCCGC ATCTGTCAAA AAAGTCATTT TTTCTCCCAA 7800 

CAAGTCCACT CTCAACTCAT GAATGTCGTG AGCAGCGTCA GGATTTTCTG CATAGTACAT 7860 

TTTACTCATG ACACTATTTT ACCATAATTT GACTCAAATT GTAAATCCTT TACAAATTGA 7920 

TAATAAAACG AAAAAGACCG AAGAAAGCAA CTCACGAAGC CATTTTCTTC AATCTCTTTC 7980 

AACACTTATA AATAATAAAC CATTTAGAAC TATAAATATC ACAGTCCAGA TAAAAACAAA 8040 

AAGTTTATCA TCTATAATCA GGCAGATTAT TATTTCTATT GCTTAACCTT AAAATACTTT 8100 

ATTATCAACA AAATTCCTAA CAAAATGTTT AGATAAAAGC CCAACTGATA CGTTTATGTC 8160 

AGGATTTCCA AACTTGTCCA AAGTCGTATC AAATCTTCTA GTGACATGTG GAAGAAATAA 8220 

CCCTCTGTCG CAATCCGTAG GACTAAAAAG CAATAACTAC CCGCAGCAAT CCATTTCGTC 8280 

CATCGTTTTT TAGTAAGAAA GCAATTAAGA ACGAACAAAT AAAGACAGCT CTTACAATAG 8340 

CATGTTCCAT CAAAAAAGTA AAACCGTAAT AGGTTTCCAC AAAGCATCTA CCATTATCTG 8400 

CATTGGTTCC TTTTATAAAA GGTAAAGCAA AACTTAAAAT AAAACAGAGT TCCAATATGT 8460 

AACGTTTTAA GATTTTCATA GTACACCTCC TATAAGTTGT GAACTAAAAA GCCCCCTTTA 8520 

TAAGCTTATA AATCAGTAGA ATCTATCTCC TATTTCATCA ATAAATTGAT CACTTATACT 8580 

ATATACCATT GACTTACCAC ATTCAAGAAA CCGCTTTATT TTTTTAGCTT TTTATGGTAT 8640 

GATAGACAAA ATATCTAGGG GAAAACAAAT GACCAACGAA TTTTTACATT TTGAAAAAAT 8700 
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CAGCCGCCAG ACTTGGCAAT CTTTACATCG AAAGACAACA CCTCCTTTGA CAGAAGAAGA 87 60 

ATTGGAATCT ATCAAGAGTT TTAATGACCA AATCAGTCTC CAAGACGTTA CAGATATCTA 8820 

TCTCCCCTTG GCTCATTTGA TTCAGATTTA CAAGCGAACT AAGGAAGATT TAGCCTTTTC 8880 

AAAAGGAATT TTCCTCCA 8898 



(2) INFORMATION FOR SEQ ID NO: 70: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 188 base pairs 
(8) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(DJ TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

TATCTTAACG aGGATTGGGT TTATCGTCAC TCTTATTGCC CTAATTGTGG GAACAATCCC 60 

TTAAATCATT TTGAAAATAA TCCGCCTGTA GCAGATTTTT ACTGTAATCA TTGTAGTGAG 120 

GAGTTTCAAC TAAAGAGCAA AAAAGGAAAT TTTTCATCAA CAATCAATGA TGGTGCTTAT 180 

GCAACGATGA TGAAGCGTCT GCAGGCAGAT AATAATCCTA ATTTCTTTTT TTTAACTTAC 240 

ACAAAAAATT TTGAGGTAAA TAACTTTCTT GTCCTTCCGA AGCAATTTGT TACACCGAAA 300 

TCGATTATTC AAAGAAAACC ACTTGCACCA ACTGCTAGAC GAGCAGGTTG GATTGGTTGT 360 

AACATTGATT TATCACAAGT ACCTTCTAAA GGAAGGATAT TTCTTGTGCA AGATGGACAA 420 

GTTAGAGATC CAGAAAAAGT TACAAAAGAA TTTAAGCAAG CTTTATTTTT AAGGAAGAGC 4 80 

TCTCTGTCAT CAAGAGCTTC GACAATAGAA ATTCTAAATT GTATAGATAA GATAGAGGGT 540 

TCAGAATTTA CCCTTGAAGA TATGTATCGT TTTGAAAGTG ACCTAAAAAA TATCTTTGTT 600 

AAGAACAATC ATATCAAAGA AAAGATTAGG CAACAGCTTC AAATATTAAG AGACAAAGAA 660 

ATAATAGAAT TTAAAGGTAG AGGAAAGTAT CGGAAATTAT GAAAACGAAA CAACTTGT* n G 720 

CATCAGAAGA GGTGTATGAT TTCTTAAAAG TCATCTGCCC TGATTATGAA ACTGAAAGCC 7 B0 

GTTACGATAA CCTAAGTTTA ATCGTCTGTA CCTTATCAGA TCCCGATTGT GTGAGATGCT 840 

TATCTGAAAA TATGAAATTT GGTGACGAAA AACAACTAGC TTTGATGAAG GAAAAATATG 900 

GGTGGGAAGT AGGAGATAAA TTGCCAGAGT GGCTACATAG CTCCTATCAT AGATTATTGT 960 

TAATAGGTGA ATTATTGGAA AGCAATCTAA AACTGAAAAA GTATACAGTA GAAATTACAG 1020 

AAACTTTATC ACGTTTAGTA AGTATAGAGG CTGAAAATCC AGATGAAGCC GAACGACTTG 1080 

TAAGAGAAAA GTATAAGAGT TGTGAAATTG TTCTTGATGC AGATGATTTT CAGGACTATG 1140 
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ACACTAGCAT ATATGAATAG GTAGATGTTT TTATTTTGTC AACAAAAAAG AGGCTCGCAC 1200 

CTCTTTTTCT TATTTCTTTT TATGATTTAA TACGGCATTG AGGACAATAG CGAGTAGGCT 1260 

GGCTACCACG ATTCCCTTTG AGAAGAACAT TTGGAAGGCT GTCGGCATGC TGACAAAGAG 1320 

ATTACTGTTG TTGAGACCGA CACCTGCAGC GATTGAAACA GCTGCGATAA GGAAGTTGTG 1380 

TTCATTGTTA GCAAAGTCAA CACGGGCGAG GATTTGCATC CCTTGAATTG ATACAAAACC 1440 

AAACATTACC AGCATGGCAC CACCGAGGAC GGAGCTTGGA ATGATTTGGG CAAGGGCGCC 1500 

AAACTTAGGA AGCAGTCCAA GGAGAACCAG GAAACCAGCT GCGTAGTAGA TTGGCAGGCG 1560 

TTTTTTGATG CCTGACAATT TAACCAAACC AACGTTTTGT GAAAATCCGG TGTAAGGGAA 1620 

GGTGTTAAAG ATTCCTCCGA GAAGTACGGC CAAACCTTCT GCGCGGTATC CGTTGCGAAG 1680 

GCGCGTGCTG TCGATTGGAT CCTTTGTGAT ATCAGACAAG GCCAGATAAA CACCAGTTGA 1740 

CTCAACCATA GACACCGTTG CGATGATACA CATCATGACA ATAGATGAGA TTTCAAAGGT 1800 

TGGCATCCCA AAGTAGAGTG GAGTTGGGAC ATGGACAAGT GGAGCTACCG CAACAGGAGA 1860 

GAAGTCCACC AAGCCCATAG TAGCAGCAAT GGCAGTTCCA ACAACCAGAC CAATCAAAAT 1920 

AGAGATAGAC TTGATAAATC CTTTGGTAAA GATGTTGATC AAGAGCATAA TCAGAACAGT 1980 

AATAGCTGCA AGCAAGAGAC TTTGACCAGT TGGCTCTGGA ACGTTATTTC CCATATTTCC 2040 

AATAGCGACA GGGATCAAGG TTAAACCAAT CGTGGTAATA ACAGATCCTG TTACGATAGA 2100 

TGGGAAGAGA TTGGCTACTT TTGAGAAGAT GCCTGAAACA AGAACCACGT AAATCCCAGA 2160 

TGCGATAAGG GCACCAAACA TAGCGCCACT ACCATGGCTT TGCCCAATCA TAATCAAGGG 2220 

AGCGACCGAC TGGAATGCAA CTCCAACAAC GACTGGGAGT CCAATCCCAA AGTATTTGTT 22 80 

GAGTTGGAGT TGGAGGAAGG TTGCCACCCC ACACATGAAG ATATCTGTAG AAATCAGGTA 2340 

GGTCAACTGC TCAGCTGAAT AGCCAAGGGC TGTCGCAATC ATGATGGGAA CCAGGATAGA 2400 

TCCTGAGTAC ATGGCTAGTA AGTGCTGCAA GCCAAGAACG GCTGCTTGCG AGTGTTTTTC 24 60 

TTGAGTTTGC ATTAGAGATC TGCCTCCTTA AATACGACTT GACCATTTTC AAAACAATCC 2520 

AAACGAGCAA GTGATAGGAC AGGGTAGCCT GCTTTTTCAA GCAAATCACG ACCATCTTGG 2580 

AAGGATTTCT CAATCACGAT ACCGATAGCT TGCACTGTGC CACCGGCCTG TTCGATGATT 2640 

TGAATCAAGC CTTTAGCAGC TTGGCCATTA GCAAGGAAAT CGTCGATAAT CAAAACCTTG 2700 

TCCTCTGGTG AGAGGAATTT TTCAGCGATA GAAACGGTGC TGGTCACCTG CTTGGTAAAG 2760 

GAGTAGACTT GAGCAGTTAA GATGCCTTCG TTCATGGTGA TGTTCTTAGC TTTTTTGGCG 2820 

AAAATCATCG GAACGTTTAA GGCTTCAGCT GTAAAAACGG CTGGGGCAAT ACCCGACGCT 2880 

TCAATGGTTA CGACCTTGGT AATGCCAGTA GTAGCAAATT TTTCCGCAAA AACCTTACCA 2940 
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ATCTCTCCCA TCAAGCTAAA GTCAACTTGG TGGCTTAAAA AGGAATCTAC CTTGAGGATG 3000 

TTATCACCCA AGATATGCCC ATCCTTGAGG ATGCGCTCTT CTAATAATTT CATAAGACCT 3060 

CCTAAAGTCT AAAAGTTAAT TTACTTGTTG TTTAAATATT TCTATAGTGA TCCCTTTTGC 3120 

TAATACTATA TATTTGATAA AACTATTACG AGCGAAGCGA GTCTTATCAA ATATTTCCCG 3180 

TTGTAGTGGT ATCATAGACA ATAATCTTGT TATTGTCTAT GACGGGATTT TTGAGAGTAA 3240 

AATAGTTCGG GGAACTATTT TAGCCTAAGC CTAGAAATGA AAGAGCTAGG GGCTCAAAAA 3300 

TTAGGGATGA AATTCCCTGG ATTCCTGAAA TTATTCACAG GATAATTTCA CCTCCCGTCC 3360 

GCACTAATTA AGGGAAATAT TAAAAAAAGA CCTACTTAAT CTCTAAGTAA GTCCCCTAAA 3420 

TACACATGGC AAAAACGGCC ATATCTCACT GCTGACTTAC TTATTGTTAG GTGTTCCGGC 3480 

ACCTTGTAGA AACGTCGTGC CAATTCACGA CATAAACAAG TAAAACGATA TTCAATTTTA 3540 

AATAGGCTTG AGCCAATGTT TTTATTTTAC ACTAAATAAC TTTAGAAATC AACTATTTTG 3600 

TTAGTGTTTT GGTTTAAAAA ACGAACAAAA AGAAGAGAGG GTGAACAAAA ACTCCATTGT 3660 

AAGCTAACAC T TAT ACT AAA TGAAAATCAA AGAGCAAACT AGGAAGCTAT CCACAACCTC 3720 

AAAACACTGT TTTGAGGTTG TGGATAGAAT TGACAGAGCC AGTATCATAT ACCTACGGTA 3780 

AGGCGACGTT GACGTGGCTT GAAGAGATTT TCGAAGAGTA TTAGAAGATT TTTCCATCAT 3840 

AAAAGGCATA CTATCAAGCT TTTAGACACC TGACAATATG CCTTTTTCTA ACTTTAAAGA 3900 

CTTTTCCCAA TTTTTATTAT TCTACTCGCT AAATCTTAAA AAATAGCCAT CTGGATCCAA 3960 

AACTGCAAAT TTATGAGGAT AGATATAGGG ATCACTGACA CGAAACTTTC TTTTGGTCAA 4020 

GGGACGATAA ATAGGATAGT TTGCCTTCAT CACTCTTTAA TAGAGTTTTG AAACATCCTT 4080 

TATGCCAAAG GAGAGAT7GA CTCCACGACC AAAGGGATAG GTCAGTTCAG CTAGTTGATC 414 0 

CTTTGTTCCC TCCTCTAACA TTAGTTGACA CTCTTCAAGA GAAAGAGAAA GTTTTCTTCT 4200 

GGACGTTGGT ATTCAATCCT AAAACCCAGT AAACCACAGT AGAAGGACCG GGACTGTTCC 4260 

ATATTCGATA CAAGCAACTC GGGAATGACC GCATTGTAGT CCATATAGAA AATCCTTACA 4320 

AGTCAATTTC CAAGACAATC GGTGTATGGT CTTGGCGAGC ACCTGAGTCA ATCATATCAG 4380 

ATTTAGTGAC CTTGTCAGCG ATACGGTTAC TTGTGAGCCA GTAGTCGATT CTCCAGCCTG 4440 

TATTGTTGAT TTTAGAAGTT TTGCTGCGTT GTGCCCACCA AGTGTAGCGT TCAGGAACAT 4500 

CGCCATGAAC ATGGCGGAAG GTGTCTGTAA ATCCAGTTGC CAAAAGGTTG GTAAATCCAG 4560 

CACGTTCCTC GTCAGTAAAT CCAGGTGAAC GGCGGTTGCT AGCAGGATTT GCAAGGTCGA 4 620 

TTTCATTGTG GGCTACGTTG TAGTCACCGG TCGCAAGGAC TGGTTTTTCT TTGTCTAGTT 4680 
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CAGCCAAATA 


CTCAGCATAT 


TTGGCATCCC 


AGACTTGGCG 


TTCTTCCAAG 


CGTTTGAGAC 


4740 


CGTCACCAGC 


GTTTGGAGTG 


TAAACTTGGG 


TTACGAAAAA 


TGCATCAAAT 


TCTAGAGTGA 


4800 


TGATACGACC 


TTCCAAGTCC 


ATGGTAGAAG 


GGGCACCGAT 


TTCTGGGAAG 


CTGATAGTAC 


4860 


GTGTAAGTTC 


TTTCTTATAA 


AGGAACATGG 


TTCCAGCATA 


GCCTTTACGG 


GCAGGCTCTT 


4920 


GGGAAGAGCG 


CCACGTGTTT 


TCGTAGCCTG 


GGAAGAGTTC 


TTCTAAAATT 


TCCACGTGTT 


4980 


TCTTTGTAGG 


TCCTTTGGCA 


GAAAGCTTGG 


TTTCTTGGAT 


AGCAATGATA 


TCAGCATTTT 


5040 


CAGCGACCAA 


GGTTTGTAGG 


ACTTCTTGGG 


ACAATTTGGC 


ACGAGCTGAG 


TCACTAGTTA 


5100 


GGGCAGCGTT 


TAGGGAATCA 


ATATTCCATG 


AGATAAGTTT 


CATAAAGTTA 


CCTTTTTCAT 


5160 


TCAGATTATA 


GATTTTATTA 


TACCAAAAAA 


ACATCTATTT 


CCCCAACGTA 


TGGTTTGAAA 


5220 


AATTACTCTC 


TTTCGTTTAT 


AATTAAGAAT 


GATTTTATGA 


AAGGGAGTGA 


AAATACATGA 


5280 


AATTCTACTC 


TTATGACTAT 


GTACTCAGCC 


AAATCCGTCA 


GCAAAATGGT 


ATCATGGTTG 


5340 


GCTTTGGGAT 


TGTTCTATTA 


GCTGTGACAG 


TTTTTTTTGC 


TTTCAACGCA 


TACCATAATA 


5400 


AAAAGGGAAG 


CGAATTTCGT 


GAGTTGGTCA 


TGATTTCAGA 


TCTGGCCTTA 


TTTAGCTCTG 


5460 


CTTTTGGTCA 


GCATCACGAC 


TTATCAAAAC 


AATCAAGTTT 


CTAACAATAA 


ATTTCAAACT 


5520 


TCACTTCATT 


TCATCGAGCT 


TGTTTCCAAA 


GATTTGTGAG 


TAGACAAGTC 


AGAAGTCTAT 


5580 


GTTAATACTT 


CCACAAACAC 


AGATGGCGCA 


CTTATCAAGC 


TGGGAGATCG 


CTATTATCGT 


5640 


GCCCTAAATG 


GAAGTGAGCC 


AGACAAGTAC 


CTGTTAGAGA 


AAGTCGAATT 


GTATAAGACA 


5700 


GACGCAATTG 


AACTGGTGGA 


TGTGAACAAA 


TGACACTTAA 


TTATATCGAA 


ATTTTAATCA 


5760 


AACTGGTCTT 


GACTCTCAAA 


TAGCTCAACA 


ACAATGTTCA 


CTTTGTGAAA 


CGTTTGATTG 


5820 


ATGGTAAGCC 


AACTCTCCTT 


ATCAAAAATG 


GGAATATTGA 


CCCAGAAGCC 


TGTCGTTCAC 


5880 


TTGGTTTGTC 


TGCATCGGAT 


GTATCCCTCA 


AACTTCGTAG 


CCAAGGGATT 


TTCCAGATCA 


5940 


AGCAAGTCAA 


ACGAGCTGTG 


CAAGAGCAAA 


ATGGGCAACT 


CATCGTTCTG 


CAAATGGGAG 


6000 


HTV^ll & B x TV*/"* 

A I \j AAAA L 


TAAGTATCCA 


GTTGTGACTG 


ACGGTGTGAT 


TCAAGTAGAT 


GTCTTGGAAT 


6060 


CGATTGGTCG 


TAGCGAAGAG 


TGGTTGCTTG 


ATAACCTCAG 


TAAACAAGGG 


CATGACAATG 


6120 


TAGCCAATAT 


CTTTATTGCT 


GAATATGACA 


AGGGTGCTGT 


TACACTCGTA 


ACTTATGAAT 


6180 


AAGAAAAACC 


TGGGGTCTTG 


TACTCTTCGA 


AAATCTCTTC 


AAACCGCCTC 


AACGTCGCCT 


6240 


TGCCGTATCT 


AGGTTACTGA 


CTTCGTCAGT 


TCTATCTACA 


ACCTCAAAGC 


AGTGCTTTGA 


6300 


GCAGCCTGCG 


GCTAGTTTCC 


TAGTTTGCTC 


TTTGATTTTC 


ATTGAGTATT 


GGCCTCAGGT 


6360 


TTCCATTTGC 


AATCAGAAAG 


GGATTTTATG 


TCCATTATTC 


AAAAACTTTG 


GTGGTTTTTC 


6420 


AAGTTAGAAA 


AACGCCGTTA 


TCTAGTCGGA 


ATTGTGGCCC 


TGATCTTGGT 


TTCCGTCCTC 


6480 
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AATCTCATTC 


CTCCTATGGT 


TATGGGGCGG 


GTCATTGATG 


CCATCACATC 


GGGGCAATTA 


6540 


ACCCAGCACG 


ACCTCCTTCT 


TAGCCTATTT 


TACTTGCTAC 


TTGCAGCCTT 


TGGTATGTAC 


6600 


TATTTGCGCT 


ATGTGTGGCG 


TATGTATATC 


CTTGGGACCT 


CTTATTGCTT 


GGGACAGATC 


6660 


ATGCGGTCTC 


GCTTCTTTAA 


GCATTTCACA 


AAAATGTCGT 


CAGCCTTTTA 


TCAAACCTAT 


6720 


CGCACGGGTG 


ATCTGATGGC 


ACACGCAACC 


AATGATATCA 


ATGCCTTGAC 


TCGTTTAGCA 


6780 


GGTGGCGGTG 


TCATGTCTGC 


GGTGGATGCC 


TCTATCACGG 


CTCTGGTGAC 


TTTGTTGACC 


6840 


ATGCTCTTTA 


GCATCTCATG 


GCAGATGACT 


CTTGTTGCCA 


TTCTCCCCCT 


ACCTTTCATG 


6900 


GCCTATACGA 


CTAGTCGCCT 


AGGGAGAAAG 


ACTCATAAGG 


CCTTTGGCGA 


ATCCCAAGCT 


6960 


GCTTTTTCTG 


AACTCAATAA 


CAAGGTACAG 


GAGTCCGTAT 


CAGGTATCAA 


AGTGACCAAG 


7020 


TCTTTCGGTT 


ATCAGGCAGA 


CGAGTTGAAG 


TCTTTTCAGG 


CAGTCAATGA 


ATTAACCTTC 


7080 


CAAAAGAACC 


TGCAAACCAT 


GAAATATGAT 


AGTCTCTTTG 


ACCCTATGGT 


TCTCTTGTTT 


7140 


GTTGGTTCGT 


CCTATGTTTT 


AACGCTTTTG 


GTTGGCTCCT 


TGATGGTTCA 


GGAAGGGCAG 


7200 


ATTACAGTTG 


GGAATCTAGT 


CACCTTTATC 


AGCTATTTGG 


ATATGCTGGT 


CTGGCCTCTT 


7260 


CTGGCCATCG 


GTTTCCTCTT 


TAATACTACT 


CAGCCAGGGA 


AGGTTTCTTA 


CCAGCGGATT 


7320 


GAAAATCTTT 


TGTCTCAGGA 


ATCTCCTGTA 


CAAGACCCTG 


AGTTTCCTCT 


GGATGGTATT 


7380 


GAAAATGGGC 


GTTTGGAGTA 


TGCCATTGAC 


AGCTTTGCTT 


TTGAAAATGA 


GGAAACACTG 


7440 


ACGGATATTC 


ACTTTAGTTT 


GGCAAAAGGG 


CAAACACTGG 


GCTTGGTTGG 


GCACACAGGC 


7500 


TCTGGGAAAA 


CGTCCTTAAT 


CAAGCTCCTC 


TTGCGTGAAT 


ACGATGTGGA 


TAAGCGTGCC 


7560 


ATTTATCTAA 


ACGGTCACGA 


TATTCGGCAC 


TATCGTCTGA 


CAGACCTTCG 


CACTCTCATG 


7620 


GGCTATGTTC 


CTCAGGACCA 


GTTTCTTTTT 


GCCACTTCAA 


TCCTAGACAA 


TATCCGCTTT 


7680 


GGCAATCCTA 


ACTTGCCCCT 


TTCAGCGGTC 


GAGGAAGCTA 


CTAAGCTAGC 


CCGGGTTTAC 


7740 


CAAGATATTG 


TAGACATGCC 


TCAAGGATTT 


GATACGCTGA 


TTGGTGAAAA 


AGGAGTCACT 


7800 


CTTTCTGGTG 


GTCAAAAGCA 


ACGGTTGGCT 


ATGAGTCGGG 


CTATGATTTT 


AGACCCTGAT 


7860 


ATCTTGATTT 


TGGATGATTC 


CTTATCCGCC 


GTAGATGCCA 


ACACAGAGTA 


TGCGATTATC 


7920 


GACAACCTCA 


AGGAGATGCG 


AAAGGACAAG 


ACAACCATTA 


TCACTGCCCA 


TCGCCTCAGT 


7980 


GCTGTTGTCC 


ATGCAGATTT 


TATTTTAGTT 


CTACAAAATG 


GTCAAATTAT 


CGAACGAGGC 


8040 


ACGCACGAAG 


ACTTGCTAGC 


TTTGGATGGC 


TGGTATGCCC 


AAACCTACCA 


GTCTCAGCAG 


8100 


TTGGAAATGA 


AAGGAGAAGA 


AGATGCAGAA 


TAAACAAGAA 


CAATGGACTG 


TATTGAAGCG 


8160 


CTTGATGTCT 


TATCTCAAGC 


CTTATGGACT 


CCTGACCTTT 


TTGGCACTCA 


GTTTTCTCCT 


8220 
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AGCGACGACG CTCATTAAAA GTGTCATACC CCTCGTGGCT TCCCACTTTA TCGACCAGTA 8280 

TCTCAGCAAT CTTAACCAAC TAGCCGTTAC CGTTTTGCTG GTCTACTATG GTCTCTACAT 8340 

CCTACAAACT GTAGTTCAGT ATGTCGGCAA TCTTCTCTTT GCGCGCGTGT CTTACAGTAT 8400 

TGTTAGGGAT ATTCGTCGGG ATGCCTTTGC CAATATGGAG AAACTCGCCA TGTCTTACTT 8460 

TGACAAGACG CCAGCAGGTT CTATCGTTTC TCGTTTGACC AACGATACCG AGACGATTAG 8520 

TGATATGTTT TCTGGGATTT TATCCAGCTT TATCTCAGCA GTTTTTATCT TTCTGACAAC 8580 

CCTTTATACC ATGTTGGTGC TGGATTTTCG TTTGACGGCT TTAGTCTTGC TCTTTCTTCC 8640 

TTTGATTTTC CTTTTGGTCA ATCTCTATCG AAAAAAGTCA GTGAAAATCA TCGAGAAAAC 8700 

CAGAAGTCTC TTGTCAGATA TCAATAGTAA GCTGGCAGAG AATATCGAGG GAATCAGGAT 8760 

TATTCAGGCC TTTAATCAAG AGAAGCGCCT GCAGGCAGAA TTTGATGAAA TCAACCAAGA 8820 

ACACTTGGTC TACGCCAACC GTTCTGTAGC CTTGGATGCC CTCTTTTTGA GACCTGCCAT 8880 

GAGTTTCCTG AAACTTCTAG GCTATGCAGT CTTGATGGCC TACTTTGGCT ACCGTGGTTT 8940 

TTCTATCGGG ATAACCGTCG GGACCATGTA TCCCTTTATC CAGTACATCA ACCGCCTTTT 9000 

TGACCCCTTG ATTGAGGTGA CGCAAAACTT TTCAACTCTG CAAACGGCTA TGGTTTCTGC 9060 

AGGTCGTGTC TTTGCCCTGA TAGACGAGAG GACCTATGAA CCTCTTCAAG AAAATGGGCA 9120 

AGCCAAAGTC CAAGAAGGCA ATATCCGTTT TGAACATCTG TGTTTCTCAT ATGACGGTAA 9180 

ACATCCGATT CTGGATGACA TTTC TT TCTC TGTTAATAAG GGTGAAACCA TTGCCTTTGT 9240 

AGGTCATACA GGTTCAGGGA AATCGTCTAT TATCAATGTC CTCATGCGCT TTTATGAATT 9300 

CCAGTCAGGG AGAGTTCTCT TGGATGATGT CGATATCAGG GATTTCAGTC AAGAAG AG CT 9360 

GAGAAAAAAC ATCGGTTTGG TCTTGCAGGA ACCCTTCCTC TATCATGGAA CTATTAAGTC 9420 

CAATATCGCC ATGTACCAAG AAACCAGTGA TGAGCACCTT CAGGCTGCGG CAGCCTTTC7 9480 

GGATGCAGAT TCCTTTATTC AAGAACTTCC TCAGGGGTAC GACTCCCCTG TTTCCGAGCC 9540 

TGGTTCGAGC TTCTCTACTG GGCAACGCCA GCTTCTTGCC TTTGCTAGAA CAGTCGCCAG 9600 

CCAGCCTAAA ATCCTGATTT TGGATGAAGC GACAGCCAAT ATTGACTCTG AAACAGAAAG 9660 

CTTGGTTCAA GCTTCTCTGG CGAAGATGAG ACAGGGCCGA ACAACTATTG CTATCGCTCA 9720 

CCGCCTTTCT ACTATTCAAG ATGCCAACTG CATCTATGTC TTGGATAAGG GACGCATTAT 9780 

CGAGAGTGGA ACCCATGAGG AACTCTTGGC TCTGGGAGGA ACCTATCACA AGATGTATAG 9840 

TTTGCAGGCA GGGGCCATGG CCGATACTCT TTGAAAATCT CTTTAAACCA TGTCAGCTTT 9900 

ATCTGCAATC TCAAAGCTGT ACTTTGATTT TCATTGAGTA CTAGAAGGAA ATCCTTCAAA 9960 

TTACAGATTT CTTTCACCCC CTTTTCCATT TTGTCGTATA ATGAAAAATC TTGACAAATA 10020 
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GTATAATAAA AACAAAGGAG AACAGCATGC TGAAATGGGA AGACTTGCCT GTGGAAATGA 10080 

AATCAAGCGA GGTTGAGTCT TACTACCAGC TTGTCTCTAA AAGGAAGGGT TCGCTGATTT 10140 

TCAAGCGTTG CTTGGACTGG GTTTTGGCCT TGGTCTTACT GGTTCTGACC TCTCCCATCT 10200 

TTCTCATCTT GAGCATTTGG ATCAAGTTGG ATAGCAAAGG GCCAGTGATT TACAAGCAAG 10260 

AGCGTGTGAC CCAGTACAAC CGTCGGTTCA AGATTTGGAA GTTTCGTACC ATGGTGACGG 10320 

ATGCGGATAA AAAAGGAAGT CTGGTGACTT CTGCTAACGA TAGCCGCATT ACCAAGGTTG 10380 

CAAATTTCAT CCGACGTGTC CGTTTGGACG AACTGCCTCA GTTGGTCAAT GTCCTTAAAG 10440 

CTGAGATGTC CTTTGTCGGT ACACGACCTG AAGTGCCACG TTATACAGAG CAGTATAGCC 10500 

CTGAAATGAT GGCAACCTTG CTCTTGCAAG CAGGGATTAC CTCTCCAGCC AGCATCAACT 10560 

ACAAGGATGA GGACACAATT ATCAGTCAAA TGACGGAGAA AGGTCTGTCA GTTGATCAGG 10620 

CCTATGTGGA GCATGTTCTT CCTGAAAAGA TGCGCTATAA CCTCGCCTAT CTCCGAGAGT 10680 

TTACTTTCTT TGGGGACATC AAAATCATGT TTCAAACCGT GTTTGAGGTA CTAAAATAAA 10740 

GTAGTCATAA GAAAATGAGT ACAGATAAAA GGAGCAAATC AATGCCAAAT TACAATATTC 10800 

CATTTTCACC GCCTGATATC ACAGAAGCAG AAATTACTGA AGTAGTGGAT ACCCTGCGTT 10860 

CTGGTTGGAT CACAACAGGT CCTAAAACAA AAGAACTGGA GCGCCGCTTG TCTCTTTACA 10920 

CACAGACACC TAAGACTGTT TGTCTCAACT CTGCGACAGC CGCTCTGGAG TTCATTTTAC 10980 

GCGTTTTGGA AGTGGGACCT GGTGATGAAG TCATCGTTCC ACCCATGACC TATACGGCTT 11040 

CATGTAGTGT CATTACGCAC GTGGGAGCAA CCCCTGTCAT GGTGGATATC CAAGCAGATA 11100 

CGTTTGAGAT GGACTATGAC CTGCTTGAGC AACCTATCAC TGAGAAAACT AAGGTGATTA 11160 

TTCCAGTAGA GCTCGCAGGG ATTGTTTGCG ATTATGACCG TTTGTTCCAA GTCCTGGAGA 11220 

AAAAACGTGA CTTCTTTACC GCTTCAAGCA AGTGGCAAAA GGCCTTTAAC CGTATTGTCA 11280 

TTGTCTCTGA TAGTGCCCAC GCTTTGGGAT CTATTTATAA AGGACAACCT TCTGGTTCTA 11340 

TCGCTGACTT TACTTCCTTC TCATTCCATG CAGTTAAGAA CTTTACAACG GCAGAAGGTG 11400 

GAAGTGCGAC TTGGAAAGCC AATCCAGTGA TTGATGACGA AGAGATGTAC AAGGAATTCC 11460 

AAATCCTTTC CCTTCACGGG CAAACTAAGG ATGCTCTTGC CAAGATGCAA CTGGGGTCAT 11520 

GGGAATACGA TATCGTTACA CCAGCCTATA AGTGCAACAT GACCGATATC ATGGCTTCAC 11580 

TTGGTTTGGT ACAATTGGAC CGCTATCCAA GTTTGTTGCA ACGCCGTAAG GACATTGTGG 11640 

ACCGCTATGA TAGTGGTTTT GCAGGTTCTC GCATCCATCC TTTGGCACAC AAGACTGAAA 11700 

CTGTCGAATC TTCACGCCAC CTCTACATCA CCCGTGTAGA AGGAGCAAGC CTAGAAGAAC 11760 
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GCAACCTCAT 


CATCCAAGAA 


TTGGCTAAAG 


CAGGAATTGC 


AAGTAATGTT 


CACTACAAAC 


11820 


CGCTTCCTCT 


CTTGACAGCC 


TATAAGAATC 


TTGGATTTGA 


TATGACGAAC 


TATCCTAAGG 


11880 


CCTATGCCTT 


CTTTGAGAAT 


GAAATTACCC 


TCCCTCTTCA 


TACTAAATTA 


AGCGATGAAG 


11940 


AAGTAGACTA 


TATCATTGAG 


ACTTTCAAAA 


CAGTTTCTGA 


AAAAGTGCTA 


ACTTTATCAA 


12000 


AAAAATGACA 


AACTACAGTC 


AAGCGAAAGT 


GATCCTGCCC 


CTA7AAAAGTC 


TAATTGAGTG 


12060 


TAAAAACTGT 


TGTTTTCAAT 


TGATAATAGT 


TTACACCTGT 


AGTTGAGGCC 


CCTTTCTCCT 


12120 


CAGAGAGAGA 


ATTTTTATAG 


GATTTTCCTT 


TCTTGTCGGA 


GTCCCGTGGT 


TTGAAATAAG 


12180 


ATGTGAGCAA 


TTTAGTGTAG 


CATTTAGAAT 


CCTTACTAGA 


CATCATTTAG 


AAAATCTAGT 


12240 


CTCTTGTTCT 


AGTTTTCAAT 


TCACCCTATT 


TTTTCAAAGA 


CGTGAGTTTC 


CATGAGTGAG 


12300 


ATTGTGGAAA 


CTCGCGTCTT 


TTTTTGTTTT 


CACAATATTC 


TTCAAAATTT 


TGTGCCTGTC 


12360 


TTTCATGTTC 


TAGTCATTCT 


TTTGCATGAT 


AGAATTTATA 


GCATGTTGAT 


ATTATAATAA 


12420 


TACAAATATT 


CTATATGTTT 


AGTGATGCTT 


GCTATACATT 


ATTAGATCTC 


CTGCGAGACA 


12480 


ATCTATAAAA 


CACTTGTCTA 


CGATTACCTA 


TATGCCCTAT 


TCCAGTATTT 


TAGAAGCACT 


12540 


GCATCTATTT 


TTATCGAGGT 


TAAATCTAGC 


TTTTATAGAA 


GGTCTATTTA 


AGAAATATAT 


12600 


TGTAGTGTTT 


TAGTTTCAAT 


CCGCCATATG 


AGCGATATTC 


AGCTAAATAT 


CCCTGGCGAA 


12660 


TGCTTGTATG 


ACAAGGTATT 


TGTTCTTTCA 


TTTATAATTT 


ACAACATATC 


AACAAATTTA 


12720 


AATATAGTAA 


ATGGGATATT 


TTATATTCAA 


GCTAAGAAAG 


ATAGCATCAC 


TTTTGAATGG 


12780 


AAGGCTAAAG 


AGCAAACTAG 


GAAGTTGGCC 


ATAGATAGCT 


CAAAACCCTG 


CTTTGAGGTT 


12840 


GTAGATATAG 


TAAAATGAAA 


TGAGAATAGG 


ACAAATTGAT 


CGGGACAGTC 


AAATCGATTT 


12900 


CTAACAATGT 


TTTAGAAGTA 


GAGGTGTACT 


ATTTTAGTTT 


CAGTCTACTA 


TAGAACTCAC 


12960 


CAAGTCAGTA 


ACCTAGACTT 


AGGGCAAGGC 


GGCACTGACC 


TAGTTTGAAG 


AGATTTCCGA 


13020 


AGAGTATAAA 


TTTTAATATT 


TTCTTGTGTT 


ATTCCTTGAC 


AATTCAATTT 


GGAAAATATA 


13080 


TGATAAAGAT 


AATGACAGCG 


GTGTCATTCT 


ATCTATTTTA 


AGAAAAGTAA 


TAATCAATTG 


13140 


TTAAAAATAG 


TAAAAAAATT 


GGACGTTCTG 


ATGAAATATT 


TTGTTCCG 




13188 


(2) INFORMATION FOR SEQ ID NO: 71: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
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AACGAGTGCA TCAGTCTCAG CAAGCACCAC TGCGTCGGCC TCAGCAAGCA CCAGCGCGTC 60 

TGAATCCGCA TCAACCAGTG CCTCAGCTTC AGCAAGTACC TCAGCATCTG AATCAGCATC 120 

AACAAGTGCA TCGGCTTCAG CAAGCACAAG TGCTTCAGCC TCAGCAAGTA TCTCAGCGTC 180 

TGAATCGGCA TCAACCAGTG CGTCCGCTTC AGCAAGTACT AGCGCCTCAG CATCAGCGTC 240 

AACAAGTGCT TCGGCTTCAG CGTCAACGAG TGCGTCTGAG TCAGCATCAA CGAGTACGTC 300 

AGCCTCAGCA AGCACATCAG CTTCTGAATC TGCATCAACC AGTGCGTCAG CCTCAGCATC 360 

GACAAGCGCC TCAGCTTCAG CAAGTACCAG TGCGTCAGCC TCAGCAAGTA CCAGTGCTTC 420 

AGCCTCAGCG TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG CATCAACCAG 480 

TGCGTCAGCC TCAGCAAGTA CTAGCGCCTC AGCCTCAGCA TCAACGAGTG CGTCCGCTTC 540 

AGCAAGTACT AGTGCATCAG CATCAGCATC AACGAGTGCA TCGGCTTCAG CAAGTACCAG 600 

CGCCTCAGCT TCAGCAAGCA CCAGTGCGTC AGCCTCAGCA AGTACCAGCG CCTCAGCCTC 660 

AGCAAGCACC AGTGCCTCAG CTTCAGCAAG TACCAGTGCG TCAGCCTCAG CGTCGACAAG 720 

TGCGTCGGCT TCAGCAAGTA CCTCAGCGTC TGAATCAGCA TCAACGAGTG CATCAGCTTC 780 

AGCATCAACA AGTCCTTCAG CTTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG 84 0 

TGCGTCCGCT TCAGCAAGTA CTAGCGCCTC AGCATCAGCG TCAACAAGTG CTTCCGCTTC 900 

AGCGTCAACG AGTGCGTCTG AGTCAGCATC AACGAGTACG TCAGCCTCAG CAAGCACATC 960 

AGCTTCTGAA TCTCCATCAA CCAGTGCGTC AGCCTCAGCA TCGACAAGCG CCTCAGCTTC 1020 

AGCAAGTACC AGTGCGTCAG CCTCAGCAAG TACCAGTGCT TCAGCCTCAG CGTCGACAAG 1080 

TGCGTCGGCC TCAACCAGTG CATCTCAATC GGCATCAACC AGTGCGTCAg CCTCAGCAAG 1140 

TACTAGCGCC TCAGCCTCAG CATCAACGAG TGCGTCCGCT TCAGCAAGTA CTAGTGCATC 1200 

AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC AGCGCCTCAG CTTCAGCAAG 1260 

CACCAGTGCG TCAGnCTCAC CAAGTACCAG CGCCTCACCC TCAGCAAGCA CCAGTGCCTC 1320 

AGCTTCAGCA AGTACCAGTG CGTCAgCCTC AGCGTCGACA AGTGCGTCGG CTTCAGCAAG 1380 

TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT TCAGCATCAA CAAGTGCTTC 1440 

AGCTTCAGCA AGTACCAGTG CGTCCGCTTC AGCATCAACG AGTGCTTCAG TCTCAGCGTC 1500 

AACCAGTGCC TCTGAATCAG CATCAACAAG TGCCTCGGCT TCAGCAAGCA CCAGTGCGTC 1560 

GGCTTCAGCA AGTACTAGTG CATCGGCTTC AGCATCGACA AGTGCGTCTG AATCGGCATC 1620 

AACGAGTGCT TCGGCTTCAG CATCAACGAG TGCGTCAGCC TCAGCAAGCA CATCAGCTTC 1680 

TGAATCTGCA TCAACCAGTG CGTCCGCTTC AGCGTCAACC AGTGCGTCGG CTTCAGCGTC 174 0 
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GACAAGTGCT TCGGCTTCAG CATCAACGAG TGCGTCGGCC TCAGCAAGCG CAAGTACCTC 1800 

AGCGTCAGct TCCGCCTCAA CCAGTGCGTC GGCTTCAGCA AGCACAAGTG CGTCAGCCTC 1860 

AGCAAGTATC TCAGCGTCTG AATCGGCATC AACGAGTGCG TCTGAGTCAG CATCAACGAG 1920 

TACGTCAGCC TCAGCAAGCA CATCAGCTTC TGAATCTGCA TCAACCAGTG CGTCAGCCTC 1980 

AGCATCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCT TCAGCCTCAG CGTCGACAAG 2040 

TGCGTCGGCC TCAACCAGTG CATCTGAATC GGCATCAACC AGTGCGTCAG CCTCAGCAAG 2100 

TACTAGTGCA TCAGCTTCAG CATCAACGAG TGCATCGGCT TCAGCATCAA CCAGTGCCTC 2160 

GGCTTCAGCG TCAACCAGTG CGTCAGCTTC AGCAAGTACC AGTGCTTCAG TCTCAGCATC 2220 

AACAAGTGCT TCAGCCTCAG CATCGACAAG TGCCTCGGCT TCAGCAAGCA CATCAGCATC 2280 

TGAATCAGCG TCAACCAGTG CTTCGGCTTC AGCAAGTACC AGTGCTTCAG CTTCAGCATC 2340 

AACCAGCGCC TCGGCCTCAG CAAGCACCTC AGCTTCTGAA TCGGCCTCAA CCAGCGCCTC 2400 

GGCCTCAGCA AGCACCTCAG CTTCTGAATC GGCCTCAACC AGCGCCTCAG CCTCAGCATC 2460 

AACGAGTGCT TCGGCTTCAG CAAGCACAAG CGCCTCGGGT TCAGCATCAA CGAGTACGTC 2520 

AGCTTCACCG TCAACCAGTG CTTCAGCCTC AGCATCAACA AGTGCGTCAG CCTCAGCAAG 2580 

TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCTGAG TCAGCATCAA CGAGTACGTC 2640 

AGCCTCAGCA AGCACCTCAG CTTCTGAATC GGCCTCAACC AGTGCGTCAG CCTCAGCATC 2700 

GACAAGCGCC TCAGCTTCAG CAAGTACCAG TGCTTCACCC TCAGCGTCGA CAACTGCGTC 2760 

GGCCTCAACC AGTGCATCTG AATCGGCATC AACCACTCCC TCAGCCTCAG CAAGTACTAG 2820 

TGCATCGGCT TCAGCATCAA CCAGTCCCTC GGCTTCAGCG TCAACCAGTG CGTCAGCTTC 2B80 

AGCAAGTACC AGTGCTTCAG TCTCAGCATC AACAAGTGCT TCAGCCTCAG CATCGACAAG 2940 

TGCCTCGGCT TCAGCAAGCA CATCAGCATC TGAATCAGCG TCGACAAGCG CCTCAGCTTC 3000 

AGCAAGTACC AGTGCGTCAG CCTCAGCGTC GACAAGTGCG TCAGCCTCAG CAAGTACTAG 3060 

TGCATCAGCT TCAGCATCAA CGAGTGCATC GGCTTCGGCG TCAACCAGTG CATCAGAGTC 3120 

AGCAAGTACC AGTGCGTCAg CTTCCGCATC AACAAGTGCC TCGGCTTCAG CAAGCACCAG 3180 

TGCGTCGCCT TCAGCAAGTA CTAGCGCCTC AGCCTCACCC TCAACCAGTG CGTCAGCCTC 3240 

AGCAAGTATC TCAGCGTCTG AATCGGCATC AACGAGTGCG TCCGCTTCAG CAAGTACTAG 3300 

CGCCTCAGCC TCAGCGTCAA CAAGTGCATC GGCTTCAGCG TCAACGAGTG CGTCTGAATC 3360 

GGCATCAACG AGTGCGTCCG CTTCAGCAAG TACTAGCGCC TCAGCCTCAG CGTCAACAAG 3420 

TGCATCGGCT TCAGCATCAA CGAGTGCGTC CGCTTCAGCA AGTACTAGCG CCTCAGCCTC 3480 

AGCGTCAACA AGTGCATCGG CTTCAGCGTC AACGAGTGCG TCTGAGTCAG CATCAACGAG 3540 
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TGCGTCAGCC TCAGCAAGCA CATCAGCTTC TGAATCTGCA TCAACCAGTG 
AGCATCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG TCAGCCTCAG 
TGCGTCGGCT TCAGCAAGTA CCAGTGCGTC AGCCTCAGCA AGTACCAGTG 
AGCGTCGACA AGTGCGTCGG CCTCAACCAG TGCATCTGAA TCGGCATCAA 
AGCCTCAGCA AGTACTAGTG CATCAGCTTC AGCATCAACG AGTGCATCGG 
AACCAGTGCA TCAGAGTCAG CAAGTACCAG TGCGTCAGCT TCCGCATCAA 
GGCTTCAGCA AGTACTAGCG CCTCAGCCTC AGCGTCAACA AGTGCTTCAG 
AACCAGCGCC TCGGCCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA 
GGCTTCAGCA TCAACGAGTG CATCAGTCTC AGCAAGCACC AGTGCGTCGG 
CACCAGCCCG TCTGAATCCG CATCAACCAG TGCCTCAGCT TCAGCAAGTA 
TGAATCAGCA TCAACAAGTG CCTCGGCTTC AGCAAGCACA AGTGCTTCAG 
TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCCGCT TCAGCAAGTA 
AGCATCAGCG TCAACAAGTG CTTCGGCTTC AGCGTCAACG AGTCCGTCTG 
AACGAGTACG TCAGCCTCAG CAAGCACATC AGCTTCTGAA TCTGCATCAA 



AGCCTCAGCA 

TACCAGTGCT 

GGCATCAACC 

TCCGTCCCCT 

AGCGTCGACA 

TGCGTCGGCT 

AGCATCAACG 

TGCGTCGGCT 

CGCATCAACA 

TGCGTCTGAG 

TGCATCAACC 

TGCTTCAGCC 

AGCGTCAACC 

TGCGTCAGCT 

AGCATCAACG 



TCGACAAGCG CCTCAGCTTC AGCAAGTACC 
TCAGCCTCAG CGTCGACAAG TGCGTCGGCC 
AGTGCGTCAG CCTCACCAAG TACTAGCGCC 
TCAGCAAGTA CTAGTGCATC AGCTTCACCA 
AGCGCCTCAG CTTCAGCAAG TACCAGTGCG 
TCAGCAAGTA CCTCAGCGTC TGAATCAGCA 
AGTGCATCAG CTTCAGCATC AACAAGTGCT 
TCAGCATCAA CGAGTGCTTC AGTCTCAGCG 
AGTGCCTCGG CTTCAGCAAG CACCAGTGCT 
TCAGCATCAA CGAGTGCGTC AGCCTCAGCA 
AGTGCGTCAG CTTCCGCATC AACAAGCGCC 
TCAGCATCAA CCAGTGCATC AGCTTCAGCC 
AGTGCCTCGG CTTCAGCAAG TACCAGTGCG 
TCAGCATCAA CCAGTGCTTC GGCTTCGGCA 
AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 



AGTGCGTCAG 
TCAACCAGTG 
TCAGCCTCAG 
AGTACTAGCG 
TCAGCCTCAG 
TCAACAAGTG 
TCAGCTTCAG 
TCAACCAGTG 
TCGGCTTCAG 
AGCACATCAG 
TCGGCCTCAG 
TCAACAAGTG 
TCAGCTTCAG 
TCAACAAGTG 
TCAGCATCAC 



CGTCAGCCTC 
CGTCGACAAG 
CGTCAGCCTC 
CCAGTGCGTC 
CTTCAGCATC 
CAACTGCCTC 
CTTCCGCGTC 
CAACTGCCTC 
CCTCAGCAAG 
CCTCAGCATC 
CCTCAGCAAG 
CTAGCGCCTC 
AGTCAGCATC 
CCAGTGCGTC 
CCTCAGCAAG 
CATCTGAATC 
CATCAACGAG 
CCTCAGCCTC 
CGTCGACAAG 
CGTCGGCTTC 
CAAGTACCAG 
CCTCTGAATC 
CCTCAACCAG 
CTTCTGAATC 
CAAGTACAAG 
CTTCAGCCTC 
CAAGCACAAG 
CCTCAGCATC 
CATCAACCAG 



3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4S00 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 
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TGCATCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA TCAACGAGTG CATCAGCATC 5340 

AGCATCAACG AGTGCATCGG CTTCAGCGTC AACCACTGCA TCAGTCTCAG CAAGCACCAG 5400 

TGCGTCGGCT TCAGCATCAA CGAGTGCCTC AGCCTCAGCA AGTATCTCAG CGTCTGAATC 5460 

GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA TCGGCTTCAG CAAGCACCAG 5520 

TGCGTCGGCT TCAGCATCAA CCAGTGCCTC AGCCTCAGCA AGTATCTCAG CGTCTGAATC 5580 

GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA TCAGCtnTCAG CATCAACGAG 5640 

TGCATCGGCT TCAGCAAGTA CCAGCGCCTC AGCTTCAGCA AGCACCAGTG CGTCAGCCTC 5700 

AGCAAGTACC AGCGCCTCAG CCTCAGCAAG CACCAGTGCC TCAGCTTCAG CAAGTACCAG 5760 

TGCGTCAGcT CAGCATCAAC AAGTGCTTCA GCTTCGGCCT CAACAAGTGC GTCAGCTTCA 5820 

GCATCAACGA GTGCGTCGGC TTCAGCAAGC ACCAGTGCCT CCGCCTCAGC AAGCACCAGT 5880 

GCTTCAGCTT CAGCATCAAC AAGTGCCTCA GCTTCAGCAA GTACATCAGT TTCAAATTCA 5940 

GCAAACCATT CGAACTCACA AGTTGGAAAT ACTTCTGGAT CGACAGGTAA ATCCCAAAAA 6000 

GAATTGCCTA ATACAGGTAC TGACTCGTCA ATTGGATCTG TGTTACTTGG AGTTCTAGCA 6060 

GCTGTTACAG GTATTGGATT GGTTGCGAAA CGCCGTAAAC GTGATGAAGA AGAGTAAGAC 6120 

AACCTGTAAA GTTAGGCTAA ACTAACTCGC GCACATAAAT CAAGGAGAAA ATTGCTAGTG 6180 

GATGATAAAA TAACAGTCAT TCTACCAGTA TACAATGTGG AAAACTATCT GAGGAAGTGC 6240 

CTAGATAGTA TTATTACTCA AACATATAAA AATATTGAGA TTGTTGTCGT TAATGATGGT 6300 

TCTACGGATG CTTCAGGTGA AATTTGTAAA GAATTTTCAG AAATGGATCA CCGAATTCTC 6360 

TATATAGAAC AAGAAAATGC TGGTCTTTCT GCCGCACGAA ACACCGGTCT GAATAATATG 6420 

TCCGGAAATT ATGTGACCTT TGTGGACTCG GATGATTGGA TTGAGCAAGA TTATGTAGAA 6480 

ACTCTATATA AAAAAATAGT AGAGTATCAG GCTCATATTG CAGTTGGTAA TTATTATTCT 6540 

TTCAACGAAA GTGAAGGAAT GTTCTACTTT CATATATTGG GAGACTCCTA TTATGAGAAA 6600 

GTATATGATA ATGTTTCTAT CTTTGAGAAC TTGTATGAAA CTCAAGAAAT GAAGAGTTTT 6660 

GCTTTGATAT CTGCTTGGGG TAAACTCTAT AAGGCAAGAT TCTTTGAGCA GTTGCGCTTT 6720 

GACATAGGTA AATTAGGAGA AGATGGTTAC CTCAATCAAA AGGTATATTT ATTATCAGAA 6780 

AAGGTAATTT ATTTAAATAA AAGTCTTTAT GCTTATCGGA TTAGAAAAGG TAGTTTATCA 6840 

AGAGTTTGGA CAGAAAAGTG GATGCACGCT TTAGTTGATG CTATGTCTGA ACGTATTACG 6900 

CTACTAGCTA ATATGGGTTA TCCTCTAGAG AAACACTTGG CAGTTTATCG TCAGATGTTG 6960 

GAAGTCAGTC TCGCCAACGG TCAAGCTAGT GGTTTATCTG ACACAGCAAC GTATAAAGAG 7020 

TTTGAAATGA AACAAAGGCT TTTAAATCAG CTATCGAGAC AAGAGGAAAG TGAAAAGAAA 7080 
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GCCATTGTCC 
ATTTGTTATC 
TGGATTAAGC 
GTAACTTCTG 
TATTTCATAG 
GTAACGAAAA 
GTTAGAGATT 
TTGGTAAACA 
AATGAATGGC 
CATAAATGGT 
GATTATCAAT 
AAACCGTGGA 
CTTGAATGGA 
TATCCAATAA 
ATTGAGACAT 
GTTAGTGATC 
CACTATTTGG 
AATCATGGCG 
TTATCCTTTG 
GTTCAAGCAA 
GAGATGCTCT 
ATTTCTCCGT 
ATAGACCTAA 
TTTTTGGsGT 
TATATTATAG 
TCTATAGTTT 
ATTTGTTGTT 
TGGTTTTTGA 
ATTGTAGATC 



TCGCAGCAAA CTATGGCTAT 
ATAATCGTTC GATTCGTTTT 
AATTAAATAA GCGCTTAGAG 
AGCAAATTTC AT GT TAT AAA 
CTGATTTCGT GCAAGAAGAC 
ATCTGGATGA CTTGTTTGCT 
TTGGGGGCAG AGCTTATTTT 
ATGCTTTTTG GAAAAAAGAG 
ATGATAAGGT GGATCAGGCA 
TGGAATTGGA CTTTGATTAT 
TGCCTGAGGG TCAGGATTAT 
AAGATTTGGC GGCCCAAACC 
CAGAATTGGG ACAAAACCAT 
AGGAACCTTT CACTTGTCTA 
TGGTTCAATC CTTGCCTGAT 
GATTGGCTCA GATGACAATT 
TAGATGTCGA TAATGAATTG 
AAAAGACAGA AGAAATTCTC 
AAAATACTAA AACCTATGAA 
TGATTGAAAA ATTGAGAGAA 
GATTTTGACG GTTAGTGATC 
TCATCATATA TGAAACTTCT 
AGATATTTAA TATGAACTGC 
CAGTACAATA TTAGGGTGTG 
CTGTTTGATA CG AAATTTAT 
TTAATGCAGA TAATGATTAT 
ATAATAATTG TTTAAAATTT 
TGATGAATAA GCGATTGAAG 
ATGTTCTTAA AAAGTTTCAT 



579 

GTAGACCAAG TTTTAACGAC 
TATCTGATTC ATAGCGATTT 
AAGTTTGACT CAGAAATTAT 
TCGGATATTA GTTACACAGT 
AAGGCCCTCT ACTTCGACTG 
ACAGACTTAC AAGATTATCC 
GGTCAAGAAA TCTTTAATCC 
AATATGACCC AAAAATTAAT 
GATCAGAGCA TCTTGAATAT 
AATCATATTG TCATTCATAA 
CCTGCTATTA TTCACTATCT 
TATCGTGAAG TTTGGTGGTA 
CATTTACATC CATTACAAAG 
ATCTATACTG CCTCAGACCA 
ATTCAGTTTA AGATAGCAGC 
TATCCAAACG TGACTATATT 
GTAGAAACCA GTCAAGTACT 
GATCAATTTG CTAATCTTGG 
GTAGGTCACG AGGCATATCC 
ATAACCAAAT CAAGAAAAAT 
AGATTGAAGA GTTGGATTAT 
TCAAACATCA CAGTCCTTTA 
ACCCCAAAAG TTAGACAGAA 
ATTAATTATC TTTTTAGGTG 
TATAAGGAAA TTATGTTAAT 
GTAGATAAAT TAG AAACTGC 
TATGTATTTA ATGATGATAT 
ACTATACAAT CTGAAATCGT 
TTACCGTTAA AGAATTTAAG 



AATCAAGTCT 7140 

TCCAAATGAA 7 200 

TAATTGTCGC 7260 

CTTTTTACGC 7 320 

TGATCTAGTT 7380 

TTTGGCTGCT 7440 

CGCTGTTCTC 7 500 

TGATGTAACC 7560 

GCTTTTTGAA 7620 

ACAGTTTGCT 7680 

TTCTCATCGG 7740 

CTATCATGGG 7 800 

ATCTCACATC 7860 

TATTGAACAA 7920 

TAGAGTAATA 7980 

TAACGGAATT 8040 

TTTAGATATT 8100 

CAAGCCTATC 8160 

TGTTGACCAA 8220 

CATTTAGTAG 3230 

TTTTTATAAA 8340 

TAAAATATAA 8400 

AAAATCTAAC 8460 

AAAATGATTC 8520 

GAATACAAAA 3580 

AATTAAATCT 8640 

TGCGTCAGAG 8700 

TAATGTAAAG 3760 

TTATGCCACT 8820 
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TTCTTTCGTT ATTTTATACC TAATTTTCTC AAAGAAAGTC GTGCTTTATA CCTAGATTCT 
GACATCATTG TTACAGGAAG TTTAGACTAT TTATTTGATA TAGAACTAGA TGGTTATGCC 
TTCGCAGCAG TAGAAGATTC TTTTGGTGAT GTTCCTTCTA CCAATTTTAA CTCCGGAATG 
TTATTAGTTA ATGTAGATAC TTGGAGAGAT GAAGATGCTT GTTCGAAACT GTTAGAACTG 
ACCAATCAAT ATCATGAAAC AGCATATGGA GATCAAGGAA TTTTAAATAT GTTATTCCAT 
GATAGATGGA AAAGATTAGA CCGAAATTTT AATTTTATCG TGGGGATGGA TAGCGTCGCA 
CACATAGAAG GAAATCATAA ATGGTATGAG ATTTCTGAGT TGAAAAATGG AGATTTACCT 
AGTGTTATAC ATTATACTGG GGTAAAACCT TGGGAAATAA TTTCCAATAA TCGCTTTAGA 
GAAGTTTGGT GGTTTTATAA TCTGTTAGAA TGGTCTGATA TTTTATTGAG AAAAGACATT 
ATTAGTCGTA GTTTCGAAGA ACTTGTATAC AGTCCTAAAG CTCATACAGC AATTTTTACA 
GCTAGTTGTG AGATGGAGCA TCTACAATAT TTGATAGAAA ATTTACCAGA GGTACATTTT 
TCTATACTAG CACATACATA TTTTGCGTCT AGTGTCGTTG CTTTATTAAG ATATAGCAAT 
GTTACGATTT ATCCTTGTTT TTCTCCATTT GATTATCGAA AAATTTTGGA TAATTTAGAT 
TTTTATTTAG ATATTAATCA TTATAAAGAA GTGGATAATA TTGTATCCGT TGTTCAACAA 
CTATCTAAAC CAATTTTTAC CTTTGAAAAT ACTAGTCATG ATATAGGCAA TCAAACTAAT 
ATATTTTCTT CAACCGAACC AAACAAAATG GTAGAGGCTA TTAGACAATT TATAGGAGAA 
TAAGTTTATG GCAGACGAAC TAATTAGTAT TGTAGTTCCA ATCTACAACG TTGAGAATTA 
TTTGCGAATG TGTTTGGATA GCATTCAGAA TCAGACGTAT CAAAATTTTG AGTGTTTATT 
AATCAATGAT GGCTCTCCAG ATCATTCATC CAAAATATGT GAAGAATTTC TAGAGAAAGA 
TTCTCGTTTC AAATATTTTG AGAAAGCAAA CGGCGGTCTT TCATCAGCTC GTAACCTAGG 
TATTGAATGT TCGGGGGGGG GCGTACATTA CTTTTGTAGA CTCTGATGAT TGCTTGGAAC 
ATGATGCTTT AGACCGATTA TATGGTGCTT TGAAAAAGGA AAACGCAGAT ATTAGTATCG 
GGCGTTATAA TTCTTATGAT GAAACACGCT ATGTGTATAT GACTTATGTT ACGGATCCAG 
ATGATTCTCT AGAAGTGATA GAAGGTAAAG CAATTATGGA TAGGGAAGCT CTCGAAGAAG 
TCAGAAATGG GAACTGGACT GTAGCTCTCT TGAAGTTATT CAAGAGAGAG TTACTACAAG 
ATTTACCATT TCCTATAGGA AAAATTGCAG AGGATACTTA CTGGACATGG AAGGTACTTC 
TAAGAGCTTC GAGGATAGTC TATTTGAATC GTTGTGTTTA CTGGTACCGT GTTGGTTTAT 
CTGATACTTT ATCGAATACA TGGAGTGAAA AGCGTATGTA TGATGAAATT GGGGCTAGGG 
AAGAAAAGAT AGCTATTTTA GCAAGTTCAG ACTATGACTT GACCAATCAT ATTTTGATTT 
ATAAAAATAG ATTACAAAGA GTGATAGCAA AATTAGAAGA ACAAAATATG CAGTTCACAG 
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AGATTTACAG 


AAGAATGATG 


GAAAAATTGT 


CTTTACTTCC 


GTAGATAGTA 


ATAAAAAATG 


10680 


AGATAGCGTA 


ATATGAAACT 


ACATTTAACA 


AATTTATACG 


GCATGGCTGG 


TGATAGTACG 


10740 


GTTATCTTAG 


CTCAAAATGC 


TGTTCAAAAG 


ATAGCTAGTC 


AACTGGGATT 


TAGAGAGGTT 


10800 


GGTATTTATT 


TTTACAACAT 


TGCTTCAGAT 


AGTCCTTCTG 


AAATGAATAA 


GCGTCTGGAT 


10860 


GGTATTATGG 


CCAGTATCTC 


TATTGGGGAT 


ATTTTAGTCT 


TTCAGTCTCC 


AACCTGGAAT 


10920 


GGTTTTGAAT 


TTGATCGTCT 


CTTGTTTGAT 


AACCTAAAGG 


ATATGCAGGT 


GAAAATTATT 


10980 


TGCTTTATCC 


ATGATGTTGT 


TCCCCTCATG 


TTTGATAGTA 


ACTATTATCT 


CATGAAAGAT 


11040 


TATCTGTATA 


TGTATAATCT 


ATCAGATGTT 


TTGATAGTGC 


CCTCAGAGAG 


AATGAAAACA 


11100 


CGCCTGATGG 


AAGAAGGATT 


GACGACTAAG 


AAGATTCTTG 


TTCAAGGGAT 


GTGGGATCAT 


11160 


CCTCATGATT 


TATCCTTATA 


CACCCCTGCT 


TTTAAAAAAG 


AACTTTTTTT 


TGCTGGAAGT 


11220 


TTAGAGCGTT 


TTCCAGACTT 


ACAAAATTGG 


TCTCAAGATA 


CGCCTTTGAG 


AGTATTTTCA 


11280 


AATAAAGGGG 


AACCTACTTC 


TAGTGCTAGA 


AGTCTCAGCA 


TCGAAGGATG 


GAAAAAAGAT 


11340 


GAGGAATTGT 


TCCTAGAATT 


ATCAAAGGCT 


GGATTTGGCC 


TTGTCTGGGG 


AACCcATCAA 


11400 


AATGAGGGAG 


AAAGTAACCA 


ATACTATACC 


TTCAATATAT 


CTCATAAGGT 


GAGTACCTAT 


11460 


CTAACAGCCG 


GCATTCCAGT 


CATTGTACCA 


AGTAGCTTGT 


CAACTGCTAA 


ATTTATAGTA 


11520 


GATCAAGGCT 


TGGGCTTTAT 


GGCGGATAGT 


CTGGAAGAGG 


TTCATGAGAT 


AGTTGATAAA 


11580 


ATGAATCTAC 


AAGAATATCA 


AGAAATGACG 


AATCCTATCA 


AGACCTTTAG 


CTATTTGTTA 


11640 


AAAGAGGGCT 


ATTTCACTAA 


AAAGTTATTG 


GTAGATGCAA 


TCTATCACTT 


GGGAATTGAT 


11700 


TAAGGGAATG 


AAATGAACAA 


AACAATTGTA 


CTAGCAGGGG 


ATCGCAATTA 


CACCAGGCAG 


11760 


TTAGAAACAA 


CGATAAAATC 


TATTTTATAC 


CACAATCGAG 


ATGTTAAGAT 


TTATATTTTG 


11820 


AATCAAGATA 


TCATGCCAGA 


TTGGTTTCGC 


AAACCACGAA 


AAATAGCTCG 


CATGTTAGGT 


11880 


AGTGAGATTA 


TCGATGTTAA 


ACTACCTGAA 


CAAACTGTCT 


TTCAACATTG 


GGAAAAGCAA 


11940 


GATCACATTA 


GTAGCATTAC 


TTATGCTAGA 


TATTTTATTG 


CAGATTATAT 


CCAAGAAGAT 


12000 


AAGGTTTTAT 


ATTTAGACAG 


TGATTTGATT 


GTAAATACTT 


CTTTAGAGAA 


ATTATTTAGT 


12060 


ATTTGTTTAG 


AAGAAAAATC 


ACTCGCAGCA 


GTTAAAGATA 


CAGATGGAAT 


TACATTTAAT 


12120 


GCAGGTGTTT 


TATTAATCAA 


CAATAAAAAA 


TGGCGTCAAG 


AGAAATTAAA 


AGAACGACTA 


12180 


ATTGAACAGA 


GCATTGTTAC 


AATGAAGGAA 


GTTGAAGAAG 


GCCGTTTCGA 


GCATTTTAAT 


12240 


GGTGATCAAA 


CGATTTTTAA 


TCAGGTCTTG 


CAAGATGATT 


GGTTAGAACT 


AGGTCGAGCT 


12300 


TATAATTTAC 


AAGTAGGGCA 


TGATATTGTG 


GCTTTGTATA 


ACAATTGGCA 


GGAACATCTG 


12360 
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GCTTTTAATC 


ATAAACCAGT 


GGTGATTCAT 


TTTACGACCT 


ACAGAAAACC 


CTGGACTACC 


12420 


TTGACAGCCA 


ATCGTTATCG 


TGATTTATGG 


TGGGAATTCC 


ATGATTTGGA 


GTGGAGTCAG 


12480 


ATTTTACAAC 


ACCATATGGG 


AGAATTTGAA 


CTAATATCGC 


CTCTAGATAA 


GGAATTTTCT 


12540 


TGCTTAACCT 


TAACGAATTC 


CCAAGATTTA 


GAAGGAATAG 


AAGACCTAGT 


TACAGCTCTA 


12600 


CCTGAGGTGG 


TATTTCATAT 


CGCAGCTTGG 


ACGGATATGG 


GAGATAAATT 


AAAAAAATTA 


12660 


GCTCTATATA 


ATAATGTGAG 


ATTGCATCCA 


CAAATTGTTC 


CACCGGTCTT 


AGATAAGCTG 


12720 


AAAAAGTCAA 


CAAATCTATA 


TTTGGATATC 


AATCATGGTA 


CTGCAGATGA 


GAACTTTTTA 


12780 


AAATCTTTGC 


AAGAACAAGA 


AAAAACGCTA 


CTAGCTTTTC 


AATCGACTCA 


GCACGGAGAC 


12840 


TTAGGACAAA 


TCGTTTTCGA 


AAATGGGAAA 


GTTTCCTTTA 


TGATTGATAC 


GATTAAAGAT 


12900 


TTTAAGAAAA 


ACGGACATCT 


TACCTGTTTT 


CGACAACTTC 


CAAGTTTAAC 


TTGTTTAACG 


12960 


TTTACGGCTT 


CTCAGTATAT 


CGAACAATTG 


GATTACTTGG 


CTGGACAGTT 


GCCAAATGTT 


13020 


GTTTTTCAAA 


TTGCTGCTTG 


GACAGCTATG 


GGGCCAAAAT 


TATATGATTT 


GTCTAATCGT 


13080 


TATCCTAATA 


TTCAGCTCTA 


TCCGGCAATT 


TCTAGACATA 


AGCTAGACGA 


GTTGAAGGAG 


13140 


AAGATGGATG 


CTTATTTAGA 


TATCAACCTA 


CTGACTTCAA 


CATCCGATAT 


CGTTGCAGAA 


13200 


ATGGCTCATC 


TATCTAAACC 


TATACTAGCC 


TTTTATAAAT 


CTCAAAATGG 


GAATAATGGC 


13260 


CAAAGGTTGT 


ATTCAAGTGA 


ACATCCTGAA 


CGAATGTTGG 


CTGATTTGCA 


AAAATTGATA 


13320 


ACTAAGGATA 


TGCTAGAAAA 


ACCGCTTGAT 


ATAATCCAGG 


TGAAAGGGAT 


AGATGAAACC 


13380 


TTGGATTATA 


TTATTGAACA 


CAACTCTTCT 


TTAGTTCCTT 


TTGGAGATGG 


GCAAATCAAT 


13440 


ATGCTTGCAG 


GGCATTCAAT 


TCCCTACCAG 


GATTATCATC 


AAGAGTTGGT 


TTCAATCATG 


13500 


AGGGACATTA 


TCGGCCAAGA 


AAGTCGAGAA 


GATTTAGTAG 


TGTGCCTTCC 


TGATGCTTTT 


13560 


ACAGATCGT7 


TTAGGTTTAC 


ATCCTGCGCG 


ATTCCATTTT 


GGAAAGATCA 


C ATGGATC AT 


13620 


TATATGGATT 


TTTACAGAGA 


GTTATGCAGT 


GATTCATGGT 


ATGGCTCAAC 


CTTTGTATCT 


13680 


CGCCCTTATA 


TCGATTTTGA 


AGACAAGAGT 


CAAGCTAAAG 


CTCAATTTGA 


AAAATTGAAA 


13740 


AGCATTTGGG 


AAAACCGTGA 


CTTACTGATA 


GTCCAAGCTG 


CGACTTCTCG 


TTCAGGTGTC 


13800 


GGAAATGATT 


TATTCGATGA 


GGCAAATTCT 


ATTAAGCGAA 


TTATCTGTCC 


TTCTCATAGT 


13860 


GCCTTTTCTA 


GAGTTCATGA 


ACTTGAACAA 


GAAATTGAAA 


AGTATGCTGG 


TGGTCGCTTG 


13920 


ATTTTATGTA 


TGCTTGGACC 


TACAGCAAAA 


GTTCTGAGTT 


ATAATCTATG 


CCAGATGGGC 


13980 


TATCAAGTTT 


TGGATGTAGG 


CCATATTGAC 


TCAGAGTATG 


AATGGATGAA 


AATGGGAGCT 


14040 


AAAACTAAGG 


TTAAATTTTC 


TCATAAACAT 


ACTGCAGAAC 


ATAATTTCGA 


CCAAGATATT 


14100 


GAATTTATTG 


ATGATGAAAC 


CTATAACAGT 


CAGATTGTTG 


CACGAATATT 


AAACTAGACT 


14160 
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ATTTAAAATA AATGATAAGG ATTTAAAATG ACAAATACCA AACGCCCTGT ACTATTTGCA 
GGTGATTACG CTTATATTCG ACAAATCGAA ACGGCGATGA AGTCACTCTG TAGACACAAT 
AGTCATTTGA AAATTTATCT GCTAAATCAG GACATTCCTC AGGAATGGTT TAGTCAAATA 
AGAATATATT TACAAGAGAT GGGGGGCGAC TTGATTGACT GCAAGTTAAT TGGCTCACAG 



TTTCAAATGA ATTGGTCTAA TAAATTACCT CATATCAATC ATATGACATT TGCACGCTAT 
TTTATTCCAG ATTTTGTAAC AGAAGATAAA GTTCTCTATC TAGATAGTGA TTTGATTGTG 
ACTGGTGATT TGACCGATTT GTTTGAATTA GACTTAGGTG AAAATTATTT GGCAGCAGCT 
CGTTCTTGCT TTGGAGCAGG AGTCGGCTTC AATGCTGGTG TTCTCTTGAT TAACAACAAA 
AAATGGGGAT CTGAAACTAT TCGACAAAAA TTGATTGACT TAACAGAAAA AGAACATGAG 
AATGTGGAAG AAGGAGACCA GTCAATTTTG AATATCTTGT TTAAAGATCA ATATAGTTCC 
CTTGAAGATC AATATAATTT TCAAATAGGA TATGATTATG GGGCGGCAAC CTTTAAACAT 
CAATTCATTT TTGATATTCC GCTCGAACCA CTCCCACTAA TTTTACACTA TATTTCTCAG 
GATAAGCCTT GGAATCAATT TTCTGTTGGA CGTCTAAGAG AAGTTTGGTG GGAATACTCT 
TTGATGGATT GGTCTGTTAT TTTAAATGAA TGGTTTTCAA AGAGTGTGAA GTACCCTAGT 
AAATCACAAA TATTTAAGTT GCAATGTGTT AATTTAACGA ATTCTTGGTG TGTCGAGAAA 
ATCGATTATT TGGCGGAGCA ATTGCCAGAA GTTCATTTTC ATATTGTTGC TTATACAAAT 
ATGGCAAATG AACTACTAGC TTTAACGCGT TTTCCTAATG TTACCCTATA TCCAAATTCC 
TTACCAATGT TATTGGAACA AATAGTAATA GCTTCACATT TGTATTTGGA TTTGAATCAT 
GATCGAAAAT TAGAAGATGC ATATGAGTTT GTGCTTAAGT ACAAAAAACC AATGATAGCT 
TTCGACAATA CTTGCTCTGA AAATCTTTCT GAGATTTCAT ATGAAGGTAT CTATCCAACC 
TCCATTCCGA AAAAAATGGT TGCAGCAATC AGATCTTACA TGAGGTAGAG AACAGTATGA 
GAAAATCAAT ACTATTAGCG GCAGATAATG CCTATCTTAT TCCTTTAGAG ACGACTATHA 
AGTCTCTATT GTATCACAAT AGAGATGTTG ATTTTTATAT TCTCAACAGT CATATAGCTC 
CTGAATGGTT TAAATTATTG GGGAGAAAAA TGGAAGTTGT GAATTCTACA ATTCGCAGTG 
TACACATTGA TAAAGAACTT TTTGAAAGCT ATAAAACAGG ACCTCATATA AATTATGCTT 
CTTACTTTAG ATTTTTTGCG ACAGAAGTGG TTGAATCTGA TAGGGTATTG TATCTGGATT 
CCGATATCAT TCTAACTGGG GAACTAGCTA CTTTGTTTGA GATAGATCTC AAAGGATATT 
CAATTGGTGC TGTTGATGAT GTCTATGCCT ATGAAGGACG AAAATCTGGA TTTAATACTG 
GTATGTTACT AATGGATGTT GCAAAGTGGA AAGAACATTC TATTGTCAAT AGTTTATTGG 
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AATTAGCGGC 


CGAGCAGAAT 


CAAGTTGTTC 


ATCTTGGGGA 


TCAGAGTATT 


TTAAATATTT 


15960 


ATTTTGAGGA 


TAATTGGCTA 


GCCTTAGATA 


AAACATATAA 


TTATATGGTG 


GGTATTGATA 


16020 


TTTATCACCT 


TGCTCAAGAA 


TGTGAACGTC 


TAGATGACAA 


TCCACCTACA 


ATTGTTCACT 


16080 


ATGCTAGTCA 


TGATAAACCT 


TGGAATACAT 


ATAGTATATC 


TAGACTACCT 


GAATTATGG7 


16140 


GGGTTTATAG 


AGATTTGGAT 


TGGTCAGAGA 


TTGCTTTTCA 


ACGTTCCGAT 


TTAAATTATT 


16200 


TTGAAAGAAG 


CAATCAGTCT 


AAAAAACAAG 


TGATGCTTGT 


GACATGGAGT 


GCAGATATAA 


16260 


AACATTTAGA 


GTATTTAGTA 


CAACGGTTAC 


CTGATTGGCA 


TTTTCATTTG 


GCTGCACCGT 


16320 


GTGATTGTTC 


TGAGGACCTG 


ACCTCTCTAT 


CACAGTATAC 


GAATGTAACA 


GTATATCAAA 


16380 


ATGTATTACA 


TAGTAGAATT 


GATTGGCTAT 


TGGACGATTC 


TATAGTTTAT 


TTAGATATTA 


16440 


ATACAGGTGG 


AGAGGTTTTT 


AATGTAGTTA 


CAAGGGCACA 


AGAAAGTGGC 


AAGAAAATCT 


16500 


TCGCTTTTGA 


TATCACACGT 


AAAAGTATGG 


ATGATGGACT 


CTATGACGGT 


ATTTTTTCTG 


16560 


TGGAGAGACC 


AGATGATTTA 


GTGGATAGAA 


TGAAGAATAT 


AGAGATAGAG 


TAATGAGTGA 


16620 


ATTAATTAGT 


GTTGTGGTAC 


CGATATACAA 


TACGGGAAAA 


TATTTACTGG 


AGTGTGTCGA 


16680 


GCATATTCTG 


AAGCAAACCT 


ATCAAAATAT 


AGAAATTATT 


TTAGTTGATG 


ACCGTTCTAC 


16740 


GGATAATTCT 


GCGGAAATTT 


GTGATGCTTT 


TATGATGCAA 


GATAATCGTG 


TGCGAGTATT 


16800 


GCATCAAGAA 


AATAAGCGCG 


GGGCAGCACA 


AGCTAAAAAT 


ATGGGGATTA 


GTGTAGCTAA 


16660 


GGGAGAGTAC 


ATCACGATTC 


TTGATTCAGA 


TGATATCGTA 


AAAGAAAATA 


TGATTGAAAC 


16920 


TCTTTATCAC 


CAAGTCCAAG 


AAAAGGATGC 


AGATGTTGTT 


ATAGGGAATT 


ACTATAATTA 


16980 


TGACGAAAGT 


GACGGGAATT 


TTTATTTTTA 


TGTAACAGGG 


CAAGATTTTT 


GCGTCGAAGA 


17040 


ATTACCTATA 


CAAGAAATTA 


TGAACCGTCA 


AGCAGGAGAT 


TGGAAATTCA 


ATAGCTCGGC 


17100 


CTT7A7ATTG 


CCGACATTTA 


AGTTGATTAA 


AAAAGAATTA 


TTCAATCAAG 


TTCACTTTTC 


17160 


AAATGGTCCC 


CGCTTTGATG 


ATGAAGCAAC 


TATGCATCGC 


TTTTATCTTT 


TAGCCTCTAA 


17220 


AATCGTCTTT 


ATAAAt-OATA 


ALL IUIaIV. 1 


HTATAGAAflA 

\J 1 r\ I Awn/tun 


CGTTCAGGAA 


GCATCATGAG 


17280 


AACGGAATTT 


GATCTTTCCT 


GGGCAAGAGA 


TATTGTTGAA 


GTGTTTTCTA 


AGAAAATATC 


17340 


GGATTGTGTC 


TTGGCTGGTT 


TGGATGTCTC 


CGTTCTGCGT 


ATTCGATTTG 


TCAATCTTTT 


17400 


AAAAGATTAT 


AAGCAAACTT 


TAGAATACCA 


TCAATTAACA 


GATACTGAGG 


AATATAAAGA 


17460 


TATTTGTTTC 


AGATTAAAGT 


TGTTTTTTGA 


TGCAGAACAA 


AGAAATGGTA 


AAAGTTGAAA 


17520 


TAAAAGAATT 


GTTATTTACC 


ATATCACAAA 


CAATGAAGGT 


GAGGGGAGTG 


TTTTATGACT 


17530 


AAGATTTATT 


CGTCAATAGC 


AGTAAAAAAA 


GGACTATTTA 


CCTCATTTCT 


ACTGTTTATC 


17640 


TATGTATTCG 


GAAGTCGTAT 


TATTCTCCCT 


TTTGTTGACC 


TAAATACTAA 


AGATTTTTTA 


17700 
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GGAGGTTCAA 


CAGCCTATCT 


AGCCTTCTCA 


GCCGCCCTAA 


CAGGTGGGAA 


TCTAAGAAGT 


17760 


TTATCAATTT 


TTTCTGTTGG 


ATTATCCCCT 


TGGATGTCCG 


CCATGATTTT 


ATGGCAGATG 


17820 


TTTTCTTTTT 


CTAAACGGTT 


GGGTTTAACA 


TCTACGTCTA 


TAGAAATACA 


AGATCGCCGT 


17880 


AAAATGTACC 


TGACCTTGCT 


AATTGCTGTG 


ATTCAATCCT 


TGGCAGTTAG 


CTTGAGACTG 


17940 


CCAGTACAAT 


CCTCCTATTC 


TGCAATATTG 


GTTGTTCTAA 


TGAATACAAT 


ATTGCTGATA 


18000 


GCAGGAACAT 


TTTTTCTTGT 


TTGGTTGTCA 


GATTTAAATG 


CGAGTATGGG 


GATTGGAGGT 


18060 


TCTATTGTAA 


TCCTCCTATC 


CAGTATGGTT 


TTAAATATTC 


CTCAGGATGT 


TTTGGAAACA 


18120 


TTTCAGACAG 


TACACATTCC 


AACAGGGATT 


ATTGTGTTAC 


TTGCTTTATT 


AACCCTTCTC 


18180 


TTTTCTTATT 


TACTTGCCCT 


TATGTATCGA 


CCTCGCTATT 


TGGTTCCTGT 


TAATAAAATT 


18240 


GGCTTACACA 


ATCGATTTAA 


ACGCTATTCT 


TATCTCGAAA 


TCATGTTGAA 


TCCTGCAGGT 


18300 


CGGATGCCTT 


ATATGTATGT 


GATGAGTTTT 


CTTAGTGTAC 


CAGCTTATTT 


GTTCATCTTG 


18360 


TTGGGATTTA 


TTTTCCCTAA 


TCATTCAGGG 


TTAGCGGCTT 


TATCAAAGGA 


ATTTATGGTT 


18420 


GGAAACCCTT 


TGTGGGTCTA 


TGTTTATATT 


TCGGTCTTAT 


TTTTATTTAG 


TATCATTTTT 


18480 


CCTTTTGTTA 


CGATGAATGG 


AGAAGAGATT 


GCAGACCGTA 


TGAAAAAATC 


TGGAGAATAC 


18540 


ATTTATGGTA 


TTTATCCAGG 


TGCGGATACT 


AGTCGATTTA 


TTAATCGATT 


GGTCCTTCGT 


18600 


TTCTCAGTCA 


TAGGTGGTCT 


CTTTAATGTG 


ATTATGGCAG 


GTGGTCCCAT 


CCTTTTTCTT 


18660 


TTGTTTGATG 


AAAAGTTATT 


ACGATTGGCA 


ATGATTCCAG 


GCTTATTTAT 


GATGTTCGGG 


18720 


GGCATGATTT 


TTACGATTAG 


AGACGAGGTC 


AAGGCTTTAA 


CGCTAAATGA 


GACCTATAGA 


18780 


CCTTTGATTT 


AGGAGACTTT 


TATGTATTAT 


TTTATTCCAG 


CTTGGTATGG 


CrCACAAAGA 


18840 


ACATGGCATG 


CAGATATCAC 


TCCATGGTAT 


TTTTCTCATT 


TTCGTC7ACA 


CTTTGATGAT 


13900 


ACCTTTCACC 


AGATTCGGCT 


CTTTCAAGAG 


CAAGATATAC 


ATTCTCGTCT 


ATTAGTATTA 


18960 


GCTTACCAGC 


CTCATCTACG 


TTATTTTTTA 


TATAGACATG 


GTGTGTTAGA 


AATGGATACT 


19020 


TATTCCGTTT 


TTGATGTTAT 


GCAAGATTTT 


CATAATCTCC 


ATACCCAAGT 


TTTAAGCATT 


19080 


AGAGATATTG 


AGTGGGATGA 


TGACTGTGAA 


TTTATTTATA 


GTCCCTTTAC 


GATTATCGTT 


19140 


CAAAAAAATG 


GGAAGAAATT 


TGCTAAGGTT 


GAACATGGAG 


TTGAAGCCTT 


CATCAGTGAT 


19200 


ATACAGTATT 


TTGAACCAAA 


TGGTCAAATA 


CATATGCACC 


ATATCGTGGA 


TGATCGTGGG 


19260 


TTTGTATCGA 


GCATTATCTT 


TTTTGAAGAT 


GGGCAAGCAG 


CCTATCAAGA 


ATATCTGAAC 


19320 


CTCAAGGGAG 


AGTGGCAATT 


CAGAGAGCGT 


TTAAAAGAAG 


GAGGACAGGT 


AGAAGTCAAT 


19380 


CCAATTTTGG 


GTTATCGCTT 


TAAAATGCTT 


ACCTATCAAA 


ATATGGGAGA 


TCTGGTGGCA 


19440 
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GAATTTTTTC 


AGAATTATCT 


GCAAACGTAT 


GTGAAGGATC 


AGGATATTTT 


TATGCTTCCT 


19500 


TCTCATTCTC 


ATCATGACCA 


GTTGGTACTA 


GATCGTTTAC 


CTAGTACTAA 


TCCTAAACTG 


19560 


TTGAGTCTGT 


TCATTGGACG 


TAATCCTCAA 


GATACCTTTA 


GGGATTTAGA 


TGTAACTTTT 


19620 


GAAAAATCGG 


ATTTGATTTT 


GGTGGATAGA 


GAGGATAGTT 


TACGATTGTT 


GCAGGAGTTG 


19690 


TATCCTGAAC 


GAATGCATCA 


ATGTTATCAT 


TTATCATCTT 


TTGACACCCG 


ATTACGATTG 


19740 


GGACGAAGCC 


AAACTAAGAA 


AGAATCCATC 


ATTTATTTTC 


AACTGGATTT 


TGAGCAGGGG 


19800 


ATTGATAATC 


AAGCTCTGCT 


TCAAGTCTTG 


TCCTTTGTCG 


CTGAAAATAA 


GGATACTGAG 


19860 


GTGATTTTTG 


GAGCCTTTCC 


TGCTAGTCAG 


CAGCAAATGA 


ATGAGCTTGA 


AGGGATTGTT 


19920 


GAGTCTTTCA 


TCCAAGAAAA 


CATTCAATCC 


GAAAATCTGG 


GAAAGGCGAT 


TGATTATGGT 


19980 


GATGCAGAAA 


ATCCTCTGGA 


AGAAAATCAA 


CACCAGGACT 


TACCCTTACA 


GTTTGTTAAC 


20040 


TTGAATGATG 


AGTTAGATTT 


GATAAAAACA 


CTAGAATTTG 


TCCGTTTGAT 


TCTGGATTTA 


20100 


AATACACATC 


CTCATCTCTA 


CACACAGATT 


GCTGCGATTA 


GTGCAGGAAT 


TCCTCAAATC 


20160 


AACCTAGTTG 


AAACCCTCTA 


TGTTGAACAT 


TTAAAAAATG 


GTTATTTGTT 


AGCAGATGTT 


20220 


ACAGAATTTT 


CTAAGGCTGC 


ACATTATTAC 


ACAGATAGGT 


TGAACGAGTG 


GAATGAGTCC 


20280 


TTGATATATT 


CAATTGATAA 


GATTAAGGAG 


CACACAGGAC 


AACAATTTCT 


TCGAAAATTA 


20340 


GAGAAATGGA 


TAGAGGAGGT 


TAAAAATGTC 


AAAGGAACTT 


AATATTTTAC 


AGATAGGACT 


20400 


TGCCAATTGG 


GAAAATCACT 


ATCACATACC 


TGAAAATATG 


ACTTCGTATT 


ATTTTTACCC 


20460 


AAACTCATCA 


AAAGCCCTTC 


GTGAAATAAT 


TGAAAAAGAG 


GATATTAACC 


GTTTTCATGC 


20520 


AGTTTTAATA 


GAAGATGGTC 


AGTATTCCAG 


AGACTTATTT 


TCCTATGTAA 


AATATTTTGA 


20580^ 


ACCTTATACT 


TTATTTTATA 


ACCAGAATCT 


ACAAATAAAT 


GATAGAGAGG 


TTGTGGATTT 


20640 


TCTAAAAAAA 


CCATGTGCAC 


AAGCAATTCA 


TTTTTTAAGT 


CCCCAACAAC 


TAATCAATGA 


20700 


TTTAAGTAAA 


TCTCTTTTTG 


GCGCTGGGTA 


TGGTGATAAA 


CTCTTTCCTC 


CGACGATACA 


20760 


AGTCAATCCA 


4ATTTT A C A CI 




1 1 Al^.fv\V)Un 


I I UOA 1 I A l\J 


TCAIjTI 1GGA 


20820 


AGGTGAGTTT 


GGGCAAGATT 


TTGCCCAGCT 


TGCCTATTGG 


GCTTATAATA 


TTATCGTGCA 


20880 


AAAAACACTC 


CCTATTGAGT 


TGTGGCTTGA 


ATATGAGAAG 


GAACGCAATT 


GTGACTTTCG 


20940 


TTTAGTAATC 


CGTAAAATGT 


GGAGTGGGTC 


TGTTGATGAT 


TTCTTTGAAG 


AAGTAATAGT 


21000 


ATCTGAAAAA 


GACTTGGAGC 


AAGCACTTTT 


TATGGATAGT 


CGAGACGGAG 


ACTACTTTCT 


21060 


CTCGATATCT 


GTTGAAGCAA 


GAGGTCGTGG 


AACTATCAAA 


CTAGGTAATC 


TTCACCAACG 


21120 


ATGGAGTCGA 


AAACAATTTG 


GTAAGTTTGT 


ACTTGGTGGA 


AATATCCTAC 


ATGATTCCAA 


21160 


GCGTGATCAA 


ATAAACTATT 


TCTTCCATCC 


AGGTGATTTT 


AAACCGCCTT 


TGACTGTCTA 


21240 
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TTTTGCACGT TATCGACCTG CAGAAGGATT CGAGGGTTAC TTTATGATGA AAACTCTTGG 21300 

ATGTCCCTTC ATTTTATTTT CTGATCCACG TTTAGAGGGG GGAGCTTTTT ATCTCGGAAC 21360 

GGATGAGCTA GAGGGAAAAG TAAAGGATAC GATCACTCAC TATCTTGATT ATTTAGGCTT 214 20 

TGATCATAAG GATTTGATTT TATCAGGTCT TTCTATGCGA ACGTTTCCGG CTCTCTATTA 21480 

TGGTGCTTCT TTTGAACCCC ATGCCATCAT AGTTGGTAAG CCCTTGGCTA ATTTAGGAAC 21540 

TATAGCTAGT CGTGGACGTT TGGACGCACC GGGTGTCTCT AACTTAGCTT TTGATTGTTT 216O0 

AATTCATCAT ACAGGTGGGA CAAGTTCTCA AGATATGACG GAGTTGGATC AGCCTTTTTG 21660 

GAAAATTTTT AAACAAGCAA ATTTTTCAAA GACAACCTTT GGTTTATCCT ATATGAAAGA 21720 

TCAAGAAATG GATCCACAAG CCTATGAACA ATTAGTGTCT TATCTGTGTA ATACAGGTGC 21780 

GAAGATTTTA TCTAAAGGAA CTGCTGGACG ACACAATGAT GATACAGATA CCAATATTTC 21840 

TTGGTTTTTG CACTTTTATA GAATGGTCTT AGAGACTGGT TTTGGAAGGG AGAAAAGATG 21900 

ATTATTACAC AGAGACAGTC TATTCATTGG GCAGAAGTTG CTGGCACTTA TATGTATGGA 21960 

ACAACTGTAT CTTATTACCC TGACAAAAGT GTTCGTCTGT ATAATCCTCT ATTGCCATCT 22020 

GGTGAGATTC TAAAGACTTG GTTTTCTAGT GTCAATTACC AGGCTGCACG AACCCAACCT 22080 

CAGCTTCCCT TATTAAAAAG AAAGCAGGAG TATCAACTAT CACTGGTTTT TGACTGTCAG 22140 

CCTGAAAATG GAGTTTATAC CAAGATAACT TTTTTTGACC GCTATGGTGA TATTTTACAA 22200 

AAAAAGGTAG AAAAAGTGAA AGATTTCATA TTTACTTATC CAGAAGATAG TTATACTTAT 22260 

CGAGTTTCTC TTTTAAGTGC TGGATTTGAG TCCTTAACTT TTTATCATTT TTCTATCAAG 22320 

GAGATCAGAA GTGTTTAGAC GTTTAGGTCA AGATTTCCAG CTTAGGAAAG TGAAAAAGAT 22380 

TTTAAAGCAG ATTAATGCCC TGAAAGGCAA CATGTCCTCT CTTTCGGATC AAGAATTAGT 22440 

AGCTAAAACA GTAGAGTTTC GTCAGCGTCT TTCCGAGGGA GAAAGTCTAG ACCATATTTT 22500 

GCTTGAAGCT TTTGCTGTGG TGCGTGAAGC AGATAAGCGG ATTTTAGGGA TGTTTCCT*A 22560 

TGATGTTCAA GTCATGGGAG CTATTGTCAT GCACTATGGA AATGTTGCTG AGATGAATAC 22620 

GGGGGAAGGT AAGACCTTGA CAGCTACCAT GCCTGTCTAT TTGAACGCTT TTTCAGGAGA 22680 

AGGAGTGATG GTTGTGACTC CTAATGAGTA TTTATCAAAG CGTGATGCCG AGGAAATGGG 22740 

TCAAGTTTAT CCTTTTCTAG GATTGACCAT TGGTGTACCA TTTACGGAAG ATCCAAAGAA 22800 

GGAGATGAAA GCTGAAGAAA AGAAGCTTAT CTATGCTTCG GATATCATCT ACACAACCAA 22860 

TAGTAATTTA GGTTTTGATT ATCTAAATGA TAACCTAGCC TCGAATGAAG AAGGTAAGTT 22920 

TTTACGACCG TTTAACTATG TGATTATTGA TGAAATTGAT GATATCTTGC TTGATAGTGC 22980 
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ACAAACTCCT 


CTGATTATTG 


CCGGTTCTCC 


TCGTGTTCAG 


TCTAATTACT 


ATGCGATCAT 


23040 


TGATACACTT 


GTAACAACCT 


TGGTCGAAGG 


AGAGGATTAT 


ATCTTTAAAG 


AGGAGAAAGA 


23X00 


GGAGGTTTGG 


CTCACTACTA 


AGGGGGCCAA 


GTCTGCTGAG 


AATTTCCTAG 


GGATTGATAA 


23160 


TTTATACAAG 


GAAGAGCATG 


CGTCTTTTGC 


TCGTCATTTG 


GTTTATGCGA 


TTCGAGCTCA 


23220 


TAAGCTCTTT 


ACTAAAGATA 


AGGACTATAT 


CATTCGTGGA 


AATGAGATGG 


TACTGGTTGA 


23280 


TAAGGGAACA 


GGGCGTCTAA 


TGGAAATGAC 


TAAACTTCAA 


GGAGGTCTCC 


ATCACGCTAT 


23340 


TGAAGCCAAG 


GAACATGTCA 


AATTATCTCC 


TGAGACGCGC 


GCTATGGCCT 


CGATCACCTA 


23400 


TCAGAGTCTT 


TTTAAGATGT 


TTAATAAGAT 


ATCTGGTATG 


ACAGGGACAG 


GTAAGGTCGC 


23460 


GGAAAAAGAG 


TTTATTGAAA 


CTTACAATAT 


GTCTGTAGTA 


CGCATTCCAA 


CCAATCGTCC 


23520 


GAGACAACGG 


ATTGACTATC 


CAGATAATCT 


ATATATCACT 


TTACCTGAAA 


AAGTGTATGC 


23580 


ATCCTTGGAG 


TACATCAAGC 


AATACCATGC 


TAAGGGAAAT 


CCTTTACTCG 


TTTTTGTAGG 


23640 


CTCAGTTGAA 


ATGTCTCAAC 


TCTATTCGTC 


TCTCTTGTTT 


CGTGAAGGGA 


TTGCCCATAA 


23700 


TGTCCTAAAT 


GCTAATAATG 


CGGCGCGTGA 


GGCTCAGATT 


ATCTCCGAGT 


CAGGTCAGAT 


23760 


GGGGGCTGTG 


ACAGTGCCTA 


CCTCTATGGC 


AGGACGTGGT 


ACGGATATCA 


AGCTTGGTAA 


23820 


AGGAGTCGCA 


GAGCTTGGGG 


GCTTGATTGT 


TATTGGGACT 


GAGCGGATGG 


AAAGTCAGCG 


23880 


t 

GATCGACCTA 


CAAATTCGTG 


GCCGTTCTGG 


TCGTCAGGGA 


GATCCTGGTA 


TGAGTAAATT 


23940 


TTTTGTATCC 


TTAGAGGATG 


ATGTTATCAA 


GAAATTTGGT 


CCATCTTGGG 


TGCATAAAAA 


24000 


GTACAAAGAC 


TATCAGGTTC 


AAGATATGAC 


TCAACCGGAA 


GTATTGAAAG 


GTCGTAAATA 


24060 


CCGGAAACTA 


GTCGAAAACG 


CTCAGCATGC 


CAGTGATAGT 


GCTGGACGTT 


CAGCACGTCC 


24120 


TCAGACTCTG 


GAGTATGCTG 


AAAGTATGAA 


TATACAACGG 


GATATAGTCT 


ATAAAGAGAC 


24180 


AAATCGTCTA 


ATAGATGGTT 


CTCGTGACTT 


AGACGATCTT 


GTTGTCGATA 


7CATTCAGAG 


24240 


ATATACAGAA 


GAGGTAGCGG 


CTGATCACTA 


TGCTAGTCGT 


GAATTATTGT 


TTCACTTTAT 


24300 


TGTGACCAAT 


ATTAGTTTTC 


ATGTTAAAGA 


GGTTCCAGAT 


TATATAGATG 


TAACTGACAA 


24360 


AACTGCAGTT 


CGTAGCTTTA 


TGAAGCAGGT 


GATTCATAAA 


GAACTTTCTC 


AAAAGAAAGA 


24420 


ATTACTTAAT 


CAACATGACT 


TATATGAACA 


GTTTTTACGA 


CTTTCACTGC 


TTAAAGCCAT 


24480 


TCATGACAAC 


TGGGTAGAGC 


AGGTAGACTA 


TCTACAACAG 


CTATCCATGG 


CTATCGGTGG 


24540 


TCAATCTGCT 


AGTCAGAAAA 


ATCCAATCGT 


AGAGTACTAT 


CAAGAAGCCT 


ACGCGGGCTT 


24600 


TGAAGCTATG 


AAAGAACAGA 


TTCATGCGGA 


TATGGTGCGT 


AATCTCCTGA 


TGGGGCTGGT 


24660 


TGAGGTCACT 


CCAAAAGGTG 


AAATCGTGAC 


TCATTTTCCA 


TAAAAGGAGA 


AAATATGACA 


24720 


ATTTACAATA 


TAAATTTAGG 


AATTGGTTGG 


GCTAGTAGCG 


GTGTTGAATA 


CGCTCAAGCC 


24780 
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TATCGTGCTG GTCTTTTTCG GAAATTAAAT CTGTCCTCTA AGTTTATCTT TACAGATATG 24840 

ATTTTAGCCG ATAATATTCA GCACTTAACA GCCAATATTG GTTTTGATGA TAATCAGGTT 24900 

ATCTGGCTTT ATAATCATTT CACAGATATC AAAATTGCAC CTACTAGCGT GACAGTGGAT 24960 

GATGTCTTGG CTTACTTTCG TGGTGAAGAA AGTCACAGAG AAAAAAATGG CAAGGTTTTA 25020 

CGTGTATTCT TTTTTGACCA AGATAAGTTT GTAACCTGTT ATTTGGTTGA TGAGAACAAG 25080 

GACTTGGTTC AACATGCCGA GTATGTTTTT AAGGGAAACC TGATTCGGAA GGATTACTTT 25140 

TCTTATACGC GTTATTGTAG CGAGTATTTT GCTCCCAAGG ACAATGTTGC AGTCTTATAC 25200 

CAACGAACTT TTTATAATGA AGACGGGACT CCAGTCTATG ATATCTTGAT GAATCAAGGG 252 60 

AAGGAAGAAG TTTATCATTT CAAGGATAAG ATTTTCTATG GAAAGCAAGC TTTTGTGCGT 25320 

GCCTTTATGA AATCTTTGAA TTTGAATAAG TCTGATTTGG TCATTCTCGA TAGGGAGACA 25380 

GGTATTGGAC AGGTTGTGTT TGAGGAAGCA CAGACAGCAC ATCTAGCCGT AGTTGTTCAT 25440 

GCGGAGCATT ATAGTGAAAA TGCTACAAAT GAGGACTATA TCCTTTGGAA TAACTATTAT 25500 

GACTATCAGT TTACCAATGC AGATAAGCTT GACTTCTTTA TCGTGTCTAC TGATAGACAA 25560 

AATGAAGTTC TACAAGAGCA ATTTGCCAAA TATACTCAGC ATCACCCAAA GATTGTTACC 25620 

ATTCCTGTAG GCAGTATTGA TTCCTTGACA GATTCAAGTC AAGGGCGCAA ACCATTTTCA 25680 

TTGATTACGG CTTCACGTCT TGCCAAAGAA AAGCACATTG ATTGGCTTGT GAAAGCTGTG 25740 

ATTGAAGCTC ATAAGGAGTT ACCGGAACTA ACCTTTGATA TCTATGGTAG TGGTGGAGAA 25800 

GATTCTCTGC TTAGAGAAAT TATTGCAAAT CATCAGGCAG AGGACTATAT CCAACTCAAG 25860 

GGGCATGCGG AACTTTCGCA GATTTATAGC CAGTATGAGG TCTACTTAAC GGCTTCTACC 25920 

ACCCAAGGAT TTCCTCTCAC CTTCATGCAA GCTATTGGTT CACGTCTACC TCTAATTGGT 25990 

TTTGATGTGC CTTATGGTAA TCAGACCTTT ATAGAGGATG GGCAAAATGG TTATTTGATT 26040 

CCAAGTTCAT CTGACCATGT AGAAGACCAA ATCAAGCAAG CTTATGCCGC TAAGATTTGT 26100 

CAATTGTATC AAGAAAATCG TTTGGAAGCT ATGCGTGCCT ATTCTTACCA AATTGCAGAA 26160 

CGCTTCTTGA CCAAAGAAAT TTTAGAAAAG TGGAAGAAAA CAGTAGAGGA GGTGCTCCAT 26220 

GATTGAACTT TATGATAGTT ACAGTCAAGA AAGTCGAGAT TTACATGAAA GTCTAGGCGC 26280 

TACTGGTCTT TCTCAACTTG GAGTGGTCAT CGATGCAGAT GGTTTTCTGC CTGATGGTCT 26340 

GCTTTCTCCT TTTACCTATT ATCTAGGTTA CGAGGATGGA AAACCTCTCT ATTTTAATCA 26400 

AGTTCCCGTT TCAGATTTTT GGGAAATTTT AGGAGATAAT CAGTCTGCTT GTATTGAAGA 264 60 

TGTGACGCAG GAGAGGGCTG TCATTCATTA TGCTGATGGA ATGCAGGCTC GCTTGGTTAA 26520 
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ACAGGTAGAC 


TGGAAAGACC 


TAGAAGGTCG 


AGTACGTCAG 


GTTGACCACT 


ACAATCGCTT 


26580 


CGGAGCTTGT 


TTTGCTACAA 


CGACTTATAG 


CGCAGATAGC 


GAGCCGATTA 


TGACAGTTTA 


26640 


CCAAGATGTC 


AATGGTCAAC 


AAGTTTTACT 


GGAAAACCAT 


GTGACGGGTG 


ATATCTTATT 


26700 


GACTTTGCCA 


GGTCAGTCCA 


TGCGTTACTT 


TGCAAATAAA 


GTTGAATTTA 


TCACCTTCTT 


26760 


TTTGCAAGAT 


TTGGAAATAG 


ATACCAGTCA 


GCTTATCTTT 


AATACTCTAG 


CGACTCCTTT 


26820 


CTTGGTTTCC 


TTCCATCATC 


CAGATAAATC 


TGGCTCGGAT 


GTCTTGGTAT 


GGCAGGAACC 


26880 


TCTCTATGAT 


GCCATTCCAG 


GTAATATGCA 


GTTGATTTTG 


GAAAGTGATA 


ATGTGCGTAC 


26940 


TAAGAAGATC 


ATCATTCCAA 


ATAAGGCGAC 


TTATGAGCGC 


GCTTTAGAGT 


TAACTGACGA 


27000 


GAAATACCAT 


GATCAGTTTG 


TGCACTTGGG 


TTATCATTAC 


CAGTTCAAAC 


GTGATAATTT 


27060 


CCTAAGACGA 


GATGCCTTAA 


TCTTGACCAA 


TTCAGATCAG 


ATTGAGCAAG 


TAGAAGCAAT 


27120 


CGCAGGAGCC 


TTGCCTGATG 


TCACTTTCCG 


TATTGCAGCG 


GTGACAGAGA 


TGTCTTCTAA 


27180 


GCTCTTAGAC 


ATGCTTTGCT 


ATCCTAATGT 


GGCCCTTTAC 


CAGAACGCTA 


GTCCACAGAA 


27240 


GATTCAGGAG 


CTGTATCAAC 


TGTCGGATAT 


TTACTTGCAT 


ATAAACCACA 


GTAATGAGTT 


27300 


GCTACAGGCA 


GTGCGTCAGG 


CCTTTGAGCA 


CAATCTCTTG 


ATTCTTGGCT 


TTAATCAGAC 


27360 


GGTCCACAAT 


AGACTTTATA 


TCGCTCCAGA 


CCATCTATTT 


GAAAGTAGTG 


AAGTTGCTGC 


27420 


TTTGGTTGAG 


ACCATTAAAT 


TGGCCCTTTC 


AGATGTTGAT 


CAAATGCGTC 


AGGCACTTGG 


27480 


CAAACAAGGC 


CAACATGCAA 


ATTATGTTGA 


CTTGGTGAGA 


TATCAGGAAA 


CCATGCAAAC 


27540 


TGTTTTAGGA 


GGCTAACATG 


TCAGAGGAAG 


ATTTATTTTA 


CAAAGACGTT 


GAAGGCCGCA 


27600 


TGGAAGAGTT 


GAAACAAAAA 


CC CATCAAG A 


AGGAAAAAGA 


AACCCGAGGG 


GAAAACATTA 


27660 


GTAAGACTTT 


TTCACTTTTA 


CTGGCTTTGA 


TGATTCTGAT 


TGGTTTGCTC 


TTTACTTTGC 


27720 


TGGGAATTTT 


GAGGTAGATC 


TATGATTGAA 


ATACTAATTG 


TTTTAGCTAT 


TATCCTATCT 


27780 


CTTGCTTTGA 


TTGTATTGGT 


AACTATACAA 


CCCCGTCAAA 


ATCAACTATT 


TTCCATGGAT 


27840 


GCCACTAGTA 


ATATTGGTAA 


ACCAAGCTAC 


TGGCAGAGCA 


ACACCTTGGT 


CAAGGTGCTC 


27900 


ACl'lTATTGG 


TGAGTTTGGC 


TTTATTTATT 


CTACTATTAA 


CCTTTATGGT 


GATTACTTAT 


27960 


AAATAAAAGA 


AAACTTCAGA 


TATTCACCTT 


TTGTGGATTG 


GTCTGAAGTT 


TTCTTTTTTA 


28020 


TACTCAATGA 


AAATCAAAGA 


GCAAACTAGG 


AAGCTAGCCG 


CAGGCTgCTC 


AAAACACCGT 


28080 


TTTGAGGTTG 


TAGATATAAC 


TGACGAAGTC 


AGCTCAAAAC 


ACCGTTTTGA 


GGTTGTAGAT 


28140 


ATAACTGACG 


AAGTCAGCTC 


AAAACACCGT 


TTTGAGGTTG 


TGGATAGAAC 


TGACGAAGTC 


28200 


AGCTCAAAAC 


ACCGTTTTGA 


GGTTGTGGAT 


AGAACTGACG 


AAGTCAGCTC 


AAAACACCGT 


28260 


TTTGAGGTTG 


TGGATAGAAC 


TGACGAAGTC 


AGCTCAAAAC 


ACCGTTTTGA 


GGTTGTGGAT 


28320 
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AGAACTGACG 


AAGctCAGTA 


ACATATATAC 


AGCAAGGCGA 


CGCTGACGTG 


GTTTGAAGAG 


28380 


TATTACTGTC 


TATATTTTTG 


GTAAAAATCA 


ACTTTTACTT 


GGATGAAGGT 


TTTGGCTTCA 


28440 


CGTAGGAGTT 


GAAGAAGGGT 


GGCGCGGGTT 


TCAAATTCTT 


CTCTTGTCTT 


GGGCAGACTG 


28500 


CGGTTCCGGA 


AGACTTCCAG 


ATAACGTTCA 


ATTTCATCTA 


GCAAATCAGA 


AGCAGGATTG 


28S60 


GTCTGGCTCA 


GTTGACCTGC 


AATTTTTGAA 


AAGAGTTGCG 


CTAAGATCAG 


GCTTTCACTG 


28620 


GCGGCAAGGT 


GACAAGTGTT 


AATCTGTTGG 


GCCATGTTTC 


TCAGGATACG 


ACTTTGTCGC 


28680 


TGTCTCATCT 


CAAAGTAGTG 


GATATGGTAG 


TCTGTCTGGT 


GAAAGAGGTG 


GTCAGAGTGA 


28740 


TCCAAATAGA 


CCAGTCTGAG 


GGCTTCTTTC 


AAAAGCGTGT 


CTAATTCTGC 


TACCAGCTGT 


28800 


GCTCGGTTGC 


GTCCGTCTCC 


TCTGGATAAA 


TAGTATTTGA 


AGCGCTGGAG 


GATATCTTTT 


28860 


AACTTTTCTT 


CCACCAGCGT 


GTGGTAGTGC 


TGGATTTCCT 


CTTCTCGTGA 


AGGCATATAG 


28920 


AGATTAACAA 


GCAAGGCAAA 


TCCTGTACCA 


ATAGCAAAGA 


GAAGGAATTC 


ATTGACTAGA 


28980 


AGGTCTGGAG 


AGGTTGACTC 


TTGAACCAAG 


AGATGGCTAA 


CCAAAACAGT 


GCTTGGTGTG 


29040 


ATGCCAATTT 


CCCAGCCCAT 


CTTGTAGGCT 


AAAGGAACGT 


AGAAGGCCAG 


ATAGAGGCCG 


29100 


AGACTCCAGA 


TATGAAATCC 


GCTCAAGTGA 


AAAGCTAGAA 


CACCGATAGC 


CAGAGCTAGA 


29160 


AGCATAGAAA 


AAAGACGATT 


GCGAGCCAGT 


TTTAAAGTAC 


TTCTACGCGT 


ATCAGATAGG 


29220 


CTCAAGAGAG 


CGATAATTCC 


AGCCGAAACT 


GCTGACGAAA 


GATTGAGAAA 


ATAAGCAAGC 


29280 


AGGCAGGCAA 


GACAGGTAGC 


TAAGATGAGC 


TTGGTCGTAC 


GTTGGCTAAT 


AGACATAACA 


29340 


ATTTCCTAAT 


AAGTTAGAAT 


AAAAGCGTAA 


AAGACAAGAC 


ATGACCACGC 


TTGCCTTGAT 


29400 


GAGTTATTTT 


TTACGGGTTG 


CTGCCTATTC 


GGCAACGGCG 


GTAAAGAGGA 


CATCTGTAGA 


29460 


AGAGTTAAGG 


GCTGTTTCAC 


ATGAGTCTTG 


GATGACACCA 


ATCACAAAAC 


CAACCCCAAC 


29520 


AATTTGTATG 


GCAATATCGT 


TAGAAATACC 


GAAAAGGCTA 


CAAGCAACTG 


GGATAAGAAG 


29580 


GAGGGAACCT 


CCGGCAATAC 


CTGAAGCATC 


ACAGGATGAG 


ATAGCTGCTA 


CCACACTGAG 


29640 


GACAAAGGCT 


GTGGCAAAGT 


CAACAGGAAT 


TCCAAGAGTG 


TTAACTGCAG 


CAAGGGTCAA 


29700 


AAGGTTAATG 


GTAATCGCTA 


CTCCAGCCAT 


ATTGATAGTA 


GAACCGAGTG 


GGATAGAAAC 


29760 


AGAATAGGTA 


TCTGGGTTGA 


GTCCAAGGTC 


ATGGCAGAGT 


TTCATGTTGA 


CAGGAATGTT 


29820 


AGTCGCAGAA 


CTACGAGTGA 


AAAAGGCTGT 


CACACCGCTG 


ACACGGAGGC 


AGTTCCAAAC 


29880 


TAGAGGGTAA 


GGATTGCGTC 


TCATAAAGAA 


GAAGGCAATC 


AAAGGGTTGA 


CCACAGGGGC 


29940 


AACAAAAAGC 


ATAGTCGTTA 


CTAATAGAAC 


CAATAAAATA 


CCGTAGTTGG 


CAAGGCTTCC 


30000 


GACTCCCTTG 


TCAGAAATCG 


TTTTAAAAAC 


AAGACCAAGG 


ATTCCAAATG 


GAGCCAGATT 


30060 
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GATGATCCAT 


TCGACAATTT 


TAGAAGTCAC 


GTCAGCGATA 


GTTTTTAGCA 


ATTCTTGACT 


30120 


ATTTTTACTG 


GCTTCTCTCA 


TAGCGATTCC 


AAAAATGACT 


CCCCAAGATA 


AGATTCTAAT 


30180 


ATAGTTAGCA 


GTAAGCAGGG 


CGTTGACTGG 


GTTGTCAACC 


AGTTTGAGCA 


AGAGGTTGCT 


30240 


GAGAACCTGC 


CCAATCCCAT 


CTGGTGGTGC 


AATTTCAGTA 


TTGGCACTAT 


TTGGGGTAAT 


30300 


TTCAATAGGG 


ACGATGAAAT 


TTGCTAGTAC 


AGCTACAAGA 


GCAGCGGCGA 


AAGTCCCTAT 


30360 


CATAGGATAT 


ACAAGAAAAC 


AACAGTTTTC 


ATATTGCTAT 


CTTGTCCCTT 


TTGATGTTGG 


30420 


GAAAGGGCAT 


TGGCAACGAG 


AGCAAAGACT 


AGGATAGGAG 


CAACAGCTTT 


TAGACCTCCA 


30480 


ACGAATAAAT 


CCTCGAGTAG 


CCCAATCCCT 


GAGAGATTAG 


GAAGGGTCAG 


TCCTAGGATT 


30540 


CCCCACAAAG 


CATACCAATC 


AAGATACGCT 


TGACAAGGCT 


TGCCTTATTC 


CAAGCATGAA 


30600 


TGATTCTTTT 


CATAATAATC 


TCCTTTTTGT 


GTAGTGATTA 


TGATTATAGT 


ATAAATGATA 


30660 


GACAAAATCA 


AGAATTTTCT 


GTCTATTTTT 


TGAATATTTA 


TGGAGAATGA 


GACTCATGAA 


30720 


AATATGGTAT 


AATGAAATAA 


AGGAGTTTTA 


TATGCAAAAA 


TTTATTCAGG 


CTTATATTGA 


30780 


AAAGCTAGAT 


GTGACAACCA 


TTATCGAGAA 


TATTCTAACC 


AAGGTCATTT 


CTCTTTTACT 


30840 


GCTTTTAATT 


GTATTTTATA 


TTGCTAAAAA 


AATGCTTCAT 


ACCATGGTGC 


AGAGAATTGT 


30900 


CAAACCTTCT 


CTAAAAATGT 


CTCGTCATGA 


TGTTGGACGC 


CAAAAAACCA 


TCTCACGTTT 


30960 


ACTAGAAAAT 


GTGTTTAATT 


ATACGCTATA 


TTTCTTTTTA 


CTCTACTGCA 


TTTTGTCGAT 


31020 


TTTAGGTTTG 


CCAGTTTCTA 


GTTTGCTGGC 


TGGAGCTGGT 


ATTGCTGGGG 


TAGCGATTGG 


31080 


TATGGGAGCC 


CAAGGCTTTC 


TGTCTGATGT 


CATCAATGGC 


TTTTTCATCC 


TCTTTGAACC 


31140 


TCAACTGGAT 


GTGGGAGATG 


AGGTCCTTCT 


GACAAATGGA 


CCGATTACTG 


TATCGGGTAA 


31200 


GGTTGTCAGT 


GTGGGAATTC 


GTACGACACA 


GCTTCGTAGC 


GAGGAGCAAG 


CCCTTCACTT 


31260 


TGTCCCTAAC 


CCAAATATCA 


CAGTTGTTAC 


CAATTTCTCA 


CGCACAGACT 


AGACCTGTTA 


31320 


TTTTAAGTAA 


TTTGTGGTAC 


AATAGAGGGA 


GTTTAATAAG 


GAGAAAAGAT 


GGTTTTAGAA 


31380 


AAGCAGTTGG 


GCAATGGTTG 


TACCTGGATA 


GACCTAGACC 


TAGGAAAGTT 


GAATAAACTA 


31440 


GAAGACCTTT 


CTGAAATTTA 


CGGTTTUGAC 


AAGGAAACCA 


TTGAATACGC 


ACTGGATAGA 


31500 


AACGAGCGCG 


CCCACATGGA 


CTACCACCGT 


GAAAGTGAGA 


CGGTTACCTT 


TATCTATAAT 


31560 


GTCTTAGACG 


TAAAAAAGGA 


CAAGGCCTAC 


TATGAGACTT 


TTCCCATGAC 


CTTTATTGTC 


31620 


GAGCATCGTC 


GCCTGATTAC 


CATTAGTAAT 


ACCAAGAACG 


CCTATGTCAT 


TGAACAGATG 


31680 


ACTCGTTATC 


TGGAGAACCA 


TGACACGCTT 


TCGATTTATA 


AGTTTCTCTT 


TGCCAGTCTG 


31740 


GAAATCATCA 


GCAATGCCTA 


CTATCCTGTC 


ATTGAGCAGA 


TGGACAAGAG 


TAGGGATGAG 


31800 


GTCAATGACC 


TCTTGCGCCA 


GCGAACTACC 


AAGAAAAACC 


TCTTTGTCCT 


GTCTGATTTG 


31860 
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GAGACTGGTA 


TGGTTTATCT 


GACGGCAGCT 


GCCAAACAAA 


ATCGGATTTT 


GTTAGAGCAT 


31920 


ATTCAAGGTC 


ATGCCTTGTA 


TCGTAGTTTT 


GATGAGATTG 


AGAGAGAACA 


GTTTGATGAT 


31980 


GCCATGATTG 


AGGCTCATCA 


GCTGGTATCC 


ATGACAGACC 


TAATCTCTCA 


GATTTTACAC 


32040 


CAGCTTTCAC 


CCTCTTACAA 


CAATATTCTA 


AACAATAATC 


TGAATGACAA 


TTTGACAACC 


32100 


TTGACTATCA 


TTTCAGTCTT 


CCTAGCTGTT 


TTGGCAGTCG 


TGACAGGCTT 


TTTCGGAATG 


32160 


AATGTTCCCT 


TACCTTTAAC 


AGATGAGCCC 


CATGCTTGGC 


TCTATATCAG 


TTTGGCTAGT 


32220 


GCAGGTTTGT 


GGATTGTTTT 


ATCCTTGTTA 


CTAAGGAAAA 


TTGCGAAAAA 


AAGTTAAGAA 


32280 


AAGGAGCCAG 


AATGGCGATT 


G AAAATT AT A 


TACCAGATTT 


TGCTGTGGAA 


GCAGTCTATG 


32340 


ATCTGACAGT 


CCCAAGCCTG 


CAGGCGCAGG 


GAATAAAGGC 


TGTTTTGGTC 


GATTTGGATA 


32400 


ATACCCTCAT 


TGCTTGGAAC 


AACCCTGATG 


GAACGCCAGA 


GATGAAGCAA 


TGGCTACATG 


32460 


ACCTTCGGGA 


CGCGGGTATT 


GGCATTATCG 


TAGTGTCAAA 


TAACACCAAA 


AAACGCGTTC 


32520 


AACGAGCAGT 


TGAGAAATTT 


GGGATTGATT 


ACCTTTACTG 


GGCCTTGAAG 


CCCTTCACAT 


32580 


TTGGTATTGA 


CCGTGCTATG 


AAGGAATTCC 


ACTATGACAA 


AAAGGAAGTG 


GTCATGGTTG 


32640 


GTGACCAACT 


CATGACAGAT 


ATACGAGCAG 


CCCACCGTCC 


AGGGATTCGG 


TCAATTTTAG 


32700 


TCAAACCCTT 


GGTCCAACAT 


GACTCAATCA 


AAACGCAGAT 


TAACCGAACT 


CGTGAGCGTC 


32760 














32768 



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14872 base pairs^ 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xii SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

CCAGTCACAA AGAAATTGAG CGCGTTCAGc TGAGGATGCA CTATGATGCA AGCTACATTT 60 

CATTTGATGG GATATTAAGA AAGGAGATTT TCATGACACT TTTAGATGTA AAACACGTTC 120 

AAAAAATTTA TAAAACACGT TTTCAGGGCA ACCAAGTAGA AGCCCTCAAG GATATTCACT 180 

TTACCGTAGA AAAGGGTGAC TACGTTGCCA TCATGGGTGA GTCTGGTTCT GGTAAATCAA 240 

CTCTTCTCAA TATTCTAGCT ATGTTGGATA AACCAAGTCG TGGTCAGGTT TACTTGAATG 300 

GAACTGACAC CGCAACTATT AAAAATTCAC AGGCTTCTAG TTTCCGGCGT GAAAAGCTAG 3 60 

GATTTGTCTT CCAAGACTTT AACTTGCTAG ATACTCTGTC TGTTAAGGAC AATATCTTGC 4 20 
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TTCCGCTTGT 


CTTGTCAAGA 


AGACCTATAA 


CGGAGATGAT 


GAAGAAATTG 


GTGGTGACAG 


460 


CTGAGAATCT 


GGGTATTAAC 


CAATTGCAAG 


AGAAGTACCC 


TTACGACATT 


TCTGGTGGTC 


540 


AGAAACAGCG 


TGTAGCAGTA 


GCCCGCGCCA 


TCATCACAGA 


ACCTGAAATT 


CTCCTTGCGG 


600 


ACGAGCCAAC 


AGGAGCCCTT 


GATTCCAAGT 


CATCTGCAGC 


CTTACTTGAT 


GTCTTTAATG 


660 


AAATCAATGA 


GCGTGGGCAA 


ACCATCCTCA 


TGGTAACCCA 


CTCAACAGCA 


GCTGCTAGCA 


720 


GGGCCAAGCG 


TGTTCTCTTT 


ATCAAAGACG 


GCATTCTTTA 


CAACCAAATC 


TACCGTGGAG 


780 


AGAAGACAGA 


GCGTCAGATG 


TTCCAAGAAA 


TCTCTGATAC 


CTTGACTGTC 


ATGGCAAGCG 


840 


AGGTGAATTA 


GTATGTTTCG 


ATTAACCAAT 


AAGTTAGCGG 


TATCGAACTT 


GATTAAAAAC 


900 


CGCAAACTCT 


ACTATCCCTT 


TGCACTGGCT 


GTTCTCTTGG 


CAGTCACCAT 


CACCTATCTC 


960 


TTTTACTCCC 


TAACCTTCAA 


TCCAAAGATT 


GCGGAAATCC 


GTGGAGGAAC 


CACCATTCAA 


1020 


GCAACACTTG 


GATTTGGTAT 


GTTTGTCGTT 


ACCCTTGCGT 


CACCATTATC 


GTCCTCTATG 


1080 


CCAATAGTTT 


TGTCATGAAA 


AACCGTTCCA 


AGGAACTGGG 


TATATATGGC 


ATGTTAGGCT 


1140 


TGGAGAAGCG 


CCATCTAATC 


AGTATGACCT 


TTAAGGAGTT 


AGTGGTATTT 


GGGATTCTAA 


1200 


CTGTTGGAGC 


GGGTATCGGT 


ATTGGAGCCT 


TGTTTGACAA 


GTTAATTTTC 


GCTTTCCTGC 


1260 


TCAAACTAAT 


GAAACTGAAG 


GTTGACCTCG 


TTGCTACCTT 


CCAAATGAAT 


GTTGTCATTG 


1320 


CAGTACTTGT 


TGTCTTTGGA 


TTGATTTTCC 


TAGGCCTCAT 


GTTCCTGAAT 


GCTCTTCGAA 


1380 


TCGCCCGTAT 


GAATGCCCTC 


CAGCTCTCGC 


GTGAGAAAGC 


AAGCGGAGAG 


AAAAGAGGTC 


1440 


GCTTCCTACC 


TCTCCAAACG 


ATTCTTGGTT 


CCATAAGTTT 


AGGGATTGGC 


TATTATCTTG 


1500 


CCCTTACGGT 


AACCGATCCT 


CTTACAGCCC 


TAACAACTTT 


CTTCCTAGCT 


GTTTTGCTGG 


1560 


TTATCTTTGG 


TACTTATCTA 


TTGTTTAATG 


CAGGGATTAC 


AGTCTTCCTA 


CAAATCTTAA 


1620 


AGAAAAACAA 


GAAATACTAT 


TACCAACCTA 


ATAACCTCAT 


ATCTGTTTCC 


AACTTGATTT 


1680 


TCCGTATGAA 


GAAAAATGCG 


GTTGGACTAG 


CAACCATCGC 


TATTTTCTCA 


ACAATGGTTT 


1740 


TGGTAACCAT 






TCAATTCCGC 


AGAAAGCTTT 


AAAAAAGTTC 


1800 


TAAATCCTCA 


TGATTTTGGG 


GTTTCAGGGC 


AAAATGTTGA 


AAAAGAAGAT 


TTGGACAAAC 


1860 


TCTTGAGCCA 


GTTTGCAAGT 


GACAAAGCTT 


ATAGTGTCAA 


AGAGAAAGAA 


GTACTTCGTT 


1920 


ACAGTAACTT 


TGGTATTGCA 


AATCAAGAAG 


GAACCAAGTT 


AACTATTTTT 


GAAAAAGGAC 


1980 


AAAACCGTGT 


CCAACCCACA 


ACAGTTTTCA 


TGGTATTTGA 


CCAAAAAGAT 


TATGAAAATA 


2040 


TGACTGGTCA 


AAAACTGTCT 


CTATCAGGAA 


ATGAGGTCGG 


TCTCTTTGCC 


AAAAATGACG 


2100 


GACTGAAAGG 


ACAGAAAGCT 


CTAACTCTAA 


ATGATCATCA 


ATTTTCTGTC 


AAAGAAGAAT 


2X60 


TTAATAAAGA 


TTTCATTGTG 


AACCATGTTC 


CAAATAAGTT 


TAATATCTTG 


ACTACTGATT 


2220 
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ACAATTACCT 


TGTTGTTCCT 


GATTTACAAG 


ccrrnrGGA 


TCAATTCCCA 


GATTCGGCTA 


2280 


TCTATAATCA 


GTTTTACGGT 


GGTATGAATG 


TAAATGTCAG 


TGAAGAAGAA 


CAACTCAAGG 


2340 


TCGCTGAGGA 


GTATGAAAAC 


TACCTCAATC 


AATTTAATGC 


TCAATTAGAC 


ACAGAAGGTA 


2400 


GCTATGTTTA 


TGGTAGCAAT 


CTAGCAGATG 


CTAGTTCTCA 


GATGAGTGCC 


CTCTTTGGTG 


2460 


GTGTCTTCTT 


TATCGGTATT 


TTCCTATCCA 


TTATCTTTAT 


GGTCGGAACT 


GTTCTGGTCA 


2520 


TCTACTACAA 


ACAAATTTCT 


GAAGGCTACG 


AAGACCGTGA 


ACGCTTTATT 


ATCTTGCAGA 


2580 


AAGTCGGTTT 


GGACCAAAAG 


CAAATCAAGC 


AAACCATCAA 


CAAACAGGTT 


TTAACTGTTT 


2640 


TCTTCCTTCC 


TTTGCTCTTT 


GCCTTCATAC 


ATCTCGCCTT 


TGCCTACCAT 


ATGCTTAGCC 


2700 


TGATTTTAAA 


AGTGATTGGT 


GTACTGGATA 


CGACTATGAT 


GTTGATTGTG 


ACCTTGTCTA 


2760 


TCTGCGCTAT 


CTTCCTCATC 


GCCTATGTGC 


TGATTTTCAT 


GATTACTTCA 


AGAAGTTATC 


2620 


GCAAGATTGT 


GCAAATGTAA 


AAAAGATACC 


TCGACTTCAA 


AATCGAGGTA 


TTTCTTGTAT 


2880 


TCTAAATGCT 


GAAAACTTGT 


CCGAGCAGGA 


AGGTAACTCC 


CATGCTCAAG 


AGACCAATAG 


2940 


CAAGGTTCCG 


AATCATAGCT 


GTTTTGGTTG 


GGGCTTTTCC 


AAGTCTAGCA 


CTTGTGTAAC 


3000 


CAGTGAGAAG 


AACGGCCACA 


CCGACAATAA 


GGACGGTAGC 


AGGGATGCGG 


TAATCACTTG 


3060 


GAAAAATGGT 


CACTGACAGC 


ATTGGAGGCA 


AACTTCTAAG 


GAAAAAGGCA 


ACGAAGCTAG 


3120 


AAATGGCAGC 


GTGCCAAGGA 


TTGGTAAATT 


CTTCATACTC 


AATCCCATAT 


TTTTCCTCTA 


3180 


CCAGAGCCTT 


GAGTGGATTT 


TTAAGAAAGA 


TCTTATTGGT 


CAAGAGTTGG 


GCAGAAGTTT 


3240 


TGAATTCTCC 


ATTTTCGATA 


TAAGCAGCAT 


AGAGGGATTT 


TTTGCCTAGT 


TCCCTATCTT 


3300 


GGTCTAGCAA 


GAGTTTTTCT 


CGCGAAACGG 


CAGCTTCCTC 


GGTATCTTTT 


GGAGTTGAAA 


3360 


CGGA7ACATA 


TTCTCCACCA 


GCCATTGAAA 


AGGCACCAGC 


TAAGA7AGCC 


GTAAAACCTG 


3420 


ATAAAAAGAT 


AATCCAGATA 


TTGGTCGTGG 


CACTGGCAAC 


TCCGATAACC 


ACACCAGCAA 


3480 


TGGAAATAAT 


TCCATCGTTA 


GCATCAAGAA 


CACCCGCACG 


CAGGATATTT 


AAACGACCTG 


3540 


CAAAATTTGA 


ATCAATTTCG 


TGATTTGTTT 


CTGACGCTAA 


ATTTCAAGTT 


CAAGTTAGCC 


3600 


ATCAAGAAGT 


CTTCTCTGGG 


TGACTTGTAG 


TCCAAGCATT 


TTTTAGGATA 


GTTGTTAATC 


3660 


CACTTTTCGA 


TGAATGCGAC 


TTCTTTGGGA 


GTCATTTTCT 


TGGTTCCCTT 


AGGTAACCAT 


3720 


CTACGAATGA 


GCCTGTTGTG 


ATTCTCATTA 


GTTCCCCTTT 


CCCAAGAGGC 


ATAGGGATGT 


3780 


GCATAATAAA 


TGTGCTCCTC 


AGAAAATACA 


TTAGACAAGC 


GATTGAATTC 


CGTTCCATTA 


3840 


TCTGCCGTGA 


TGGAAAGAAT 


CTTGTGTTGT 


TTTAAGATGA 


GTTTTAGAGC 


CTGATTGACC 


3900 


ACATCAGCAC 


TTTTATTTGG 


AATCAATCGG 


ATGATCTGAT 


GTCTACTTTT 


TCGATCCGTC 


3960 
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AAGACAAGCA 


AGCAGTAGTT 


TTTCGCTCTC 


GTAAGTAGAA 


CTGTATCAAT 


CTCATAATGC 


4020 


CCATTCTCCA 


AGCGAAAATT 


GATAGCTTCA 


AGCCGCTGTT 


CGATGGATTG 


ACCAGCAGGT 


4080 


TTAAAGTTGG 


TGCTGGCCTG 


TTTCTTAAGC 


GCTTTTCCTT 


TTCTAGGGTA 


AACCAGATCC 


4140 


TGTTTGCTTA 


ACCCCAATTT 


TCCATGATGA 


ATCCAATAGT 


AAATGGTTGA 


AATTCCCACG 


4200 


TTAACCCCTT 


TAGCCATCAC 


CATCATTTCA 


GGCGAAAATT 


TTTGGTTATG 


ATAGTGGAGA 


4260 


ATCTTTTCCT 


TTAGTTCCTT 


GGTCAAGCTT 


GATTTCTTGA 


CCGAGCGCTT 


GCGATTGTTT 


4320 


TCATAAGACT 


GTTGAGCATA 


GTCGGCAGAA 


TAAACCTCTT 


TGAAGCGCCC 


TTTTCCAAGA 


4 380 


CATTGTCGGA 


CTGTCCCACG 


CTTGATTTCA 


GTGTGGATAG 


TTTGAGGAAC 


TTTTCCAAGC 


4440 


AGAGAGGCAA 


TTTCTCTATT 


TGATTTCCCT 


TCTTTTTTCC 


ATCTTTCGAT 


TAAGCGACGG 


4500 


CTATCGATTG 


TCAAATGTTC 


GCCTTTTGTA 


GTATAATGGT 


TTTGCATCTC 


TGTGCCTTTC 


4560 


TTGTGTTTGT 


GGTTGAACAA 


CAAGTATAAC 


ACAGAGGTGT 


TTTCTTATGC 


CTACAAGAGC 


4620 


TATCGGCTAG 


TTGAACCATC 


TAATTTTTAG 


GAGGGCTGGC 


TGGCTAACTT 


CATTATAGAA 


4680 


CTTTCATTTA 


CGAACATATA 


GTAAAATGAA 


ACAAGAACAG 


AACAAATCGA 


TCAGGACAGT 


4740 


AAAATCTATT 


TCTAACAATC 


TTTTAGAAGC 


AGAGGTGTAC 


TATTCTAGTT 


TCAATCTATT 


4800 


ATATTTTTGT 


TTTTTATCAA 


AAAATACTTT 


ACAAGTTCTT 


AAAAACATGA 


TATAGTAATA 


4860 


AAGCTTAGAA 


AATGAGATGA 


TGTTTTCTAG 


CAAATATAAA 


CCCGAGTAAA 


AAATGCCTAC 


4920 


GGACAGGCAG 


GGTTGAATGC 


CGAAGCGTGG 


TTGAAAAGCC 


ACATTATTGA 


TAGGGTTAAA 


4980 


AGCCTACTTT 


TATAAGTTGA 


TCTTAGGACA 


CTTGTCCTAA 


TTCATAAATT 


TTTAGTGTGG 


5040 


TGAAAGCACA 


CGTCATCTTG 


TGAAACGATC 


AATAAAGTAC 


CTAATATTTG 


CTACTAGAGA 


5100 


GTTAGGAAAC 


ATCGGGAACA 


GACATACTCA 


ACAGAAACCA 


AAATAAACAC 


GTCAGAAGAT 


5160 


TGCAGAGCAG 


GTGAAAACCT 


GCTCTTTTTT 


CATGAGTCAA 


CCTTTACTTC 


CTTAGTTTTC 


5220 


ATAAGCTCCT 


AAAAATATTG 


AAAGGAGTAT 


GTTTTGAAAG 


AGTTAGATCA 


AAACCAAGCC 


5260 


CCAATTTATG 


AGGCCTTGGT 






1 IO 1 I 1 1 






GGTCACAAGC 


GTGGACGGGG 


AAATCCAGAA 


CTTGTCGAAC 


TCTTAGGAGA 


AAAATGTGTA 


5400 


GGCATTGATG 


TCAATTCGAT 


GAAACCTTTG 


GATAATTTAG 


GCCATCCTAT 


TTCGATTATT 


54 60 


CGTGATGCAG 


ACGAGCTGCC 


TGCAGATGCT 


TTTGGAGCTA 


GCCATCCCTT 


TCTAATGATT 


5520 


GGTGGAACAA 


CTTCATCGGT 


GCAGACTATG 


ATTCTGGCAA 


CCTGCAAGGC 


AGGAGATAAG 


5580 


ATTATTCTGC 


CACGAAATGT 


CCATAAATCT 


GCTATCAATG 


CGTTGGTTCT 


ATGTGGTGCC 


5640 


ATTCCCATCT 


ATATCGAGAT 


GAGTGTAGAT 


CCTAAGATTG 


GTATCGCTTT 


AGGTCTTGAA 


5700 


AATGACCGAG 


TAGCACAGGC 


CATAAAGGAC 


CATCCAGATG 


CTAAGGCTAT 


CCTAATCAAC 


5760 
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AATCCTACTT ACTACGGCAT CTGTTCAGAC CTAAAGGGGT TGACAGAAAT GGCTCATGAA 5820 

GCTGGCATGA TGGTTTTAGT AGATGAAGCC CACGGAGCGC ATTTGCATTT CACTGATAAA 5880 

CTTCCAATTT CTGCTATGGA TGCAGGGGCT GATATGGCAG CAGTTTCCAT GCATAAGTCT 5940 

GGTGGGAGTT TGACCCAAAG CTCCATTTTA CTTATCGGGG AGCAGATGAA TTCTGAATAC 6000 

GTTCGTCAGA TAATTAACCT GACCCAGTCT ACATCTGCCT CTTACTTGTT GATGGCTAGT 6060 

TTGGATATTT CACGTCGCAA CTTGGCCCTT CGTGGTAAAG ACTCGTTTGA GAAAGTCATT 6120 

GAGCTATCTG AGTATGCCCG CCGTGAAATC AATGCTATCG GTGGCTACTA TCCCTACTCA 6180 

AAAGAGTTAA TAGACGGTGT TTCGGTTTGC GATTTTGACG TAACTAAGCT GTCAGTTTAC 6240 

ACTCAGGGTA TTGGCTTAAC AGGTATCGAG GTTTATGACC TCTTGCGAGA CGAATACGAC 6300 

ATTCAGATCG AGTTTGGTGA TATCGGCAAT ATCTTGGCCT ATATTTCCAT CGGCCACCGC 6360 

ATCCAAGACA TCGAGCGCTT GGTTGGTGCT CTGGCTGATA TTAAGAGACT CTATTCAAGA 6420 

GATGGAAAAG ATTTGATAGC AGGAGAATAT ATTCAGCCCG AGTTAGTGCT GTCTCCGCAA 64 80 

GAAGCCTTCT ATTCAGAAAG AAAAAGTTTA ACTTTGGATG ATTCTGTTGG ACAGGTCTCT 6540 

GGAGAATTTG TTATGTGTTA CCCTCCAGGT ATTCCTATCT TGGCTCCTGG TGAACGCATT 6600 

ACACGAGAAA TTGTCGACTA TATCCAATTC GCCAACGAAC GTGGTTGCTC CCTCCAACGG 6660 

ACGCAAGATC CAGAGGTCAA TCATATCAAC GTTATTAAGA GAAAGACAAA CTATAAGAAA 6720 

AGTCAATAGT TTTATCTAAA CTATTTCTTA TTTCAATTTG ATGATTTGGC GATGATTTTA 6780 

GAGCACGGCA AAAAGCCCTT GAATTAGAAG CGGTCAATCG CTTAATTTCT ATCAGCTTAT 6840 

CAAATCCTGC CTCAAGCCTT TTCTGAGGAT TAGGGTAGCG TGTCAAGAGT TGGTAGGTAT 6900 

ATTCTGAATG CTTTCCAACG ATTTTATCCA ACTCAGGAAA GATGATATCA ACACAACGAC 6960 

TGTATTGTAC TTTCCAATCA GACTGTTTTT TCTTGAGACG ATGAATATGT CTAGCCAGTA 7020 
TTTTTAGTTC TACTTGCCGA TTATCGTGTT GAAATTGTTC ACGATTGGGG TCAGAAAGAA ' 7080 

GTTTAAGAGC GATGCCATGA GCGTCTTTCT TATCCGTTTT AGTTTTGCGA AGTGATAATG 7140 

ATTTGGCAAA TTTCTTGATG AGCAAAGGAT TGTAGGTGTA AACTTTATAT CCTTGTTCAT 7200 

GCAGGAAGTT CAGTAGATTA AAGGCATAAT GTCCGGTATT TTCAAGAGCG ATGAGACAGT 7260 

CTTGGTTGAG CTGTCGAAGA GACAGATCTA AGAGTTCAAA ACCACCTTTA TTATTTGAAA 7320 

AAGTGAGTGG TTTAAGAACA GTTTTTCCTG GAACATTCAA GGCTGTAACA TCGTCTTTAT 7380 

TTTTAGCGAC ATCAATGCCC ACATAAAGCA TGGGAGTATC TCCAGATATA GTATTTCAAG 7440 

TCTACTGGGT TATCCACGAA CTTTTTGCCT TGTTACCTTA GACGAGATAA AACGTCTATG 7500 
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CGTTATCAAA CTCATTACCA ATTGAAACAA AAAACTGTGG TTAGAGCCTT TCGGAAATCG 7560 

TCAAGCGATT GGAGGAAATG AACTAATCCA CACTGGCTTA TTCCAAGTAT ACCACTTGGG 7620 

CTTTGGCAGT AGCTAACTGC GCTAAATATA ATATAAGGAG AAATAGATGG ATTTATGGTT 7680 

TTCTGAAGTT CATACTCCAG ATGTCAAATT GTCTCTGAGA ACAGCCAAGC AACTTTACGC 7740 

TGGAAAAAGT GAATGGCAGG ATATCGAAGT CTTGGATACC CCAGCTTTTG GGAAAATACT 7800 

GATTTTAAAT GGCCATGTCT TGTTCTCAGA TGCGGATGAT TTCGTCTACA ATGAAATGAC 7860 

CGTTCACGTT CCCATGGCTG TCCACCCAAA TCCAAAGAAA GTATTGGTTA TTGGGGGTGG 7920 

TGACGGCGGT GTTGCCCAAG TATTAACCCT CTATCCTGAA CTGGAGCAAA TTGATATTGT 7980 

GGAACCGGAT GAGATGTTGG TCGAGGTCTG TCGTGAGTAT TTCCCAGACT TTGCTGCAGG 8040 

GCTAGATGAT CCTCGTGTTA CCATTTACTA CCAAAATGGG CTACGCTTTT TGCGAAACTG 8100 

CGAAGATGAT TACGATATTA TCATCAACGA TGCGACAGAT CCATTTGGCC ATACGGAAGG 8160 

ACTCTTTACC AAGGAATTCT ACGGCAATAG TTATCGAGCT CTGAAGGAAG ACGGCATCAT 8220 

GATTTACCAG CATGGGAGTC CCTTCTTTGA CGAGGATGAG TCGGCCTGCC GAAGCATGCA 8280 

CCGCAAGGTC AATCAAGCCT TTCCAATCAG TCGGGTTTAT CAGGCCCATA TTCCAACTAG 8340 

CCCAGCTGGC TATTGGTTGT TTGGATTTGC ATCGAAAAAA TACCACCCTG TCAAAGATTT 8400 

TGACAAGGAA GGCTGGAAAA AACGCCAGCT TTTCACAGAA TACTACACTG CAAACTTACA 8460 

CGTGGGAGCC TTTATGTTGC CCAAGTATGT TGAGGACATT TTAGAAGAAG AGGAAGGAAA 8520 

AAAATGAGTC GTTTACTAGT TATTGGTTGT GGGGGCGTTG CCCAAGTTCC TATTTCAAAG 8580 

ATTTGTCAAG ATAGCGAAAC ATTTACAGAG ATTATGATTG CTACCCGTAC CAAGTCAAAA 8640 

TGCGATGACT TGAAAGCGAA GCTAGAAGGC AAAACAAGTA CTAAAATTGA AACTGCAGCA 8700 

CTTGATGCTG ACAAGGTTCA AGAAGTGATT GCCCTGATTG AAAGCTACAA ACCAGAAGCT 87 60 

GTTTTGAATG TAGCTCTGCC TTATCAAGAT TTAACCATTA TGGATGCTTG TTTGGCAACA 8820 

GGTGTTCACT ATATCGATAC AGCCAACTAC CAAGCAGAAG ACACAGAAGA CCCTGAGTGG 8880 

CGTCCTATCT ACGAAAAACG TTGTAAGGAA CTTGGTTTTA CAGCCTACTT TGACTACTCA 8940 

TGGCAGTGGG CTTATCAAGA GAAATTCAAA GAAGCAGGCT TGACTGCTCT TCTTGGTTCT 9000 

GGTTTTGACC CAGGTGTAAC TAGTGTCTTT TCAGCTTATG CCCTCAAACA CTATTTTGAT 9060 

GAAATCCATT ATATCGACAT TTTAGACTGT AATGGCGGTG ACCACGGTTA TCCATTTGCA 9120 

ACCAACTTTA ATCCAGAAAT TAATCTCCGT GAGGTTTCTG CGCCAGGTTC TTACTGGGAA 9180 

GATGGGAAAT GGGTCGAAGT CGAAGCTATG TCTATCAAGC GTGAGTATGA TTTCCCTCAA 9240 

GTTGGACAAA AAGATATGTA TCTCCTTCAC CATGAAGAAA TCGAATCATT GGCCAAGAAC 9300 
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ATTCCAGGTG 


TCAAACGCAT 


TCGTTTCTTT 


ATGACTTTTG 


GTCAATCTTA 


CTTGACGCAC 


9360 


ATGAAATGTC 


TTGAAAATGT 


TGGACTCCTT 


CGTACGGATA 


CCATTAACTT 


TAACGGCCAA 


9420 


GAAATTGTTC 


CAATTCAATT 


TTTGAAAGCC 


TTGCTTCCAG 


ATCCTGCCAG 


TCTTGGGCCA 


9480 


CCTACAGTCG 


GAAAAACCAA 


TATTGGATGT 


ATCTTTACAG 


GTGTCAAAGA 


CGGTGTCAAA 


9540 


AAGACTATCT 


ATATCTACAA 


TGTCTGCGAC 


CATCAGGAAT 


GTTACGCAGA 


GGTTGGTTCG 


9600 


CAAGCTATTT 


CTTATACGAC 


AGGAGTTCCA 


GCCATGATTG 


GGACAAAATT 


AGTCATGAAC 


9660 


GGAACTTGGA 


AACAAGCTGG 


AGTGTATAAC 


CTTGAGGAGT 


TAGATCCAGA 


TCCATTCATG 


9720 


GAAGCTTTGA 


ATGAGTATGG 


TTTGCCATGG 


GTTGTGGTTG 


AAAATCCACA 


AATGGTCGAC 


9780 


TAATGAACTT 


AGAACAAGTA 


CCAACACCAG 


CCTATGTTAT 


TGACTTGGCC 


AAGTTAGAAG 


9840 


CTAATTGCCG 


CATTCTACAA 


TATGTACAAG 


AAGACCCCCC 


TTCCAACGTC 


TTGCTTGCCC 


9900 


AGAAGGCATA 


TTCCCTCTAC 


AAAACTTATC 


CCTTGATTAG 


CCAGTATCTA 


TCAGGTACGA 


9960 


CAGCTAGTGG 


ACTCTATGAG 


GCCAAATTGG 


CAAGGGAAGA 


ATTTCCTGGT 


GAAGTCCATG 


10020 


TATTTGCGCC 


TGCTTTCAAG 


GATGCAGACT 


TGGAGGAATT 


GCTAGAGATA 


ATGGACCATA 


10080 


TAGTCTTTAA 


CTCAGAGAGA 


CAGTTGCGTA 


AACACGGTCC 


GCGTTGTCGA 


GAGGCTGGTG 


10140 


TCAGTGTTGG 


TTTGCGCCTC 


AACCCTCAGT 


GTTCAACTCA 


AGGcAGATCA 


CGCGCTCTAT 


10200 


GACCCTTGTG 


CACCAGGTTC 


TCGCTTTGGA 


GTTACTATAG 


ACAAGATTCC 


GAGTGATTTG 


10260 


CTAGATTTGG 


TTGACGGACT 


TCATTTTCAT 


ACCCTTTCCG 


AGCAGGGAGC 


AGATGATTTA 


10320 


CAAACAACTT 


TCAAAGCACT 


AGAAGAACAG 


TTTGGTCCCT 


ACTTACATGA 


GGTAAAATGG 


10380 


CTCAATATGG 


GTGGTGGTCA 


TCATATTACA 


AGAGAAGGTT 


ACGATGTGGA 


TTTGCTGATT 


10440 


TCAGAAATCA 


AGCGTATCCG 


AAAAACTTAC 


AATCTTGAAA 


TCTATATCGA 


GCCTGGTGAA 


10500 


GCCATTGCGC 


TTAATGCGGG 


TTATTTAGCA 


ACTGAGGTAT 


TAGATATTGT 


AGAAAACGGT 


10560 


ATGGAAATCT 


TGGTTTTAGA 


CGCCTCTGCG 


ACCTGCCATA 


TGCCTGATGT 


ACTTGAGATG 


10620 


CCCTATCGTC 


CACCTTTGAG 


AAATGGCTTT 


GAGTCACAGG 


AAAAAGCCCA 


TACCTACAGA 


10680 


CTTTCTTCTA 


ATACCTGTCT 


GACGGGCGAT 


GTGATTGGTG 


ATTATAGTTT 


TGAAAATCCA 


10740 


GTCCAAATCG 


GAG AC AG ACT 


TTATTTTCAA 


GACATGGCCA TTTATTCTTT 


TGTCAAAAAT 


10800 


AATACCTTTA 


ATGGTATTGG 


ATTGCCAAGT 


CTCTATCTCA 


TGGACGAACA 


GGGAGACTGT 


10860 


AGCTTACTCA 


AAGCTTTTCG 


CTATCAAGAC 


TTTAAAGGGA 


GATTATCATG 


ATGGACAGTC 


10920 


CAAAAAAATT 


AGGCTATCAC 


ATGCCAGCAG 


AGTACGAACC 


CCATCATGGT 


ACCCTCATGA 


10980 


TATGGCCGAC 


TCGACCAGGA 


TCATGGCCTT 


TTCAAGGAAA 


GGCTGCTAAA 


AGAGCATTTA 


11040 
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CTCAGATTAT 


CGAGACCATA 


GCAGAAGGGG 


AAAGAGTCTA 


TCTTTTGGTG 


GAGCAGGCCT 


11100 


ATCTATCTGA 


AGCCCAATCC 


TATCTTGGAG 


ACAAGGTTGT 


TTATTTAGAC 


ATTCCCACCA 


11160 


ATGATGCCTG 


GGCGCGTGAT 


ACTGGCCCAA 


CCATTCTCGT 


CAATGATAAA 


GGTAAGAAAT 


11220 


TAGCCGTGGA 


TTGGGCCTTC 


AATGCTTGGG 


GAGGCACCTA 


TGATGGTCTT 


TATCAAGATT 


11280 


ATGAAGAGGA 


TGACCAAGTA 


GCCAGTCGTT 


TTGCTGAGGC 


CTTGGAAAGG 


CCTGTCTATG 


11340 


ATGCTAAACC 


TTTTGTACTG 


GAAGGAGGCG 


CAATCCATAG 


CGATGGTCAA 


GGAACTATTC 


11400 


TCGTAACTGA 


AAGTTGCTTG 


CTTAGTCCTG 


GTCGCAATCC 


TAACTTGACT 


AAAGAGGAGA 


11460 


TTGAAAACAC 

m • W mm mm* mt^mt-mt 


ATTATTAGAA 


AGTCTTGGTG 


CTGAAAAAGT 


TATTTGGCTT 


CCTTATGGTA 


11520 


TTTATCAGGA 


TGAAACCAAT 


GAACACGTCG 


ATAATGTTGC 


TGCCTTTGTT 


GGTCCTGCTG 


11580 


AGCTTGTTTT 


GGCTTGGACA 


GATGACGAAA 


ATGATCCCCA 


GTATGCCATG 


TCAAAAGCAG 


11640 


ATCTCGAACT 


CTTAGAACAG 


GAAACAGATG 


CAAAAGGTTG 


TCACTTCACC 


ATTCATAAAT 


11700 


TGCCTATCCC 


TGCAGTTCGA 


CAAGTTGTGA 


CAGAAGAAGA 


TTTGCCAGGC 


TACATCTATG 


11760 


AAGAAGGAGA 


AGAAAAGCGA 


TACGCAGGTG 


AACGACTAGC 


AGCTTCCTAC 


GTAAACTTTT 


11820 


ATATCGCCAA 


CAAGGCTGTC 


TTGGTTCCAC 


AGTTTGAGGA 


TGTAAACGAC 


CAAGTGGCCT 


11880 


TAGATATCCT 


CAGCAAGTGT 


TTCCCAGACC 


GTAAAGTTGT 


CGGAATACCA 


GCCAGAGATA 


11940 


TTCTCTTAGG 


TGGTGGCAAT 


ATCCACTGTA 


TCACCCAACA 


AATTCCAGAA 


TAGGAGAAAA 


12000 


AGATGAGAAA 


TGTAAGAGTT 


GCAACCATTC 


AGATGCAATG 


CGCTAACGAT 


GTGGCAACAA 


12060 


ATATCCAAAC 


CGCAGAGCGT 


TTAGTACGTC 


AGGCTGCTGA 


GCAAGGAGCC 


CAAATTATTC 


12120 


TCTTGCCCGA 


GTTGTTTGAA 


CATCCCTATT 


TCTGTCAGGA 


ACGTCAGTAT 


GACTACTACC 


12180 


AGTATGCCCA 


ATCTGTAGCG 


GAAAATACTG 


CCATTCAGCA 


TTTTAAGGTG 


ATTGCTAAGG 


12240 


AACTACAAGT 


TGTTTTACCA 


ATCAGTTTCT 


ATGAAAAAGA 


TGGTAATGTC 


TTGTATAACT 


12300 


CTATTGCCGT 


CATTGATGCA 


GATGGGGAAG 


TGCTGGGCGT 


TTATCGAAAG 


ACCCATATAC 


12360 


CAGATGACCA 


mriH V 4Mtft % m wm tm m m & 

TTATTATCAA 


GAAAAATTCT 


All 1 V_i\l~ Vjk_t- 




GGTTTCAAGG 


12420 


TCTGGAATAC 


TCGCTATGCT 


AAGATTGGTA 


TCGGTATCTG 


TTGGGATCAA 


TGGTTCCCTG 


12480 


AAACAGCGCG 


CTGTCTTGCA 


TTGAATGGTG 


CTGAATTGCT 


CTTTTATCCT 


ACACCTATCG 


12540 


GTTCAGAGCC 


AATTTTGGAT 


ACAGATAGTT 


GTGGTCACTG 


GCAACGTACT 


ATGCAAGGGC 


12600 


ACGCAGCAGC 


GAATATTGTT 


CCAGTCATCG 


CAGCCAATCG 


TTATGGTTTA 


GAGGAGGTTA 


12660 


CTCCTAGTGA 


GGAAAATGGC 


GGACAGAGCT 


CCAGTCTTGA 


CTTCTACGGT 


TCCTCCTTTA 


12720 


TGACGGATGA 


AACAGGAGCT 


ATTCTAGAAC 


GAGCTGAAAG 


ACAAGAAGAA 


GCTGTTCTGT 


12780 


TAGCTACTTA 


TGACCTAGAC 


AAGGGAGCAA 


GTGAACGCCT 


AAACTGGGGC 


TTGTTTCGAG 


12840 
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ATAGAAGACC AGAAATGTAT AGACAAATTA CAGATTAGTG TGGGAGAAAT GAGAGATTCA 12900 

TTCTGCTAGA CTAACTTCTT ATTAGTAACT ATAAGATACT ATGGCATCTA GTAAATCGAT 12960 

TTTTATGATT CGCTATTCTT GTCTATTGAT TAGTCCGTAT TTTAAAATAT TAGCAAAAAA 13020 

GCAAATAGCA GTAACTTCTG TCTATTTGCT TTTCTTTTTT ATAGAATATA TTTCTCAATA 13080 

GCACGCGCAA CGCCGTCTTC TTCGTTGCTT GAGCTAACGG CATCCGCAAG AGATTTGATA 13140 

TAATCGCTCG CATTTCCCAT TGCAATCCCA ACCCCTGCAA ACTGGAGCAT TTCGATATCG 13200 

TTATTACCAT CGCCCATGGC CATAATCTCT GAGGAATCAA TCTTCAAAAT CTCAGCTAGT 13 2 60 

CGTGAAAGAG CAGTAGCCTT TGTCGTTCCA AGCGGCATTG CTTCATAAAT GACAGGCTGC 133 20 

GAACGAACTC CACTGAATCG TTGGCAAAGC TCTTCAGCAA AACGCTGCTC AAAATCGTCT 13380 

GTTTGTTCTT TTGTTCCTAA ACACATACCT TCGAACATCC GGAACTTTCC ACTACTCGCT 13440 

TCTTCAAGAG AAATTTCAGT CAGGTCTGAA AATACTAGTT TAGCATCATT TTCAATAACT 13S0O 

TGATTGGGCT TGTCACCGAG AACAAAATAA TGTGACTCGT CAAAAAGTGT CAACTGAACA 13560 

TCACTCTTTT CAGCAAGGTC ATAGAGGTAT TCGATGTCAG CTGGACTCAG TTCTTTCCAG 13620 

TCAACTAGAC TCCAATCACT GGTCTGGTGA GTTGAACAAC CGTTCTTAAC AATAATATAT 13680 

TCGTTCTGGA GGTCAAGCTC CACTTTTTTG TAGTACCGCA GGACACCGAA AAGGGGGCGA 13740 

CCCGTACAGA GAACCAGTTT GACACCTTTT TCAATGGCTT TGTGAATAGC AGTAATGTGT 13800 

GCTTGTGGGA TTTCCTTGGC TTCATTGAGG AGGGTGCCGT CCATATCCAA GGCTAGTAGT 13860 

TTAATCATAG GTCTTCCTCT TTATCTTTGC T ATT ATT ATA GCATATTTTG GAGAAGAAAT 13920 

TGATAGAAAG CTTGAGACTA ATTGATTTTA TAGTTTAACA TGTTTTGATG ACAATTCATG 13980 

ATTTGAAGAG GATATTTCGC AAAGATATGC TATACTATGT TTGTCAATGT TGCAACTAGA 14040 

CAAATTAAAA AACCAACTTA ATATAATAGT TTTTTTGTAA GTAGGTATGA CTAGCAGATT 14100 

ACTCAACTAA TCTGAAGAAT AATGGAGGAA ATATATCATG ATTTTAATGA CAAAAAATAT 14160 

AAATCTAACA AATGAACAAT TAGAGCTGAT ACAAGGTGGA GCAGATCCAT ATGGTAAAAA 14220 

TCCTAATGGT AGGTACGATT GGGAAATAGA ACCAGTATTA ACTCTGCTGG TTCATGGATT 14280 

TTGTCCCAGA GGCACCTATG ATTCAGGATA TATTCGAGGA GGTAATCATC TTTCCAAAGG 14 340 

AAGTGCTGCG AGATTTTAAG TAAAATTTAT TAGGAATATG AAGAAACAAG GGGAGAAAAC 14400 

AGAGGATTTA ATATGAAAAA ACGAGCTATT CAAATTTTAC TAGCATTGTC CTTAATTTTT 14460 

TACAAATCAA CTTGGTTTTG GAGGCTTTTC AATTATCTCG CAAAGCCCTA TCTACCAGCA 14520 

AGTCGTGAAT TTTTTCAGAT TCTGCTTTTG ATGGAGAGCG GAGTTCTTTT CTTAGCGGTC 14580 
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ATCTATCTAC TGGTTTTTGC AGGAAAGAAA ATTTTTCATT TCAAGTGGCA GCTGAGGTAC 14 640 

TTCATCTACC TTTTACTGGG CTACATCATT TCATATATGT CTGACTTCCT CTTTTCGTAT 14700 

TTCATATCCC TGTCTTCAAA TCAGATTTCT TTGAATGAAA CGGTAGAAAT GATGGGGAGA 14760 

CAGGAGTTCC CTTATGTCTT GCTCATCGTT TGCTTCATCG CCCCTATTGC TCAGGAATTG 14820 

ATTTATCGAG GtGTGCTTAT GACAACCTGT TGCAAAAACT CACCTTGGTA CG 14872 

(2) INFORMATION FOR SEQ ID MO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH : 10223 base pairs 
(3) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

CGTGCTATCG GTCTCAAAAC CAATCTGGTC GCTATGGTCA AATCCAGTTG GAAAATCCAT 60 

TCTTCTTGGA GCCATCTGCT GGATTGCCAT CATCCTCACC ACTCTTGGTA TGCAGACCCT 120 

TATCGGCATT TTCTAATACT CTTCGAAAAT CTCTTCAAAC CACGTCAACG TCGCCTTGCC 180 

GTAGGTATAT GTTACTGACT TCGTCAGTTC TATCTGCAAC CTCAAAACGG TGTTTGAGCT 240 

GACTTCGTCA GTTCTATCTG CAACCTCAAA ACGGTGTTTT GAGCTGACTT CGTCACTCGT 300 

ATCTACAACC TCAAAACAGT GTTTTGAGCT GACTTCGTCA GTTCTATCTG CAACCTCAAA 360 

ACAGTGTTTT GAGCAGCCCG TGGCTAGTTT CCTAGTTTGC TCTTTGATTT TCATTCAGTA 420 

TAACACAAAA GGTAGCCCAT CAGCTACCTT TTTCTTATGC TTCCTCAATC AAGCGAGTAT 480 

GTTCTCTCTT GATACAGCGA TTCATCACGA TATCATCACA TCCACCATCA CGCAAAATCT 540 

CTTTCGCTTC TAAACTTTCA AGTCCTACCT GTGCCCAAAA AATCTTGGCA TCAGCTTTGA 600 

GAAAATCACG CGCCACATCG GGCAGAAATT CACTGCGACG ATAAACATTG ACAATATCTA 660 

CAGGAAAAGG AATTTCAGCG AGGCTAGCAT AAGCCTTTTC ACCCAAGATT TCGCCACCTG 720 

CCCCCTTGGG ATTGACTGGG ATGATTTTAT AGCCCCGAGC CTGCATTTCC TTTGTTACTC 780 

GATTGCTGGT TGTTTCTTCA CGGTCAGACA AACCCACCAC AGCAAGGGTT TTACTCGTTG 840 

CGAGATACTG ACGAATCACG CCATCACTTG GATTGATAAA TTCTTGACTC ATAGAAATCC 900 

TCCTTTTTCA TCAGTATACC ACATTTTGAA AAGGTTTGCA GAATTATACT ACAAAAAAGG 960 

AGGACTAGCC CCCTTTTTAT TTAGCCTCGT ACCAGGTTGC CCCTTCATTC TCATCTGCGA 1020 

TAAGAGGAAC ACTGAGTTGA ATGGCTTCTT CCATGGTTTG TTTCACCAAT TTTTTCATCT 1080 

CTACCAATTC AGATTTAGGC ACTTCAAGGA CGATTTCATC GTGCACTTGT AACAGCATCT 114 0 
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TAGTCTGATA ACCACCTGCA ACCAAGGCTT TATCCAGCTG AATCATGGCA ATCTTGAGAA 
TATCTGCTGC CGAACCCTGG ATAGGTGAGT TGATAGCAGT TCGCTCCGCA AAACCACGAA 
TATTGAAGTT GCGCGAATTG ATATCTGGCA ACTCACGGCG ACGCTTAAAG AGGGTCTCTA 
CATAGCCCTT ATCACGCGCC TCCCGCACCA CTTCATCCAT GTAGTTTTTA ATACCTGGAA 
AACGTTCAAA GTAGGTATCA ATGTAGGCTT TGGCTTCCTT ACGACTAATT CCCAAATTAT 
TAGACAAGCC AAAGTCTGAA ATCCCATAAA CCACTCCAAA GTTAACTGCC TTGGCATTGC 
GACGGTCGTT TGCAGTCACA TCATCAGGAC GCTCAATGCC AAAGACCCGC ATGGCTGTCG 
AAGTATGGAT ATCTGCCCCC TCTTGGAAGG CCTTAATCAA GTGCTCATCC TTAGAAATAT 
GCGCCAAAAC GCGCAATTCA ATCTGTGAAT AGTCAGAGCT GAGTAGCACA CTATCCTCCC 
ACTCTGGCAC AAAAGCCTTC CGAATCAAGC GCCCCTGTTC CAATCGGGCA GGAATATTTT 
GCAAGTTTGG ATCCACACTA GACAAACGCC CGGTCTGGGT CAAATCCTGC ACATAGCGAG 
TATGAATCTT TCCATCAGCC AAAATCCAGT CCTGCAAGCC AATTACATAA GTAGATTGAA 
TCTTAGCAAT TTGACGGTAA TCCAGGATTT TCTTAACAAT CGGAGCAATA GGAGCGACAC 
CCTCTAAAAC ATCCACTGCT GTCGAATAAC CTGTCTTGCT TTTCTTAGTG TATTCTAGAG 
GAAGTCCCAA TTTCTCAAAG AGAAGCACGC CCAACTGCTT AGGCGAGTTG ACATTAAACT 
CCTCACCAGC CAGCTCGTAA ATCTCTTGAG TCAGTTTTTC AATGACAAGC TCATTTTCAG 
CCTGCATCTC AAGCAAGGTC TCTTTCTTGA CCATAATCCC AGCAATTTCC ATCTTGGCAA 
GGACAAAAGC CAGAGGTTGC TCCATATCAT AAAGAAGCTC TAATTGCCCA 7TTTCGCTGA 
GTTTTTCAAG TAAAATAGGC TCTGTTTCTA CCAAAACAGC AAGTTTACAA GCTAAGTGTT 
CCAAGAATTT CTCACGTTCA GGAATGGCCT TTTTAACACC CTTACCGTAG AAAGTTTCAT 
CATCAACCAA GTAAGTCTGA CCATAAAGAC TAGCGATGGT CGCAATTTCA TTGTCCTCCA 
CAGTCGAAAG GAGGTATTTA GCCAAACGGA TGTCAAAAGC AGGCGCCTGC AAATCCACAC 
CAAAACCTTG CAAAAGAACT TTAACCTTCT TAAAGTCATA AACTCTCAGA GATGTTTTTT 
CTAAGAAATC CTTGAAAATC GCGTCTTGCA ACAGCTCAAG CTTGTCTGTC GCATAGAGCT 
TATCCCCACA AGACCAGACA AATCCAACCA AATTATCCGT ATGGTAATTC TCACCAAAAA 
GCTCAAAGTG GAAGATAGAC TCTTCACTCA GCATATCTTG ACTGATTTGG TCAACAATAG 
TAAAATCCAA ACTCTCAGAC ACATCAGCTG ACGACACATT TAAAGCCTGC TTTAGCTGTT 
TGAAGCCCAT CTCATCGTAG AATTTCCCAA GATTTTCAAC ATCTGGACCA CTATAGACCA 
AGTCCTCTAA ACCAATCGCA ATCGGTGCCT TGGTATCAAT GGTCGCTAGT GTTTTAGACA 



1200 
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2040 
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2340 
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AAAAGGCCTG TTCCTTGTCA TTGATGAGAT TTTCCTTCAT CTTAGAAGTC TTCATTCCAT 2 940 

CAATATTTTC ATAAATCCCC TCAAGCGAAC CATGCTCCAG CAAGAGCTTA ATACCCGTCT 3000 

TTTCACCGAC TTTGGTCACC CCAGGGATAT TATCCGACTT ATCACCCATG AGCGCCTTGA 3060 

GATCGATAAA CTGAGCTGGT GTGAGGCCCA TTTCTTCCAT GAGGTAATCT GGCCTAAACC 3120 

CCTCAAACTC AGCCACACCT TTCTTGGAAA TTTCAACCAC CGTATGCTCA TCCGTCAGCT 3180 

GAATCAAATC CTTGTCCCCA CTGACAATAG T AATATC AAA ACCATCCTGC TCTGCTAGCT 3240 

TATCCAGCGT CCCAATGATG TCATCCGCCT CATACTGAGC CAGATCATAG TGACGAATCC 3300 

CCATATGATC CAGCAACTCA CGAATGAAAG GAAATTGCTC ACGAAACTCA TCAGGAGTCT 3360 

TGGCCCGACC ACCCTTATAG TCCGCATACA TCTCTGTCCG GAAGGTCGTC TTTCCCGCAT 3420 

CAAAAGCCAC CAAAATATGA CTCGGCTCAA CCCGCTCCAA TAAATGACTC AACATCAACT 3480 

GAAAACCATA AATCGCATTG GTATGCAAAC CAGCCACATT CTTAAAACGG TCCAACTGCT 3540 

GATACAGCGC AAAAAACGCC CGAAAAGCTA CAGAAGACCC ATCAATCAAT AATAATTTTT 3600 

TCTTATCCAT ACACCCATTA TAAACCAAAG AATCAAAAAA TACCATTGGG AAGACCTAGA 3660 

GCAAGTATTT TTCAAACTTT TTCCGAATAA ATAGATAGAG CCAGAGAATT TAGTAAACCT 3720 

AGATTTAAAA ATGTGCTATA ATATAGTATA TTGAATCTAT AATAGTACAC CTTGACTGCT 3780 

AAAATATTTC TATAAATTAA TTTGACTTTC CTGATAGAGT TATTCACATC TTATTTCAAC 3840 

TCACTATAGA AGCAGGAATA GGAGGATTCT CAGACATCCG GCCATCAGCC CAACTAATGA 3900 

TTTGATTGCT AAGAAAATAT TCAGCAATCC AGAAATCACT TGTCAATTTA TTCGCGATAT 3960 

GCTGGACTTC CCAGCAAAAA ATGTGACCAT TTTGGACGGA AGCGATATTC ACGTATTACT 4020 

CTCCATGCCT TACTCGGTGC AGCATTTTTA TACCAGTATA GACGTCTTGG CGGAGTTGGA 4080 

TAACGGTACT CAAGTAATTA TTGAGATTCA AGTCCATCAT CAGAATTTTT TCATCAATCA 4140 

CTTGTGGGCT TACCTGTGCA GTCAGGTTAA TCAAAATCTT GAAAAAATTC GTCAGCGAGA 4200 

AGGTGATACT CACTAGAGCT ACAAACACAT CGCTCCTGTT TACGCCATTG CTATCGTGGA 42 60 

TAGTAATTAT TTCTCAGATG ACCTGGCTTT TCATAGCTTT AGTATGCCCG AAGACACAAC 4320 

AGGTGAGGTA TTGGCGATTA CCAACAATGC ACAGGAAAAC CATCTGGTTA AGATGGCATT 43 80 

CTTGGAATTA AAAAATACAG ACAAACCAGC AAAGACAAGG TTCGCAAGCC ATGGTTGGAG 4440 

TTTTTCGGCA ACAAGCCCTT TACCCAGCAA CCGCAACGAG CCATTACCCA AGCAAATCAA 4500 

CTGCTGGACT ACAAGACCTG GTCCGAGGAG GACAGGAAAA TGTTTAGTCA ACTA CAT ATG 4560 

CGAGAAGAAC AAGTCTTGTT AGCACAGGAC TATGCCTTGG AAACTGCTAG GGCTGAAGGC 4620 

CTTGAACAAG GACTAGAGCG TGGGAAAGTT GAAGGAAGGG CAGAAAGGAA ACTTTTTGCC 4680 
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TTCCTAGACA TAGTACGCCA AGGTCTTCTG ACTTCTCAGG TTGCCAGCCA GCAATTAGGT 4740 

ATGTCAGTAT CTGAATTTGA GGCACTGTTG TAAAATCGCT CCATAATATC CATAGTGGGT 4 800 

AAATCCCCTA TGCATATTAT GGACCCTATT TTGTGTAGAA AAAAAGTCCC ATATGACCTA 4860 

TAATGAAAAG CGACAAAACA ACTCATTAGA AAGAATCATA TGGAACAATT ACATTTTATC 4920 

ACAAAATTAC TAGACATTAA AGACCCTAAT GTCCAGATTT TAAACATCAT CAATAAGGAT 4980 

ACACACAAGG AAATCATCGC CAAACTGGAC TACGACGCCC CATCTTGCCC TGAGTGCGGA 5040 

AACCAATTGA AGAAATATGA CTTTCAAAAA CCTTCTAAAA TTCCTTATCT TGAAACGACT 5100 

GGTATGCCTA CAAGAATTCT CCTTAGAAAG CGTCGATTCA AGTGCTATCA CTGTTCAAAA 5160 

ATGATGGTCG CTGAAACTTC TGATGACGTA CAGTCATATT TCTTCTCTTT TTATTATATC 5220 

ACAGTTTTAA ATCTAGCTTT ACTAGATTCA CCGCTACTAT CTATTTATTC GGAAAAAAGA 5280 

CGAAAAAACC TGAGAATCAT CTCAGGCTTG GTCATTAAAT TTTTTTCTCA ATATCGAAAA 5340 

GTGGAGAAAG TGGTCGTTTT TCATGAATAC GTACGATACC ATCCCCTAGG AGATGAGCGA 5400 

TTGAAATCTG CTCAATCTTA TCAATCAAAC GCTCTTCTGG CAGATAGATG CTATCCAAAA 5460 

CAACCAATTT CTTAATAGCT GATTTTTGGA TATTGTCCGT AGCAGGACCA GAAAGAACTG 5520 

GGTGCGTACA GCTTGCATAG ACTTCAACAG CACCAGCTTC CGCAAGAGCA TCTGCCGCAT 5580 

GACAAATCGT TCCAGCGGTA TCAATCATAT CATCAATCAA GATACAAGTC TTGCCTTCAA 5640 

CCTTACCGAT GATATTCATA ACTTCACTAG TATTCATCTT ATCAACGCTA CGACGTTTAT 5700 

CAATAATAGC CATAGATGTT TTCAAAAATT CTCCCAACTT ACGAGCACCA GTCACCCCTC 5760 

CATCGTCCGG GCTGACAACC ACATAGTCAG AACCAACCAT ACCACGACGC TCAAAATAAT 5820 

CTGCAATCAG AGGAGCACCC ATCAAATGAT CCACAGGAAT ATCAAAGAAT CCTTGAATTT 5880 

GCGCAGCATG CAAGTCGATG GTCAATAAAC CATCCACTCC AGCTACTTCA AGCATATTTC 5940 

CGACAAGTTT TGAAGTCATT GGCTCACGCG CTCTCGCCTT TCTATCCTGA CGTGCATACC 6000 

CATACTAAGG CATGACAACA TTGACAGATT CTGCACTCGC ACGCTTCAAA GCATCTACCA 6060 

TAATCAAAAT TTCAAGCAGA TTGTCATTTA CAGGCGAACT AGTTGATTGT AAGATAAAGA 6120 

CGTGTTTCCC ACGGATTGAT TCTTCAATGT TGACCTGAAT CTCTCCATCT GAAAATTGCC 6180 

GAACACTTGA TTTCCCCAAC TCTATCCCAA TCTCCTGCGC CACACGTTCT GCCAATTCTT 6240 

TATTAGAAGA AAGGGCAAAC AGCTTTAAAT CAGAAAAAGA CATGATTTCC TCCGGTATAT 6300 

ATGTATAACT TGTGCTTTTC ACAAGATTTT CCATCTACCA TTGTAGCGCT TTTTGCACTA 63 60 

TTTTTCAATC AAAAATAAAA GAAGGGCACC ATATTTGTAC CCTTGCATCA TTCTTTTGAA 6420 
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AAATATTCTA GGTCATCAAC TCATTGTGTT TCTCAACAAA GCAATAAGCA TGATAAAAAC 6480 

CATAGAGAGC AATAGCCGTA ACCACTGGAA TCGCTAAAGG CAACTCTGTT TCCAACTCCA 6540 

CAAAAGGAGA GTTAAACAAG AAGTGAGTTC CCAAGGCTAA ACCTAGAAAA ATAAGGCCCT 6600 

GTTTCTTGCC AACCTTCTG? CCTTTATAGG CTCTGTAAAG CAAGTAAACA CC7ACTACAG 6660 

CTAGACCTGA AAAAGTCCAG TGAGAGGCAA TTCCTGAGAT GATACGCTCT AAAATTCGCG 6720 

AAATAGTAAA GTCAAAGCCC TCTGGCAAAT CCGTACGAAT ATAACCAATA TCCTTAATCA 6780 

TTTGGAATCC CAAACCGGAA GCAATTCCAA GTAAAAACAA AGATTTTAAT TTTCGCACAG 6840 

GAATCAAAGC CAAAACAAAA ACAAGTGACA ATAATTTCAA GGGTTCTTCT ACCAAAGGAG 6900 

CCGCAATAGC ACTTTCAAAG GCATTTAAAA ATGGACTATC TGGGAAAAGA ACCCCCAGTA 6960 

AATCATGGAT ATAAGTATTA GCAAAACTAG ACAACCAGCC TGAAAGGAAC ATCCCTCCCA 7020 

ATAAAGACAG AATCAAAACC TTCTTTGGCA ATTCCCATTT TTCCCAATAC GGAAGAGAAA 7080 

ATAAAGAGCC GGAATCATGT AAAAGAGAGC TAGAAACATA GAAACTCCCA TTAGTCCATA 7140 

TTCCGCACCT GACCTCGAAC CGTCCGTATA GTAGATGGTT TCATACTGTA AACCAATACA 7200 

TAGCAATAAA ATAAAAATAA ATAAAATATT GCTTTTCTTC ATACACTTTC TTTCTAAATG 7260 

AAGTATTTAT AATTCTACGA CTGTCATACT TCCTGTATCA ACATTGTAAA TGGCACCAGA 7320 

GATAATGACA TCGTCTGGTA TTAGGGGAGA CTCGATAAGC AGTTGCATAT CCTCGCGTAC 7 380 

ACTCTCTTCT ATATCTTGGA AGGGCAAGAA GTCCTGGTCT GACACATCGA CACCCAATTC 7440 

TTCCTTCAAA TACTCCTCAA AAGGTTCATT TTCAAAGGTC TGACCACCAC AGTCTGTATG 7500 

ATGCAATACC ACAATTTCTC TTGTCCCCAT TTGTTGCTCC GAAATAACTA GAGAACGAAT 75 60 

CATATCCTCA GTCACTCGAC CACCTGCATT CCGCAAAATA TGAGCATCCC CAAGTGCCAA 7 620 

ACCTAGAGCT TGCGCAACGT GCAAACGTGA GTCCATACAG GTCACAATGC CTACTCTGGT 7 680 

TTTAGGTTTA AGTGGCAGAT TTAACTGCCC ATGTAGGGCA ACATAAGCCT GATTGGCTTG 7740 

CATAAACTGT TCAAAATACG ACACGATTCC CTCCTTGAAA ATTTGATAGT CAAAT ATTIC 7800 

TCCTATCTTA TCATTTTTAA GAGAATTTGT CACGGATTAT GCAAAGACCT TTTTCAAGAC 7860 

TTCCTGAATC GTTGTCACGC CAATGACCTG AATTTCCTTA GGCAGAGTGA TTCCTGTCAA 7920 

GGAATTCTTA GGTACATAAA TCTTAGTAAA GCCCAGTTTA GCAGCTTCGT TGATGCGTTG 7980 

CTCAATACGA TTCACGCGCC GAATCTCTCC TGTCAAGCCC AGTTCTCCGA CAAAACATTC 8040 

CTGAGGATTA GTTGGCTTGT CTTTGTAGCT CGAAGCAATA GCAACTGCAA CAGCCAAGTC 8100 

AATCGCAGGT TCATCCAATT TAACACCACC AGCAGATTTG AGATAGGCAT CCTGATTTTG 8160 

CAAGAGAAGC CCTGCCCGTT TTTCCAAAAC AGCCATAATC AAGCTAGCAC GGTTAAAATC 8220 



WO 98/18931 



PCT/US97/19588 



607 

AAGTCCTGTC GTAGTACGCT TGGCATTTCC AAACATGGTC GGTGTTACCA AAGCCTGAAC 8280 

CTCCGCCAAA ATCGGACGCG TCCCTTCCAT GGTTACAACG ATGGAGGAAC CAGTCGCCCC 8340 

ATCCAAACGC TCTTCTAGGA AAACTTGACT CGGATTGAGT ACCTCAACCA AGCCGCCCGA 8400 

CTGCATCTCA AAAATCCCAA TCTCATTAGT GGAACCAAAA CGATTTTTGA CCGCTCTCAA 8460 

AATACGAAAG GTGTGGTGAC GCTCCCCTTC AAAGTAAAGC ACCGTATCCA CCATATGCTC 8520 

CAACATACGA GGCCCAGCCA AGGTTCCTTC TTTGGTCACA TGACCTACGA TAAAGATGGC 8580 

AATGTTATTG GTCTTGGCCA ACTGCATGAG TTCAGCGGTC ACTTCACGCA CCTGAGAAAC 8640 

AGACCCCTGC ACCCCTGAAA TCTCAGGAGA CATGATGGTC TGGATGGAAT CAATAATGAG 8700 

AAAGTCTGGC TGGATACGCT CCACTTCTGC ACGAACACTC TGCATATTGG TCTCTGCATA 87 60 

GAGATAAAAC TCACTATCAA TATCACCTAA GCGCTCTGCA CGTAGTTTAA TCTGCTGGGC 8820 

AGACTCCTCC CCACTGACAT AGAGAACTGT CCCCACTTGC GACAACTGGG TTGAGACTTG 8880 

TAGGAGAAGA GTTGATTTCC CAATCCCAGG ATCCCCACCG ATAAGGACGA GACTTCCTGG 8940 

TACCACTCCG CCTCCAAGCA CACGCTTGAA TTCCTCCATC TCCGTCTTGG TTCGATTGAC 9000 

ATTGATGGAA GTCACCTCAG CTAGTTTCAT GGGCTTGCTT TTCTCACCTG TCAAGGACAC 9060 

ACGCGCATTC TTAACTTCGG CAACCTCAAC CTCTTCCACA AAAGAAGACC AAGACCCACA 9120 

GTTGGGGCAA CGTCCCAGAT ATTTAGGGCA ATTATACCCA CAATTTTGAC ATACAAATGT 9180 

CGCTTTTTTC TTTGCGATGA CAAACCTCTT TCTATATCTC TAACTCACAC TCAATCACTT 9240 

GGCAAAAATC AATCTTCTCA TTTGGCACAA ACTGGCCCAT GAGCATTCCA TGAGCAACAA 9300 

CTACCACAGT CTGATGTTCT CGATACTTAG ACATACATTC TAGAAACCGA GACTTCATTT 9360 

CCGTAGCTGT CTCATATTGA ATAGGACTAT TAGGAAGCAA CTCCCCCTTG TTTTCTAAAA 9420 

ACAGTCTTCT AGCTGTTTCA AAGTTTTCTA TTCCTGTTTT ATAGACCTGC CATTCATGTA 9480 

ATAAAGGCTC TACTCTTAAA GGAAGACCCG TAGCACAGAC CACATACCAA GCCGTTTCTA 9540 

AAGCTCTTGT GACTGCAGAA GATACGATTA TTTCAGCTGA CG AG ACT AAA GGATTTTTGC 9600 

TCAATTTCTG GACTTGCTGC CGTCCCATCT CAGACAAGGG TGCCAAATCT ATCCCAAATC 9660 

CTATATAAGA ACGCTCCTCT AACTCACGGT AATCTGGCTC CCCATGACGT ACAAAGATAA 9720 

TCTTCATTCT AGTGCCCTGT CGATCCAAAT CCACCAGTTC GAACGCCATC AGCTGCATCT 9780 

CCATCTGCAA TTAAGAAAGT AGCAAAAACA GCCTGGACAA TACGCTCCCC AACTTCAAGA 9840 

ACAACCTCTT GGTCTGTGAT ATTCTTCATC TGCGCAAAAA TATGCCCTTC ATTTCCAGGA 9900 

TTTCCATAAT AATCCCCATC AATCACTCCA ACTGACTTAA TTAAAACCAA GCCCTTCTTA 9960 
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CGAGGATTTG AACAACGATC ATAGAGGTAG AGAACCTCAG TCGGCTGCAT ATAAGCCTTA 10020 

ACCCCTGTCG GAACCAAGAC AATCTCTCCT GGCGCAACAA CTGTACGCAC AGCAACCTTT 10080 

AAGTCGTAAC CAGTCGCATG CGCTGTCTCA CGCTTGGGCA ATAAATTTTC ATCTGTAAAA 10140 

CTCGAAACCA ATTCAAAACC ACGAATTTTC ATAATTTTCT CTTTTCTATT ATCATTTATT 10200 

CTAGATTATT CTATACTTAT TTA 10223 
(2) INFORMATION FOR SEQ ID NO: 74: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16535 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

TGGTTCTGTC CTTATCGGCG CCTTGTCTTG CTTGCCATGG CTACACCAAC TATCTCATCC 60 

GACGAAAGTA CACCAACCAC TAACGAACCC AACAACAGAA ATACAACCAC CCTTGCCCAA 120 

CCTCTTACTG ATACAGCACC TGGCTCTGGT AAGAACGAAA GTGATATTTC TTCACCTGGA 180 

AATGCAAACG CTTCCCTAGA GAAAACAGAA GAAAAACCTG CTGCAAGCCC AGCCCATCCA 240 

GCACCACAAA CTGGACAAGA TCGTTCAAGT GAGCCAACTA CTTCTACTAG TCCAGTAACA 300 

ACTGAAACTA AGCCAGAAGA GCCCATCGAA GATAACTACT TCCGTATCCA TGTCAAAAAA 360 

CTTCCTGAAG AAAACAAGGA TGCTCAAGGA CTATGGACTT GGGACGATGT TGAAAAACCA 420 

TCTGAAAACT GGCCAAACGG AGCTTTGTCC TTCAAGGATG CCAAGAAAGA TGACTACGGC 480 

TATTACCTAG ATGTCAAATT AAAGGGAGAA CAAGCCAAGA AAATTAGCTT CCTCATCAAC 540 

AATACAGCTG GAAAAAATCT AACCGGCGAT AAATCTGTAG AAAAACTAGT TCCAAAAATG 600 

AACGAAGCTT GGTTAGACCA AGATTACAAG GTTTTCTCTT ACGAGCCACA GCCTGCAGGA 660 

ACTGTTCGCG TCAACTACTA CCGCACAGAT GGCAACTATG ACAAGAAATC TCTCTGGTAC 720 

TGGGGAGATG TGAAAAATCC AAGTAGCGCT CAATGGCCTG ACGGAACAGA CTTTACGGCT 780 

ACAGCCAAAT ATGGCCGCTA TATCGACATT CCTCTTAATG AAGCCGCAAG AGAATTTGGA 840 

TTTTTATTAC TAGATGAGAG CAAACAAGGA GACGACGTGA AAATCCGTAA AGAAAATTAT 900 

AAGTTCACAG ATTTGAAAAA TCATAGCCAA ATTTTCCTAA AAGACGATGA TGAATCGATT 960 

TACACAAATC CATACTATGT CCATGATATC CGTATGACAG GAGCCCAACA CGTAGGCACT 1020 

TCTAGCATTG AAAGTAGCTT TTCAACACTT GTCGGTGCTA AAAAAGAAGA TATCCTCAAA 1080 

CACTCCAACA TCACTAATCA CCTAGGAAAC AAGGTAACTA TTACCGATGT TGCAATCGAT 1140 
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CAAGCTGGTA 


AGAAAGTCAC 


CTACAGCGCA 


GATTTCTCTG 


ACACAAAACA 


TCCTTATACT 


1200 


GTTAGCTACA 


ATTCCGACCA 


ATTCACTACC 


AAAACAAGCT 


GGCGCCTGAA 


AGATGAGACA 


1260 


TACAGCTATG 


ATGGCAAACT 


GGGAGCTGAC 


CTAAAAGAAG 


AAGGAAAACA 


AGTTGATTTG 


1320 


ACCCTTTGGT 


CACCAAGTGC 


TGATAAGGTT 


TCTGTTGTTG 


TCTACGACAA 


GAATGACCCT 


1380 


GACAAAGTAG 


TTGGAACTGT 


CGCTCTTGAA 


AAAGGGGAAA 


GAGGAACTTG 


GAAACAAACT 


1440 


CTAGACAGCA 


CAAACAAACT 


CGGAATCACA 


GATTTCACTG 


GCTACTATTA 


TCAATACCAA 


1500 


ATCGAGCGTC 


AAGGTAAAAC 


TGTTCTTGCA 


CTCGATCCTT 


ACGCTAAATC 


TCTTGCTGCT 


1560 


TGGAATAGCG 


ACGATTCCAA 


GATTGACGAT 


GCCCATAAAG 


TGGCTAAAGC 


CGCCTTTGTA 


1620 


GATCCAGCTA 


AACTCGGACC 


TCAAGACTTG 


ACTTATGGTA 


AGATTCACAA 


TTTCAAGACT 


1680 


CGTGAAGACG 


CCGTTATCTA 


CGAAGCTCAT 


GTGCGTGATT 


TCACTTCAGA 


TCCTGCCATT 


1740 


GCAAAAGACT 


TGACCAAACC 


ATTTGGGACT 


TTTGAAGCCT 


TCATTGAAAA 


ACTAGACTAT 


1800 


CTCAAAGACT 


TGGGTGTAAC 


CCATATCCAG 


CTCCTTCCAG 


TCTTGTCTTA 


CTACTTTGTC 


1860 


AATGAATTGA 


AAAACCATGA 


ACGCTTGTCT 


GACTACGCTT 


CAAGCAACAG 


CAACTACAAC 


1920 


TGGGGATATG 


ACCCTCAAAA 


CTACTTCTCC 


TTGACTGCTA 


TGTACTCAAG 


CGATCCTAAG 


1980 


AATCCAGAAA 


AACGAATCGC 


AGAATTTAAA 


AACCTCATCA 


ACGAAATCCA 


CAAACGTGCT 


2040 


ATGGGAGCTA 


TCCTAGATCT 


CCTTTATAAC 


CACACAGCCA 


AAGTCGATCT 


CTTTGAAGAT 


2100 


TTGGAACCAA 


ACTACTACCA 


CTTTATGGAT 


GCCGATGGCA 


CACCTCGAAC 


TAGCTTTGGT 


2160 


GGTGGACGCT 


TGGGGACAAC 


CCACCATATG 


ACCAAACGCC 


TCCTAATTGA 


CTCTATCAAA 


2220 


TACCTAGTTG 


ATACCTACAA 


AGTGGATGGC 


TTCCGTTTCG 


ATATGATGGG 


ACACCATGAC 


2280 


GCCCCTTCTA 


TCGAAGAAGC 


TTACAAGGCT 


GCACGCGCCC 


TCAATCCAAA 


CCTCATCATG 


2340 


CTTGGTGAAG 


GTTGGAGAAC 


CTATGCCGGT 


GATGAAAACA 


TGCCTACTAA 


AGCTGCTGAC 


2400 


CAAGATTGGA 


TGAAACATAC 


CGATACTGTC 


GCTGTCTTTT 


CAGATGACAT 


CCGTAACAAC 


2460 


CTCAAATCTG 


GTTATCCAAA CGAAGGTCAA CCTGCCTTTA TCACAGGTGG CAAGCGTGAT 


2520 


GTCAACACCA 


TCTTTAAAAA 


TCTCATTGCT 


CAACCAACTA 


ACTTTGAACC 


TGACAGCCCT 


2580 


GGAGATGTCA 


TCCAATACAT 


CGCAGCCCAT 


GATAACTTGA 


CCCTCTTTGA 


CATCATTGCC 


2640 


CAGTCTATCA 


AAAAAGACCC 


AAGCAAGGCT 


GAGAACTATG 


CTGAAATCCA 


CCGTCGTTTA 


2700 


CGACTTGGAA 


ATCTCATGGT 


CTTGACAGCT 


CAAGGAACTC 


CATTTATCCA 


CTCCGGTCAG 


2760 


GAATATGGAC 


GTACTAAACA 


ATTCCGTGAC 


CCAGCCTACA 


AGACTCCAGT 


AGCAGAGGAT 


2820 


AAGGTTCCAA 


ACAAATCTCA 


CTTGTTGCGT 


GATAAGGACG 


GCAACCCATT 


TGACTATCCT 


2880 
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TACTTCATCC ATGACTCTTA CGATTCTAGT GATGCAGTCA ACAAGTTTCA CTGGACTAAG 2 940 

GCTACAGATG GTAAAGCTTA TCCTGAAAAT GTCAAGAGCC GTGACTATAT GAAAGGTTTG 3000 

ATTGCCCTTC GTCAATCTAC AGATGCCTTC CGACTTAAGA GTCTTCAAGA TATCAAAGAC 3060 

CGTGTCCACC TCATCACTGT CCCAGGCCAA AATGGTGTGG AAAAAGAGGA TGTAGTGATT 3120 

GGCTACCAAA TCACTGCTCC AAACGGCGAT ATCTACGCAG TCTTTGTCAA TGCGGATGAA 3180 

AAAGCTCGCG AATTTAATTT GGGAACTGCC TTTGCACATC TAAGAAATGC GG AAGTTTTG 3240 

GCAGATGAAA ACCAAGCAGG ACCAGTCGGA ATTGCCAACC CGAAAGGACT TGAATGGACT 3 300 

GAAAAAGGCT TGAAATTGAA TGCCCTTACA GCTACTGTTC TTCGAGTCTC TCAAAATGGA 3360 

ACTAGCCATG AGTCAACTGC AGAAGAGAAA CCAGACTCAA CCCCTTCCAA GCCTGAACAT 3420 

CAAAATGAAG CTTCTCACCC TGCACATCAA GACCCAGCTC CAGAAGCTAG ACCTGATTCT 3480 

ACTAAACCAG ATGCCAAAGT AGCTGATGCG GAAAATAAAC CTAGCCAAGC TACAGCTGAT 3540 

TCACAAGCTG AACAACCAGC ACAAGAAGCA CAAGCATCAT CTGTAAAAGA AGCGGTTCGA 3600 

AACGAATCGG TAGAAAACTC TAGCAAGGAA AATATACCTG CAACCCCAGA TAAACAAGCT 3660 

GAACTTCCAA ATACAGGAAT CAAAAACGAA AACAAACTCC TATTTGCAGG AATCAGCCTC 3720 

CTTCCGCTCC TTGGTCTCGG TTTCTTACTA AAAAATAAAA AAGAGAACTA AACTAGCCCT 3780 

CCTATAGAAA AATCCCCCAA GCATTATAGC TCGGGGGATT AATTTTTGTA CAATATTTGT 3 840 

TGTCCTAATA AACTTGATTA GGATTTTTTA TTAAGCCTCT TTCATAGCAA AATAACCTCG 3900 

TACTTTGGGT GCAACTTGTG TTCCGAAGAG TTCAATAGCT CTCAGAACCT GGTCATGAGG 3960 

CATAGAACCA AGCGGTACAT GAAGCATGAA GCGGTCCAAT CCTAAATCCT CTATCATGCG 4020 

AATCAATTTT TCGGCCACCT CATCTCGATT GCCAACAAAC ATGGCGCCAT TTGGCCCTAC 4080 

CTGCTCCAAA TATTGCTCAT AACGCAATTC CTGCCAGTGC GGACGGTCTT TGGAAATACC 4140 

ATCCACCACT TGCTTAGTCG GATGGAAATA ATCTTTCACC GCCTGCTCAC CATCTTCCGC 4 200 

AATCCACCCC CAAGAATGGG CTCCCACTTT CAAGTCTTTG TCAGCATGGC CCCTTCGCTT 4260 

CCAATCTCAC GATAAGCCTG AATCAACTTT TTAAAATAAC GTGGATTACC ACCAATAATA -1320 

GCATATACAA TCGGTAGACC AGCCTGAGCA ATCTTCACTG TTGATTCGAC ATGACCACCT 4 380 

GTAGCTATCC ACAAGGGCAA TTTGTCCTGA ACTGGACGAG GATAAACTTC TTTACCAGCA 4440 

ATCGTTTGAG TCAATCGACC TTGCCAGTCT AACTTGGTCT TTTCATTGAC TAACTGAAGC 4500 

AAGTCTAATT TCTCATCAAA AACAGAGTCG TAGTCTTTCA AGTCATAACC AAACAGAGGG 4560 

AAAGATTCCG TGAAAGAGCC CCTTCCAGCC ATAATCTCCG ATCGTCCATT TGACAAAGCA 4 620 

TCGATAGTGG CATACTGTTG GAACAAACGA ATCGGGTCCA TGCTTGACAG AATCCTGACT 4680 
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GCACTGGTCA AACGGATTTT CTTGGTATTG ACTGCCCCAG CGGCCAGAAC AATCTCTGGG 474 0 

GCTCATACTG CAAAATCCGC CCGATGGTGC TCACCAATCC CATATACATC CAAACCAACC 4800 

TTGTCAGCCA GCTCAATCTC TGCCACCAAC TGGCGAATGC GTTCAGCATG ACTGTAAGTT 4860 

TGTCCAGTCC CTTCAAGCTC CGTTATTTCC CCAAATGTTG AAATTCCCAA TTCTACCATT 4920 

GTGATTCTCC TTATCTATCT CTGTACTTCA ATTTGAAAAA TTATTCTAAC ACGAATCTTG 4 980 

ACT ACAAGCA ACCGATTTGC TCATTAGAAA AAGCCTAGAT AACTAGACTT TTTTAGCTTA 5040 

TTCTACCGTT ACTGACTTGG CAAGGTTACG TGGTTTGTCC ACATCGAGGC CACGGTGGAG 5100 

GGTTGCAAAG TAAGCGACTA ATTGCGTTGG TACGACCATT GAAATTGGTG AGAGGTATGG 5160 

ATGTACGGTC GTAAGGACGA TATCGTCGGT ATCTTTGGCT ACATTCTCTT CTGCGATAGT 5220 

GAGGACTTTG GCACCACGGG CTGCGACCTC TTGGATATTT CCACGAGTAT GATTGGCAAG 5280 

AACTGGATCT GACAAGAGAG CCAAAACAGG CGTTCCTTCT TCAATCAAGG CAATGGTTCC 5340 

GTGCTTGAGT TCTCCTGCAG CAAAGCCTTC ACACTGGATA TAAGAAATCT CTTTGAGTTT 5400 

GAGACTTGCT TCCATGGCTA CGTAGTAATC TTGACCACGT CCGATGTAAA AGGCGTTACG 5460 

AGTTGTTTCA AGAAGTTCAC GAACCTTGAC TTCAATGGTT TCTTTCTCTG AAAGAGTTGA 5520 

TTCGATAGAC TGAGCTACGA TTGACAATTC ATGAACCAGC TCAAAGGCTT GCGCTTTAGC 5580 

ATTACCATTT GCTTCTCCGA CTGCTTTTGC AAGGAAGGCA AGGGCTGCGA TTTGCGCTGT 5640 

ATAGGCTTTA GTTGATGCCA CGGCAATTTC AGGACCTGCG TGAAGGAGCA TGGTATAGTT 5700 

GGCTTCACCT GACAGGGTTG AACCTGGAAC GTTTGTCACT GTTAAGCTTG GAATTCCCAT 5760 

TTCATTAGCC TTGACCAAAA CTTGACGACT ATCCGCTGTT TCACCACATT GGCTCATAAA 5820 

GATGAAGAGT GGTTTCTTGC TGAGAAGTGG CATACCGTAG CCCCACTCAG ATGAGATTCC 5880 

AAGTTCAACT GGTGTATCTG TCAATTCTTC CAACATTTTC TTAGAAGCAA ATCCTGCATG 5940 

GTAAGATGTT CCAGCTGCAA GGATGTAGAT GCGGTCTGCG TCTTGAACAG CCTTAATGAT 6O0O 

ATCTGGGTCT ACGACAACTT GACCAGCCTC ATCTGTGTAG GCTTGGATGA GTTTCCGCAT 6060 

AACAGTTGGT TGCTCGTCAA TTTCCTTGAG CATGTAGTAA GGGTAAGTTC CCTTACCGAT 612 0 

ATCTGACAAG TCAAGTTCAG CAGTGTAGCT AGCACGCTCA CGACGATTTC CATCATAGTC 6180 

TTGAACTTCC ACACTATCAG CCTTGACGAT TACCAACTCT TGGTCATGGA TTTCCATGTA 6240 

TTGGTTAGTT TCACGAATCA TAGCCATGGC GTCTGAGCAG ACCATGTTAT AGCCTTCTCC 6300 

AAGACCAATC AAAAGTGGTG ATTTATTTTT AGCTACGTAG ATGACTTCAG GATCTTGTGA 6360 

GTCAACCAAG GCAAAGGCAT AAGAACCACG GATGATGTGA AGGGCTTTTT TGAAGGCTTC 6420 
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AAGAACTGAG AGCCCTTCTT CTTCCGCAAA TTTTCCAATC AAATGAACGG CTATTTCAGT 6480 

ATCTGTCTGC CCCTTGAAGT GGTGACCTGC' AAGGTATTCT TCCTTGATTT CAAGATAGTT 6540 

CTCAATCACC CCATTATGCA CCAAGACAAA ACGTTCCGTC TCAGAGCGGT GTGGGTGAGC 6600 

ATTGTCCTCA GTTGGTTTTC CGTGAGTAGC CCAACGAGTA TGTCCGATAC CAGTTGTTCC 6660 

CTCAACACCA GCTGTCTTGG CAGACAATTC TGCAATACGA CCAACCGCCT TCACCAAATG 6720 

GTTATCAGCA CCATCTAGGA CAAAAATTCC CGCAGAATCA TAGCCACGGT ATTCAAGCTT 6780 

TTCAAGCCCT TGAATCAAAA TATCAGTTGC ATTTGTGTTT CCAACAACAC CAACAATTCC 6840 

ACACATAGTA TATACGACAC AGGCAAGCTG TGCTTTCTCC TTAAAATTGG TATAGTCTAA 6900 

TTCATCTTTT ATAGAATCAG CAAAAACAGT ATATACTTGT TTCTTTCACT TGTCAAGAGT 6960 

AAAAATTGGT ATAGTTCAAA TTAAGCTCCT GTAAGCATAA AAACTCTGAC CGATTGGGAT 702 0 

AATCAGTCAG AGTCCTTTTT AAAATCCATT ATTATCGCTT AATTCTTTGA ACCAGTGGCC 7080 

TGATTTCTTC AGACGACGTT CTTGCGTTTC CAAGTCTAAT TCGACCAAAC CATAGCGATT 7140 

TTTATAGCTG TTGAGCCATG ACCAGCAGTC AATAAAGGTC CAAATCAAGT AGCCCTTACA 7200 

GTTGGCACCA TCTTCAATGG CACGGTGAAG TTCACGAAGA TGACCTTTTA CAAAGTCAAT 7260 

ACGGTAATCA TCTTGAATCA TTCCATCTTG ACGGAATTTT TCTTCCCCTT CAACACCCAT 7320 

ACCATTCTCA GTCAACATCC ACTCAATATT GCCATAATTT TCCTTGATAT TTTGGGCGAT 7380 

GTCATAAATC CCTTGCTCAT AAATCTCCCA ACCACGGTGA GAATTGATTT TACGTCCAGG 7440 

CATCACATAA GGCTCGTAAA AATGTTCTGG TAAGAGTGGA CTCTCTGGAT GCTTAGCAAA 7500 

TCGAGGAGCC ATAACACGCA AAGGTTGATA GTAGTTCACA CCAAGGAAGT CCACCGTATT 7560 

ATCACGAATG AGTTCCAACT CTTCCTCTGT AGCATCAGGT AAAAGACCGT GTTCATGCAA 7620 

GATTTCTACC AACTCCTGTG GATAAGTCCC CAAGACAGAT GGATCTAAGA AAGATTGGGC 7680 

CTGAAAAAGG GCCGCAATAC GAGCTGCCTT GACATCAGCA GGATGCTGGC TACGTGGATA 7740 

AGCCGGTGTC AAGTTGAGGA CAATCCCAAT CTTGGAATCA GGCAAAAGTT CATGGCAAGC 7800 

CTTAACAGCC CGGCTGCTGG CCAATTGTGT ATGATAGGCT ACCTTAACAG CTGCCTCTGC 7860 

ATCCACCTTA TGTGGATAAT GGGCATCATA AAAATAACCA AATTCTACAG GAACGATGGG 7920 

CTCGTTAAAG GTAATCCATT GATCCACTAA ATCTCCATAA GTCTCAAAAC AAAAACGAGC 7980 

ATAGTCTTCA TAGGCTGAGA CTGTCGCCTT ATTTTCCCAA CCATCACCAT CCTCTTGAAG 8040 

GGCAAAAGGT AAATCAAAAT GATAGAGATT GACTAACAGA CGAATTCCTT TAGCCTTAAT 8100 

AGCCTCAAAG ACCTTACGAT AAAAATCCAC ACCTTGAGTG TTGACTTTTC CACAGCCTTG 8160 

TGGAAAAATC CGTGACCACT GAATAGAAGT CCGAAAGGCT GTGTGACCAG TCTCTAACAA 8220 



WO 98/18931 



PCT/US97A9588 



613 



AAGCTCAATA 


TCCCGCTCCC 


AATTTTCATA 


AAAAGTCGAT 


GTCTTATCTG 


AACCAATCCC 


8280 


ATTATAGTAA 


CGATTTGGCT 


CCACTTGGAA 


CCAGTAATCC 


CAGAGATTGT 


CTCCCTTACC 


8340 


GTCACCAGCT 


ACACGTCCTT 


CTGTCTGCGG 


TCCAGAAGTA 


GAGGATCCCC 


AGACAAAATC 


8400 


CTTTGGAAAT 


CTTAGCATAC 


ATTTACCTCT 


TTATCTACTC 


ATTTCTCCCA 


TTATACAGAA 


8460 


AAAACAAGGT 


AAAAACTACT 


TACATTTTTT 


CCTTGTTTTT 


CTTCTGATTA 


TAGTTTTTAT 


8520 


TTCTTGCTTA 


GGATTTCAAG 


CGTTTCAAGC 


ACGTTATCTG 


CATGAACCTC 


AATGGTGTCA 


8580 


CCAGTTGCCT 


TGATCTTAAC 


TTCTACAATG 


CCATCGGCCG 


CTTTTTTACC 


AACAGTGATA 


8640 


CGGATTGGAA 


GACCAATCAA 


GTCACTATCG 


CTAAATTTAA 


CACCGACACG 


TTCGTTACGG 


8700 


TCATCTGTCA 


AGACTTCATA 


ACCAGCTCCC 


ATCAAGCTTG 


CTTCAAGTTT 


TTCTGTCAAG 


8760 


GCTTGCGCTT 


CTTCATCCTT 


GACATTGACA 


GTAATCAAAT 


GCACATCAAA 


TGGTGCCAAT 


8820 


TCTTTAGGGA 


AATTGATTCC 


CCAAGCGTAA 


CGGTATTCAC 


CTTTTGGCGT 


TTTGTTAACA 


8880 


AAGAGGCGAG 


CGTGTTGCTC 


CATCACTGCT 


GAAAGAAGAC 


GGCTGACACC 


GATACCGTAA 


8940 


CATCCCATGA 


TCATTGCCAC 


ACCACGACCA 


TTTTCATCCA 


AGACATCTCC 


TCCCATGCTT 


9000 


GCTGAATAGC 


GAGTTCCCAG 


TTTGAAAATA 


TGACCGATCT 


CAATACCACG 


CGCAAAGTTA 


9060 


AGGACACCTT 


GTCCATCTCG 


GGAAATTTCA 


CCCTCACGAA 


CTTCACGGAT 


ATCCACATAT 


9120 


TCTGCAGTAA 


AATCACGGCC 


TGGGTTCACA 


CCAGTCAAGT 


GGTAGTCATC 


TTCGTTAGCA 


9180 


CCGACAACTG 


CATTGCGAAC 


ATCTTGTACC 


TTACGATCTG 


CAATAATTTT 


AATATTCTCT 


9240 


GGCAAACCAA 


CTGGTCCAAG 


TGAACCAAAT 


CCTGCTTGAA 


CAACATTCGC 


CACTTCTTCT 


9300 


TCGCTAGCAA 


CGTCAAAGAA 


ATCTGCTCCC 


AAGTGATTTT 


TCAACTTGAC 


TTCGTTGAGT 


9360 


TGGTCATTTC 


CAACTAGAAG 


GGCTGCAACA 


AGCTCACCAT 


CTGCAATGTA 


GAAGAGGGTT 


9420 


TTAATCGTTT 


GTTCTTCTGG 


AACATTGAGG 


AAGGCTGCAA 


CTTCATCAAT 


TGATTTAACA 


9480 


TCTGGCGTTG 


CAACACGAGT 


AACTTCTTCT 


TCAGCGACAA 


CACGGTTGCT 


TGGTTTGTAC 


9540 


TCGTTTGTTG 


CCATTTCTAA 


GTTAGCTGCA 


TAGCTAGACT 


CACTTGAGTA 


AGCAATGGTA 


9600 


TCTTCACCAG 


AGACTATCCA 


TTTGAGCAAT 


TCTGCCTTGA 


TTTCTTCTTG 


CACTTCTGCA 


9660 


GGAATTTCGT 


CAAATGAGGC 


AACTGACTTG 


TCCAAGACAA 


CCCAGCCGTC 


AAGGTCTGTA 


9720 


CGAGCAGATG 


TAATGGCCAT 


AAATTCTTGG 


CTATCCTTAC 


CACCCATGGC 


TCCACCGTCA 


9780 


CCAATAATAG 


CCTTGAAGTC 


TAAACCACTA 


CGAGTGAAAA 


TACGCTCATA 


GGCTGCTTTG 


9840 


TACTCATCAT 


AAACACTATC 


CAAACTATCA 


TAGTTAGCGT 


GGAAACTATA 


AGCATCCTTC 


9900 


ATGATAAACT 


CACGTGTACG 


AAGAAGTCCA 


TTACGCGGGC 


GTTTTTCATC 


ACGATACTTG 


9960 
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GGCTGAATTT GATAAAGGTT GAGTGGCAAT TGCTTGTAAG ATTTAACAGA ATCACGGACA 10020 

ATAGCTGTAA AGGTTTCTTC GTGAGTTGGA CCTAAGATAA AGTCTGATTT TTCACGGTTT 10080 

TTTAGTTTGT AAAGGTCTTC ACCATAGGTT TCGTAACGAC CTGATTCACG CCACAATTCT 10140 

GCACTAAGAA GGGCTGGAGC CAACATCTCA ACAGCACCAA TCTTTTCGAA TTCTTCGCGC 10200 

ATGATGTTTT TAGCTTTTTC AATCACACGG TTGGCAAGTG GTAGATAAGA ATAAACACCT 10260 

GCTGAAACTT GGCGAACATA ACCAGCACGC AACATAAGAG CATGGCTGAT AACTTGAGCA 10320 

TCGCTTGGCA TTTCGCGAAG CGTTGGGATA GGCATTTTAC TTTCTTTCAT AATATTCCTC 10380 

GATTATCTAA AAAAGAGTCG CATAATGTCA TTCCAAGTCA CAGCAATCAT CAAGACAACC 10440 

ATGATGACCA CTCCGGCCAA GGTGACATAG GTTTCAAtTT CTTGTTTCAA TGGTTTGCGG 10500 

CGGATGGCTT CTAGGATATT GAGCACAATC TTACCACCAT CCAAGGCTGG AATCGGAATA 10560 

AGATTAAAAA TCCCAATATT GATGGAAATC ATTGCCAAGA AGTACAAGAT ATTTTCAATT 10620 

CCATTTTTAG CAGCATCACT ACTTGCCTTA AAGATAGCAA CAGGTCCACC CAACTTGTTC 10680 

AAATCTGGTT GGAAAATCAG ATTTTTCAGA GCTGAGAGAA TTCGGAGAGC TGAGTCAGCA 10740 

CCAGTTGTAA AACCACCTAC AAACATGGAT AGAAAATCTG ACTTAACCCC CGGTTGAACA 10900 

CCTAGAAGGT AACGACCTTG ACTATCTTTG GGTGTAACAG TGACTTGTTT GTCACTCCCC 10860 

TTTTCAGAAA TAGTCACATC CAAAGTCGGT GCCGTCTTAT CTTTGGTTTC TGTTTCCACA 10920 

GCTTGGATCA AGCTTTCCCA GTTGCTAACC TCATGTGAGC CAATCTTGGT AATTTGTGCC 10980 

ATTTCTGGTA CTCCTACCTT GGCCAACGCA CCTTGGGGCA TGATATGGAA CTCATTCGTA 11040 

TCAACATCTC TGACACCACC CTGCATAAAG ATTAAAACCC AAAAAACAAC CACACCTAAG 11100 

ATAAAATTGT TCATAGGACC TGCAAAATTG GTAATCAGTT TGCCCCACAT AGTCGCATTT 11160 

TGATATTGAA CATCTAAAGG TGCAATCCGA ACCTCAGTAC CATCTGCTTC CACAACCGTT 11220 

GCATCGTGAT CCACTGCAAA TGTTTTTTCT TCTTCCAGAA CCAATCCTTT GATAAAGAGC 11280 

TTGTCTTCAA AATCAAACTG GGTCACCTGC ATAGGGAGGG CTGTTTGATC CAATTTTT1 A 11340 

CCTGAGAGAT TGATGCGTTT AACCTTACCA TCATCAGCAA GTGTCAAACT AACAGGCCTT -.1400 

CCTGTCTTGA TTTCAGTTGT ATCATCACCC CAACCGGCCA TGCGGACATA GCCACCCAGA 114 60 

GGCAAGATTC GAATGGTATA GGCCGTTCCA TCCTTGCCAA TGTGAGCAAA AATTTTAGGT 11520 

CCCATACCGA TGGCAAATTC ACGTACTAAA ATCCCTGATT TCTTGGCAAA GTAGAAGTGA 11580 

CCGAACTCGT GCACCACTAC AATAATCCCG AAAACCAGAA TAAACGTTAA AATTCCGAGC 11640 

ATAGCGTTTC CTCCGTCTTT TGATTAAAAG AGTCCAAATA AGTGCATGAT TGGAAATACA 11700 

AGCAACATAC TATCGAAACG ATCCAAAACA CCACCATGTC CAGGGATAAA TTTCCCAGAA 11760 
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TCCTTAACAC CAAAATGACG TTTGATCGAA CTTTCTAGTA AATCACCAAA TTGTCCAGCA 11820 

ATGCTAAAGA AAATAGCAAA GACTGACATC TTGTAAATTC CATATGGAAG AGCAACTGTA 11880 

CTGTCAACTA TCATAAGGAT AATGGTTACT AAAATTGCTC CTAAAATACC ACCCAAGGCA 11940 

CCCTCAAGGG TTTTATTAGG CGATACCCTT GGTGCTAACT TTCGTTTCCC ATAGTTCATC 12000 

CCAACAAGAT AGGCACCACT GTCTGTCGCC CAGACGATAC ACAAGGCTAA GAGAGCCTTG 12060 

TCCAAACCTG CAACACGAGC ATCTAGTAAA GCATTAAATC CAAAGCCCAC GTAGAAGCTC 12120 

ATAGCAAGAG GGAAAACCGC ATCCTCAATC GTATAAGACT TGCTAAAAAC GGTCGTTCCT 12180 

AACATGATTG AAATCAAAAC ACTATAGGCA ACCACATTCC CATCAACTGG CAAAAAAGTC 12240 

AGGTAATTCT CCAAGGGAAT GGTCAATGCA AAGGTTGCAA AGAGGGTCAA GAGGCCCTCC 12 300 

ATCGTCATGG TCTCTAGACC TCTCATCTTC AAAAGTTCAT GCATGGCTAG CATGGCTATG 12360 

ATTCCGATTG CTATCTGAAG CAAGAGGCCC CCAATCATTA AAATTGGTAG GAAAATAGCC 12420 

AGGGCAATCC CTGCAAACAA GGTTCTTTTC TGTAAATCCT GGGTCATATT TCCTCCTAAA 12480 

CTCCTCCAAA TCGGCGATGA CGACGATTAT AGGCAAGAAT ACCTTCCTGC AAGGCCGCTT 12540 

CGTCAAAATC AGGCCATAAG GTGTCCGTAA AATAAAGCTC ACTATAGGCT CCCTGCCATG 12600 

GAAGGAAATT GCTCAAACGT AATTCTCCAC TAGTACGGAT AATCAAGTCT GGGTCTCGTA 12660 

AGTCCTTAGG CAAATGCTGA GTAAAGAGAT AGTTACCAAT CAATTCCTCT GTGATCTCAC 12720 

CTGGGTTGAT TTTGGCATCT AAAACATCCT GGGAAATCAA CTTAAGCGCC TGTGTAATCT 12780 

CAGCACGTCC ACCATAGTTA AGACCAAAAT TAAGAATCAA TCCTGTGTTG TTCTTAGTCA 12840 

ATTCCTCAGC CTTGGTTAAA GCTTCAAAGG TTTGCTTAGG CAGGCCGTCT GTCTCCCCAA 12900 

TCATTTGAAT CTTAACATTA TTCGCATGTA GTTCCGGGAC ATAATTATCA TAAAACTCTA 12960 

CTGGCAAGTT CATGATAAAC TTGACTTCCT GATCTGGACG GGTCCAGTTT TCCGTAGAAA 13020 

AAGCATAGAC CGTAATAACC TTGACGCCCA GTTTGTTGGC TCCCTTCGTC ACGGTTTGCA 13080 

ATGCTTCCAT GCCCGCCTTA TGTCCAAAAA CTCGCGGTTC CATACGTTTT TTAGCCCAAC 13140 

CCCCATTGCC ATCCATGATG ATGCCGATAT GAGCAGGAAC CTGTGTCGGA ACCTCTACTT 13200 

CCACAGCCTT ATCTTTCTTA AAAAATCCAA ACATGATCTT ATTCCTATTC AAAAATCTAT 132 60 

CGTTTCATTA TACCATATTT CCCCATTTTC TTCTATCACT AAGCTATTTA TTCTCAGGCA 13320 

CCAAGCCCAT TTTTCAAAAA AATAAGCCGC CTGATTGGGC GACTTTATTT TTATAGGGAG 13380 

ATTATTATGA AAAAGTTTTA GGAGTTTAAG TTAAGGTCTT CTTAACTTAT GAACTTAGTG 13440 

TACACTCCCT AGCTTAAAGT TTCCTTAAGT ATTTTTAAAA ATCAAATTTT TCCATTTCTC 13500 
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CTGCCAATTT TTCTTGGATA AACGTGTTTG ATAGAGTTCC ATTCGGTCTT CATTTTCTAA 13560 

GAAATGAGGA GTTGGACGAA CTTGAAAATT CAAAATATCC TCCAAACCAT AAGGTACATA 13620 

GAGTTCAAAA TCTAATTCTT CATTCAAGCG CAGTCCAACT GCCGTACACC GTTCTGGATA 13680 

CTTACTCATA GCATCACGAG AACTGGTATA GGAAGCAGTG TGAGGACTGT GCTGATGCAT 13740 

ATAGACCTGA TTTTTCAATT CCCACTGGTA CTGAGGAAAA TCCTCTCTCA GCTTTTTCTC 13800 

CAGTAATAAG GTTTCCTCAT AAGAAAAATC TGGATCAAAG AAAATCACAT CTATATCTGT 13860 

TTCATGATCA AAAGGGGATT TCTCTGACAA AAGATTCCAG ATGAAATTTC TGACAGAACC 13920 

TGCTGCCAAC CACGAGTCTT TCAAACCAAG GTCTCCGATC ATCGTCAGAA TGGCCATCAT 13980 

ATCTGGACTT TCTCTAAAAG CCTCTAAGAT TTCTTGCTTA TTTTTCACTG TATTCATAAC 14040 

CTAAGTGCTC ATATGCCTTA GCAGTCGCCA CCCGTCCAGA CCGTGTCCGC ATGATAAAAC 14100 

CTTTTTGAAT CAAGTAAGGC TCATACATGT CTTCAACTGT CTCACGCTCT TCGGCGATAT 14160 

TCACAGAAAG AGTTCCTAGA CCAACAGGTC CTCCACTGTA CATCTCAATC ATGGTGCGAA 14220 

GGATTTTTTG ATCCACATAG TCCAAACCTT CATGGTCAAC ATCCAGCATA GTCAAAGCCT 14 2 80 

TATCGGTAAT AACATCATCG ATAACCCCAT TCCCCATTAT CTGGGCAAAA TCGCGCACGC 14 340 

GCTTGAGGAG ACGATTGGCA ATACGAGGGG TTCCACGACT ACGTAGGGCC AACTCAGATG 14400 

CTGCCTCATG GGTGATTTCC ATCTCAAAAA TATCTGCCGT CCGCTCGACA ATTTCTGTCA 14460 

AGTCAGCATG AGCATAATAC TCCATATGAC CTGTAATCCC AAAACGTGCC CGTAGTGGAT 14520 

TTGAGAGCAT ACCAGCCCGA GTCGTCGCAC CAATCAAGGT AAAAGGAGGC AACTCCAAAT 14580 

GAACACTGCG ACTGCCTTCA CCAGCCCCAA TCATAATATC GATGTAGAAG TCCTCCATGG 14640 

CACTATAAAG CACTTCTTCC ACTGACATGG GTAAGCGATG AATCTCGTCA ATAAAGAGGA 14700 

CATCTCCAGG CTCTAAATCA TTCAAAATCG CTACCAAATC ACCCGCTTTT TCGATAACAG 14760 

GACCAGACGT TTGCTTGAGA TTGACTCCCA GTTCATTGGC AATGACAAAA GCCATGGTTG 14820 

TTTTCCCAAG CCCTGGAGGG CCAAATAAGA GCACATGATC CAGCGCTTCA TCCCGCATTT 14880 

TAGCGGCTTC GATAAAGATC TGAAGTTGAT CCTTAACCTT ATCCTGACCA ATATATTCAC 1494 0 

GTAAATACTG AGGACGGAGC GTGCGTTCTA CTAACTCCTC ATCACCCATC ATCTCATTAT 15000 

CTAAAATTCT ACTCATGGCT CTATTATATC AAAAAAAACA AGCCACAAAC AAAAAAGCCA 15060 

CCTGATTGGG TGACTCCTAA GTTTAGCACT TATGTCGTAT AATATTATAC GGCACTTCTA 15120 

CACCGCCTAC GAAAGGAGGT GAGATAGCCC ATGATGGAAT TAGTACTCAA AACTATTATC 15180 

GGACCAATTG TGGTCGCTGT CGTTCTTCGT ATAGTCGATA AATGGCTAAA CAAGGACAAA 15240 



TAGTGTCAAA AAAGACCTCA AGCTTATTTG GTCGTGAGCT TGGGGTCTTT TCTAGCCTAT 
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GATATAGAAC TAGTACTCAA TTCCTTTTTA TTATCCCATA GTTCACGAAT TTTGTCAAAA 15360 

CTTTACATTT TCTTCAACCG CTGTACGACA AGACGGTTAA GATTAAGAGA ACGTTAGGGA 15420 

TTCTATCAAT TTCATAGAAA TTTTGATTTC GTAAACGAAG AGACAATCTT ACATGTCACT 15480 

TCTCATTTAA TACGCCACTA CTAGACAAGC AAAATCATTA TTACAGTAGT TCCAGTCCTT 15540 

CAATTAACAG TdACTTACAA TCAAATTGAG TTTCAACTAG CTGAAGCGAC CACAGACCTA 15600 

TTTCTTAGTC ATATTCGCTA AAAAAATCCC CGCCAAAATC TCAAAAAGTC CCCGCCAATT 15660 

CCCCGACCAA AATCCGAAAA ATACCGAAAA ATATCGAAAA ATTATTTTTA GAATAGTCCC 15720 

AAAAATCCTG AAATAGAGCT AAAAAACTCC ACCTGATTCG GTGGAGTTAA GGGAGATTAT 15780 

TATGAAAAAG AAAAGTTTAG GATTTTATTA AATAAAGTTA GGAGGTCTTT ATTTAATAAC 15840 

TACATGATAC AAGACGAAAC TTAAAACTAG CTTAACTTTT CTAAAATTTT ACTATTTTGC 15900 

AAAAAATTTC TATCACCAGC ACCTCACCAA TCGAGTAGGG GATAATCTCT AGCCCCTCTC 15960 

ACACCACCGT ACGTGCCGTT TGGCATACGG CGGTTCAACT AACTTTTAAC GCATGTCGTT 16020 

CAAGGTAATA ATCCAAACAC GAAACCAGTC CACGTTTTTC CAGGACTGGT TTTGATATAG 16080 

CACGTTTAAG TACCGACTTC TGAGCTACTA ATTCATAATG GTCGCCCCAG CCAGATACCT 16140 

TATCTGCTAT CCATTTAGGA ACTCCTAACT TAAGCAATCC CCATAATCGT CTCGATTTCT 16200 

TCTTCCATTG CTTCCAGATA ATCACTCGTA GGCGAGTACG CAAGCGCTCA TCTATGCTGC 16260 

CG ACT AT ACT TTTCATATTT CCCAATGAGC AATACTTTAT CCATCCTCGA ATAGACAAAT 16320 

TCAGTTGCTC AATACGTCTT GTTAGGTCTA TACTCCATTT CCTCTCTGTT AGTTTCTTCA 16380 

ATTTAAACTT AAATCTCCGA ACACTATCTT GATGTGGACG GCTTTTCCAA CCATCTGATA 16440 

ATTTCCAGAA CCCAAAACCT AGATATTTCA ACTCTCTTGG TCATGTTTAC TTTCAAACCT 16500 

AGCCGTTTCT CAATAAACGA CTGACTGAAT ACATC 16535 
(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
( DJ TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

CCAGAGCGTT GCGTCCGAAA GTCTATCCAG ACACGGCTCT TTAAAAACAA AAGGAGAAAT 60 

GATGCATACT TATTTGCAAA AGAAAATTGA AAATATCAAA ACAACCCTAG GTGAAATGTC 120 
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AGGTGGTTAC CGTCGTATGG TTGCGGCTAT GGCTGATTTA GGATTTTCAG GAACTATGAA 180 

GGCTATCTGG GATGACCTCT TTGCCCATCG TACTTTTGCC CAGTGGATTT ATTTGCTGGT 240 

TTTAGGAAGT TTTCCTCTCT GGCTGGAGTT GGTTTACGAA CATCGTATTG TTGACTGGAT 300 

TGGGATGATT TGTAGCTTGA CAGGGATTAT CTGTGTAATC TTTGTATCGG AAGGTCGAGC 360 

AAGTAATTAT CTTTTTGGCT TGATTAACTC TGTTATTTAC CTTATTTTGG CCCTACAGAA 420 

AGGCTTTTAT GGTGAGGTGC TGACGACACT TTACTTCACA GTCATGCAGC CAATTGGACT 480 

TCTAGTTTGG ATTTATCAGG CACAGTTTAA GAAGGAAAAG CAGGAGTTTG TCGCGCGTAA 540 

ACTCGACGGC AAGGGCTGGA CAAAGTATCT TTCCATTAGT GTGCTTTGGT GGTTGGCCTT 600 

TGGCTTCATT TATCAGTCTA TTGGTGCCAA TCGTCCCTAT CGTGATTCAA TCACAGATGC 660 

AACCAATGGG GTAGGGCAAA TCCTCATGAC AGCTGTTTAC CGTGAACAGT GGATATTCTG 720 

GGCGGCTACC AATGTCTTTT CAATCTATCT CTGGTGGGGA GAAAGCCTGC AAATTCAAGG 780 

GAAATATCTA ATTTATCTCA TTAACAGTCT AGTTGGTTGG TATCAATGGA GCAAGGCAGC 840 

TAAGCAGAAT ACTGATTTAC TTAACTAGGA AAAGATGTTT GAAAGTGCTG TTTTGAGATT 900 

TCGATTAAAA CAGATATAGT TGATAATCAA GGATTTATAG TATGAAAAAG AGGATCGGCG 960 

GGTCCTCTTT TGTTGTTGAA AAGATAAAAA ACTCAGTAAC CTAGAAATAA GACAACTGAA 1020 

GCTTTACTCT ATATTCAATT TTTAGGAATG AGAAGGTCTA GATAAAATTG GACAACTTCC 1080 

TGGTCTGTGA AATCTTGACC TTTTTTGAGC CACCAGGTCA ATGTCTCGAT AAAGTTGGAC 1140 

ATGACCAAGT GTTGGAGGTA AGAAGTAGGC AGATTAGGGT GGGCTTCTTT TAAATTATCA 1200 

GCTAGCACGG AATAGACATG GTGTTCTAGC TCTTTATGGA GTTGACGGAG GAAGTAGTCA 1260 

TTTTTGGAAA ATAGCAGACT GGTGATATGG TCTTGGTTTT TATGAAAATG GAGAAAGAGG 1320 

TGGGCGAGGT AGTCCTCGGT TGAAATGGCT TGCTCTCTTT CAAAAAGATG ATGGAAGAGG 1380 

TAGCGGCAGA GCTGGTCCAG AAGAAGCTCC TTACTCTCAT AGTGACAGTA AAAGGTGGAT 1440 

CGTCCCACAT CTGCGAGATC AATGATATCC TGAACAGTAG TGGCCTCGTA GCCCTTACCA 1500 

TTCAAAAGTT GTATAAAAGC TTGATAGATG CCTTTTTTGG TTTTGCTGAT ACGGCGGTCA 1560 

ATGTTAGTCA TATGGACACT TAAGGCAAAT TGTTCAGAAC TGAATAAACC TGACGTTTTG 1620 

CTTCTATCCT TTCTTTGAGT TTTAGTGGAT AATGATAATG AACAAGGTGT TCATAAATCT 1680 

ATTATAACAA AGGAATGAGA AATATGAAGG CAAAATATGC TGTTTGGGTG GCTTTTTTCT 1740 

TAAATTTGAC TTATGCCATT GTTGAGTTTA TTGCAGGTGG AGTATTTGGT TCTAGCGCTG 1800 

TTCTTGCTGA CTCTGTGCAT GACTTGGGAG ATGCGATTGC AATTGGAATA TCAGCTTTTC 1860 

TAGAAACAAT CTCCAATCGT GAAGAAGACA ATCAGTACAC CTTGGGCTAT AAGCCGTTTA 1920 
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CCCTGCTAGG AGCCTTGGTA ACAGCTGTGA TTCTCGTAAC GCGCTCTGTT CTAGTCATTT 
TGGAAAATGT CACCAAGATT TTGCATCCGC AACCAGTCAA TGATGAGGGG ATTCTCTGGT 
TAGGAATTAT TGCGATTACT ATCAATCTGT TAGCGAGTCT GGTGGTTGGT AAGGGAAAGA 
CAAAGAATGA GTCTATTCTG AGTCTGCATT TTCTGGAAGA TACGCTAGGG TGGGTAGCTG 
TTATCCTGAT GGCGATTGTT CTTCGATTTA CGGACTGGTA TATCCTAGAT CCTCTTTTGT 
CCCTTGTCAT TTCTTTCTTT ATTCTTTCAA AAGCCCTTCC ACGTTTTTGG TCTACACTCA 
AGATTTTCTT GGATGCTCTG CCAGAAGGTC TTGATATCAA GCAAGTAAAG AGTGGCCTGG 
AGCGATTGGA CAATGTGGCC AGCCTTAATC AGCTTAATCT CTGGACTATG GATGCTTTGG 
AAAAAAATGC CATTGTCCAT GTTTGTCTAA AAGAAATGGA ACATATGGAA ACTTGTAAAG 
AGTCTATTCG AATTTTCCTA AAAGATTGTG GTTTTCAAAA TATTACCATT GAAATTGATG 
CTGACCTAGA AACTCACCAA ACCCATAAGC GAAAGGTGTG TGACTTGGAA CGGAGTTATG 
ACCATCAACA TTAGAAAAAA GTGAAAAATA CTTGGGTACT ATCTTATTTC GAATAGAGTA 
ATTTCTTTAT TATTTAAATA TTTCAAAAAT TGGTAAGAGA AGAGCATTGT ATAAACTCCA 
GATATATGAT TGTTAATGAT AAAAATTTTT CGATTAGATA CAAAATGCTT GACTTCGAGT 
CAACTCAAAG TTATATAATA AGATAAGTGA GTTAGAATAG CGTGAATTCA GTGAATGAAA 
TGAGAGGAGG TTAGCGTGTG AATATTAAAT CTGCCAGTGA TTTGTTGGGA ATTTCAGCGG 
ATACGATTCG GTATTATGAA CGGGTTGGTC TTGTGCCACC GATTACTCGT ACTGCTACTG 
GGATTCGTGA TTTTCAAGAT CAGGATATCG AAGCGCTGGA ATTTATTAAG TGTTTTCGTT 



CGGCGGGTGT CTCTGTAGAT AGTTTAGTTG 
AAACGAGAGA GGAGAGGCTT GCTATTTTAG 



ACTATATGTC GCTCTACCAA AAGGGAGATG 
AAGAGGAAAA GCAAAAATTA GAGGAGCCCT 



TGTCTCAGCT ACAGACAGCT TTAAATCGTT TAAATCTCAA AATTAAACTT TATAAGGAAG 
GAAAATTTTA AATGAAATCA GCAGTATATA CAAAGGCAGG TCAGGTTGGA CTTGCTAGCA 
TTGAACGTCC GCAAATAATA GAAGCGGATG ATGTGATTAT TCGTGTGCTT CGTGCGTGCG 
TTTGTGGTTC AGATTTATGG AGGTACCGTA ATCCAGAAAC GAAAGCTGGA CACAAAAATA 
GTGGACACGA AGCGATTGGG ATTGTTGAAG AAGCTGGGGA AGCCATTACG ACGGTGAAAG 
CAGGTGATTT TCTGATTGTC CCTTTTACAC ATGGATGTGG TGAGTGTGAT GCCTCTCTTG 
CTGGATTTGA CGGTTCTTGC GACAATCATA TTGGCAATAA TTTGGGGGGT GATTTTCAGG 
CAGAATATAT TCGCTTCCAC TATGCAAACT GGGCGCTGGT TAAAATCCCT GGTCAACCTT 
CTGACTATAC AGAAGGGATG CTCAAGTCCC TTTTGACTCT TGCAGATGTC ATGCCGACAG 
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GCTATCATGC GGCGCGTGTT GCAAATGTTC AAAAAGGGGA CAAGGTTGTT GTTATCGGTG 3720 

ATGGGGCTGT TGGTCAATGT GCTGTCATCG CGGCTAAGAT GCGTGGAGCA TCACAAATTA 3780 

TCCTTATGAG CCGTCATGAA GACCGTCAAA AGATGGCTAT GGAGTCAGGT GCGACAgcTG 3840 

TTGTTGCAGA ACGTGGTCAA GAAGGAATTA CCAAGGTGCG TGAAATCCTC GGTGGAGGAG 3900 

CAGATGCAGC ACTTGAATGT GTTGGTACGG AGGCTGCTAT AGAACAGGCG CTAGGTGTTC 3960 

TTCATAATGG AGGGCGTATG GGCTTTGTAG GAGTCCCACA CTATAATAAT CGTGCTCTTG 4020 

GTTCGACATT TATGCAAAAT ATCTCTGTAG CAGGTGGGGC AGCTTCTGCT ACAACATACG 4080 

ATAAGCAATT TTTACTAAAA GCCGTCCTTG ATGGTGATAT CAATCCAGGT CGCGTCTTTA 414 0 

CTTCAAGTTA TAAACTGGAA GATATCGACC AAGCCTATAA AGATATGGAT GAACCTAAGA 4200 

CAATTAAGTC TATGATTGTA ATCGAATAAA AAACGAATAG GAGTTTTAGA ACTCTATTCG 4260 

TTTTTTATGT TATCCTATTC TTGATTTAGG GTACTTTCTC TTAATGTCAG TCTGGTTCCC 4320 

AGCATGGTCA GGCTAGGGAT TTTCCGACCG TGGAGGACTT CCTTGTTAAG AATATCCATA 4380 

CCTGCTCGGC CCATTTCTTC AGTATAAACT GTAATACTAG AGAGGGGAGG ATAGACCTCT 4440 

TTGGTCAGAC TAGTGTCGTT AAAGGAAATG AGGCTGACGC GATCTGGCAG GCTGATTCCA 4500 

GCTTCTTGGA GGGCACGGAG GGCACCGATA CCTAAACTAT CGCTGGCTGC GAAAAATCCT 4560 

GGCGGAAGTT GGTCTCCCAA GCTCTGAATG GCCTCCTTCA TTAAGTCATA GCCAGACTGG 4620 

GCAGTAAATC TTCCTTGAAA GACCAGTTCA TCATGATAGA TTCCCCTCGC TTGACTATAG 4 680 

TTTTTGAAGT TTTCTAGACG CTTGTCCTGA ATGATTTCTT CTTGGTCTGT TGTTTCTTCA 4740 

AGGCCTGTTA GAATCCCGAT ACGCTCCATT CCTTGACTGA GGAAATAATC GACAACCTGT 4800 

TTCATAGCAG TGTAAAAATC CGTGATAATA CAGGTATGTC CCAGGGAAAG TGTATCG CTG 4860 

TCTAGAAATA CAAGAGGCTT TTGGTATTCT TCAAAGGCAG AAATCTGAGC TCGACTAAAC 4920 

TTTCCGATGC AGAGAATCCC AATCACTTCC TCGCTTAGGG TAAAAGGGTG GTCATTAAAA 4 980 

TAGCGCAAGA TATCATAGTC CAACTCTTGG GCTCTTTTTT CTATTCCTAG GCGAATCTGG 5040 

TAGTAGTAGA GGTCGTCCAG CTCCCCTTGT TCGCTGACCC ATTGGATAAT GGCAATCTTT 5X00 

TGCTTGGGTT TGTGGGACTC GCCTGTCTTG AGGTGCTTGG TGTAGCCCAG CTCTTCAGCA 5160 

ACGGTTAAAA TACGGTGTCT GGTTTCTTCT GTAACAGATA GGCTCTGGTC GCGGTTGAGG 5220 

ACGCGGGATA CGGTCGCGAT AGAGACAGAG GCTAGCTGTG CAATGTCTTT TAAGGTAGCC 5280 

ATAAATCCTC CTTGATTAGG TTAGTATATC ATGTTTTTCT TCTTTTTACT GATATTTTAC 5340 

TAAAATTTTA GTAAAAAGGA TTGACCTTGG AAAATTCCTT GGATATAATA GAAAGAAAAC 5400 

GATTACACGT TAAGATGGCT TAACGGACAG TCAAAGGAGA ATTCATATGG CACAACATCT 5460 
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TACTACTGAA 


GCCCTTCGCA 


AAGACTTTCT 


TGCTGTTTTT 


GGTCAAGAAC 


CAGATCAAAC 


5520 


CTTCTTTTCA 


CCAGGCCGCA 


TTAATTTGAT 


TGGTGAACAC 


ACAGACTACA 


ACGGTGGCCA 


5580 


CGTTTTTCCT 


GCTGCTATTT 


CCTTGGGAAC 


TTACGGTGCA 


GCTCGTAAGC 


GTGACGACCA 


5640 


AGTCTTGCGT 


TTCTACTCAG 


CTAACTTTGA 


GGACAAGGGC 


ATTATCGAAG 


TGCCTCTCGC 


5700 


TGACCTCAAG 


TTOAAAAAG 


AGCACAACTG 


GACCAATTAT 


CCAAAAGGTG 


TCCTTCATTT 


5760 


CTTGCAAGAA 


GCTGGGCACC 


TGATTGACAA 


AGGTTTTGAT 


TTTTATGTTT 


ATGGAAATAT 


5820 


TCCAAATGGT 


GCTGGCTTGT 


CTTCTTCTGC 


ATCCTTGGAA 


CTCTTGACAG 


GAGTCGTGGC 


5880 


TGAGCATCTC 


TTTGATTTAA 


AATTAGAGCG 


TCTCGATTTG 


GTTAAAATCG 


GCAAACAAAC 


5940 


AGAAAACAAC 


TTTATCGGAG 


TAAACTCTGG 


CATTATGGAC 


CAGTTTGCTA 


TTGGTATGGG 


6000 


GGCAGACCAA 


CGTGCTATTT 


ACCTAGATAC 


TAATACTTTA 


GAATACGACT 


TGGTGCCACT 


6060 


TGATTTGAAG 


GACAATGTCG 


TTGTTATCAT 


GAACACCAAC 


AAACGCCGTG 


AATTGGCGGA 


6120 


CTCTAAATAC 


AATGAACGTC 


GTGCTGAGTG 


TGAAAAAGCA 


GTGGAAGAAT 


TGCAAGTTTC 


6180 


CTTGGATATT 


CAGACTCTGG 


GTGAATTGGA 


CGAGTGGGCC 


GTTGACCAAT 


ATAGCTATCT 


6240 


GATTAAAGAT 


GAAAATCGTT 


TGAAACGTGC 


TCCCCATCCT 


GTGCTTGAAA 


ACCAACGTAC 


6300 


CCTCAAAGCT 


CAAGTAGCAC 


TCCAAGCAGG 


AGATTTGGAA 


ACATTTGGAC 


GCTTGATGAA 


6360 


TGCGTCACAC 


CTTTCTCTGG 


AGCATGATTA 


TGAAGTAACT 


GGTTTGGAAT 


TGGATACCCT 


6420 


TGTTCACACA 


GCTTGGGCAC 


AAGAAGGAGT 


TCTCGGTGCT 


CGTATGACAG 


GGGCTCGTTT 


6490 


TGGTGGCTGT 


GCcATTGCCT 


TGGTTCAAAA 


AGATACTGTT 


GAGGCCTTTA 


AGGAACCTGT 


6540 


AGGCAAACAC 


TACGAGGAAG 


TAGTTGGATA 


CGCTCCAACC 


TTCTATATCG 


CTGAAGTTGC 


6600 


AGGTGGCACT 


CGCGTCCTTG 


ACTAGTCAAA 


AGGAGGCTCT 


ATAGTCACCT 


TAGTAAATAA 


6660 


ATTTGTAACA 


CATGTCATTT 


CTGAAAGCTC 


ATTTCACGAA 


ATGGATCGAA 


TCTATCTGAC 


6720 


CAATCGTGTT 


TTGGCACGAG 


TGGGAGAAGG 


TGTTTTGGAA 


GTTGAGACCA 


ATCTGGATAA 


6780 


ATTGATTGAC 


CTCAAGGACC 


AGCTGGTTGA 


AGAAGCCGTT 


CGATTAGAGA 


CGATTGAGGA 


6840 


TAGTCAGACT 


GCGCGTGAAA 


TCCTTGGTGC 


TGAACTGATG 


GATTTGGTGA 


CTCCTTGTCC 


6900 


AAGTCAGGTC 


AATCGTGATT 


TTTGGGCAAC 


CTACGCCCAC 


TCTCCAGAAC 


AAGCGATAGA 


6960 


GGATTTTTAC 


CAACTCAGTC 


AGAAAAATGA 


CTACATCAAA 


CTCAAGGCCA 


TTCCTAGAAA 


7020 


TATCGCTTAT 


CGTGTTCCAT 


CTGACTACGG 


AGAACTTGAA 


ATTACCATCA 


ATCTCTCTAA 


7080 


GCCTGAAAAA 


GATCCCAAAG 


AGATTGTGGC 


AGCCAAGTTG 


GTGCAAGCTA 


GTAATTATCC 


7140 


TCAGTGTCAG 


CTTTGTCTAG 


AGAATGAGGG 


CTACCATGGT 


CGAGTTAACC 


ACCCAGCTCG 


7200 



I 
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TAGCAATCAC CGTATTATCC GTTTTGAAAT GGTTGGTCAC GAATGGGGTT TCCAGTATTC 7260 

GCCCTATGCT TACTTTAATG AGCATTGTAT CTTTTTAGAT GGCCAGCATC GTCCCATGGC 7 320 

CATTAGTCGT CAGAGTTTTG AACGTCTGTT GGCTATCGTA GACCAGTTTC CAGGATATTT 7380 

TGCTGGATCT AATGCCGACC TGCCGATTGT GGGGGGCTCT ATTCTAACTC ATGATCATTA 7440 

TCAGGGAGGC CGTCACGTAT TTCCTATGGA ATTGGCTCCC TTGCAAAAGG CCTTCCGATT 7500 

TGCTGGTTTT GAGCAGGTCA AGGCTGGAAT TGTCAAGTGG CCCATGTCTG TCCTACGTTT 7560 

GACTTCGGAT TCCAAAGAGG ATTTGATCAA TTTGGCTGAT AAGATTTTGC AGGAATGGCG 7620 

CCAGTATTCA GATCCTGCAG TGCAGATTTT GGCAGAGACA GACAGGACAC CGCATCACAC 7680 

TATCACACCC ATTGCCCGCA AACCCGATGG ACAGTTTGAG TTGGACTTGG TCTTCCGAGA 7740 

CAATCAGACT TCACCAGAGT ATCCTGATGG TATCTATCAT CCCCACAAGG ATGTCCAACA 7800 

TATCAAGAAG GAAAATATCG GCTTGATTGA GGTCATGGGC TTGGCAATCT TGCCACCACG 7 960 

TCTGAAAGAA GAAGTGGAGC AACTCGCTAG CTATCTTGTA GGAGAAGCTG TTACAGTTGC 7920 

CGATTATCAT CAGGAGTGGG CAGACCAACT CAAATCCCAA CATCCAGACT AACGGATAAA 7980 

GAAAAAGCCC TTGCAATCGT CAAGGACTCT GTGCGTGCTA TCTTTGCCCG TGTACTTGAC 8040 

GATGCAGGAG TCTACAAGCA GACAGAACAA GGGCAGACAG CCTTTATGCG CTTTGTGGAA 8100 

CAGGTCGGAA TTTTACTAGA CTAGGAGCTT TCTCGG 8136 
(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10011 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

CCCATAGTGA AGAGTGGCCA TAAGAAGGTC TTCTAGGCTT AATTTACCTT TTCGTCCACC 60 

TTTTGCGTGT TTAAGTTGAT AAGCTGTTTT TAACACAGCT GAACATCTCT TCAAAAGTCG 120 

TGCGCTGAAC ACCAACAAGA CATTTAAATC GTGTATCAGT TAGTTGTTTA CTTGCTTCAT 180 

CATTCATAGA ACTACTATAC CATGTTTTGT TTCGCAGGAA GTCTAATATT GTCAAATACT 240 

GGAACGCTCA TTGCTGGGAT ACGGAATAAG ATTGGCCCAG CTTCGATAAC TGGGATACCT 300 

GGTTCAAAAC CAAGGTCTGT TGCAGCGATT GGTGTAAAGA TATCGTAACC TTTCATAAGG 360 

TCTTCGTTTA CATCTTTCAC CATAACTGCA TCACAGTGAA CATCGTAACC ACGGTTTGAA 420 

AGTTCTTCTT CTAGAGCACT TTTAATTTGG TGACTTGAGT TAACACCTGC ACCGCAGGCA 480 
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GCAAGAATTT TAATCATTTG GATTTCCTCC GATTTTATTT TTTAATAGAC AAGATTAAGC 540 

CCTTCCTTCA GCAATGTAAG CATAAAGGGC TTCTGGTTCA GAAATTTTTG ATAGGTCTTC 600 

AAGATGACCA TTTCCTGTGA AGAAGTCCAT TAACTGAGCA AGAATCTTCG TTTGACTTGA 660 

ACTTGAATTA TTGATGATAA AGAAGAGCAA GGATACTTCT ACTTCCTTAC CTGGCGCAAT 720 

CATATTATGG AAAGTCACCG GTTTCTCTAA TCGAACAACC ACCACTTTCT CAGCTAGATT 780 

ATGAACAATA TCTGTGTGAG GAATCATTAC ATTTGCAAGT CCTTTCCTAG AAATTCCATA 840 

TATAAACCAG TTGGAAATGA CTTTTCACGC GTGATCAAGG CTTCACGATA AGTTGGAGTG 900 

ACAATTTCTC GTTCTTCCAA CAAGCTTGCT ACCTGATCAA AAAGTTATTC TTGATTATCC 960 

GCTTCTAAGC AAAACACAAG GTTTTTGTCA AAGAAATAAT CTAATACCAT AAGGTTTTCC 1020 

CTTCTTTCCA TTAACTTTAT GCTATAAGTA TAACACTATA TGAAATCGTT GTTAATTACT 1080 

TTCTATTCTT TTTTGTCTCT TTTTTTATAT TTTTGTTTTG TTTATAGTTT GTTATATAAA 1140 

AATAAACACA CAAACAAATA CTCCAAGCAT TTTTCTGTTC TAATACTCAA TGAAAATCAA 1200 

AGAGCAAACT AGGAAGCTAG CCGCAGTTGT TCAAAACACA GTTTTGAGGT TGT AG ATGAA 1260 

ACTGACGAAG TCACTCAAAA CATGGTTTTG AGGTTGTAGA TGAAACTGAC GAAGCAACAg 1320 

CCATACATAC GGTAAGGCGA CGCTGACGTG GTTTGAAGAG ATTTTCGAAG AGTATAAAAA 1380 

CTAAAAAAGC AGACCATCTA AGCCTGCTTT ACTATTGATT CTTATATAAA TTTCCTGTGA 1440 

ACAAGGAAAG GCATTTCTGA TAACTTATTC TTCATCCATA CTCAAGACGC TGAGGAAGGC 1500 

TTCTTGCGGA ACTTCAACTG ATCCGATGGA TTTCATGCGT TTCTTACCAG CTTTTTGTTT 1560 

TTCAAGGAGT TTACGCTTAC GAGAAACGTC ACCACCATAA CATTTAGCAA GTACGTTCTT 1620 

ACGAAGGGCC TTGATATCAG TACGAGCGAC AATCTTGTGT CCAATAGCCG CTTGGATTGG 1680 

AACTTCAAAT TGTTGGCGAG GGATGATTTT CTTGAGTTTA TCAACGATGA GTTTCCCACC 1740 

TTCGTACGCA AAGTCCTTGT GAACGATAAA GCTGAGGGCA TCCACCTTAT CTCCATTGAG 1800 

AAGAATATCC ATTTTCACCA GCTTAGATGG GCGATATTCT GACAATTCGT AGTCAAAGCT 1860 

TGCATAACCA CGTGTCGAAG ACTTAAGTTT ATCAAAGAAG TCAAAGACAA TTTCAGCAAG 1920 

AGGAATTTGA TAGATAACAT TGACACGGTT ATCATCAATA TAGTCCATAG TCACAAAGTC 1980 

CCCACGCTTA CGCTGAGCTA GCTCCATTAC TGCTCCGACG AACTCCTGTG GTACCATGAT 2040 

TTGCGCCTTG ACATAAGGCT CTTCAATGGT CGCAATCTTA GTTGGGTCTG GAAACTCAGA 2100 

TGGGTTAGAC ACATCCATAG ACTCACCGTC GGTCAAATTA ACTTTCTAAA TAACAGACGG 2160 

AGCTGTCATG ATGAGGTCAA TATTGAACTC ACGCTCTAAA CGTTCCTGGA TAACATCCAT 2220 
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ATGGAGAAGT CCAAGAAATC CACAACGGAA ACCAAATCCA AGTGCCTCAG ATGTTTCTGG 2280 

TTCAAACTGA AGACTAGCAT CATTCAGTTG CAATTTTTCA AGcGCTTCAC GCAGGTCATT 2340 

GTAC TT GTTT GATTCGATTG GGTAGAGACC CGCAAAGACC ATAGGATTCA TCTGCTTATA 2400 

ACCATGTAAT GGTTCTGCCG CAGGATTGGT TGCCAAGGTA ACGGTATCAC CCACACGAGT 2460 

ATCCTGAACC GTCTTGATAG ACGCCGCAAT GTAACCAACA TCACCAGTCG CAAGGAAATC 2520 

ACGACCAACC GCTTTTGGTG TAAAAATACC GACTTCGGCC ACATCAAAGG TCTTACTATT 2580 

GCTCATGAGC TGAATCTTAT CACCAGGTTT GACCACTCCG TCCATGACAC GCACTTGGAG 2 640 

GATAACCCCA CGGTAAGCAT CGTAAACAGA GTCGAAAATC AAGGCCTTAA GTGGCGCCGT 2700 

CACATCACCC GTTGGTGCTG GTACTTTTTC TACAATTTGC TCGAGGATTT CTTCAATCCC 2760 

AATACCAGCC TTGGCAGAAG CCAAAACTGC TTCACTCGCA TCCAAACCAA TCACATCTTC 2820 

AATCTCTGTA CGCACGCCCT CCGGATCTGC AGCCGGCAGG TCAATTTTAT TAATGATAGG 2880 

CATGATTTCC AAATCATTAT CCAAAGCCAG ATAAACGTTG GCAAGAGTTT GAGCCTCAAT 2940 

TCCTTGAGCC GCATCGACCA CCAAAATACC ACCCTCACAG GCAGCTAGCG AACGTGAAAC 3000 

TTCATAGGTA AAGTCAACGT GCCCTGGTGT GTCAATCAAG TGGAAAATAT AAGTTTCCCC 3060 

ATCTTTTGCA GTGTAATTCA ACTCGATGCC ATTCAACTTA ATAGTAATTC CACGTTCCCG 3120 

CTCTAGCTCC ATGCTATCCA AAAGCTGGGC CTGCATTTCA CGACTTGAAA CCGTCTCTGT 3180 

TTTTTCCAAA ATGCGGTCTG CTAGAGTTGA TTTTCCGTGC TCAATATGGG CGATAATAGA 3240 

GAAGTTACGG ATCTTCTCCT GTCGTTTTTT CAATTCTTCT AAGTTCATGA TTCTCTTCCT 3300 

TTCAGGGTAT CTATTTATTA TAAATTGTTT TTGATATTTT GACAAGACCA TACCCTGCTA 3360 

GGAGTACTAA TCTTCAGCGA CAAAGCCGTC ATTTTCGATA AAGTGGTGTT CTGTCATTCC 3420 

TTGGTCTGTA AAGACAATCC CGTGAAGGAC ACCACCATAA ACAGCTCCTC CATCCATTCC 3480 

AATCTTGCCA TCTTCTGTAG TCCAAAGCTC AGATGTACCG CCTTCTTGCT GTAACAAACC 3540 

ATAGACCGGT GTATGACCGA AGACAATGGT TTTTCCAGTA TGATTTTCAG CTCCGTGGAA 3600 

TGGTTTTCTA AGCCATACTT TTTTATAATC TGTTGTTTCA TGCCAGTCGT CCAAGGTCAA 3650 

ATCAATACCT GCGTGAACAA AGATATACTT GTCTGTCTCT ACTACAAATC GCATTTGACG 3720 

AATGAATTCG ACCAAGTCTG CCGCTTCAGC GgCAACCCGC TTGGCATCTT CTACTCCATC 3780 

AACTGGTGCA TCCAAGGGAC GACCTAGGAT AGAGTTAATG GTTGTATCTC CACCATTGCG 3840 

ACTATAATGG TCATAACTTT CTTCTGGGTC ATCTAGCCAA GTCAAAAACA TATACTCGTG 3900 

GTTTCCGGAC AAACAGATAG CCCCTTGATT GTCCACCAAG TCCTTGACCA TTTCAAGAAC 3960 

ACGGTGACTA TCCTCACCTC TGTCAATCAA ATCACCTAGA AAGAGCAACT GGGGCTGACC 4020 
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ATCCCAGGTT TTGAGAAGGT CTTCCAGCAT CCCAGCTTTT CCGTGAACAT CTCCAATTAC 
ATAATAATCT GTCATCTTAT TTCTCCCTGT TTCTCAACAA TTCTCTTGCT TGCGTCAGGG 
CTGCTTCTGT CACATCATCA CCTGCCAACA TCTTGGCAAC TTCCTCCACT CGCTCTTCGA 
CCGTCAAGAG ACGAACAGTC GAAACCGTTG AATGGTCATT ACTAATCTTC TCAATAAAGA 
ATTGATAATC TGCAATCGCA ATTACTTGTG GCAAATGGGA GATAGCCAAA ACCTGACCAT 
GCTGACCAAT TTTATGAATT TTCTGAGCAA TAGCTTGAGC AACACGACCT GAAACTCCCG 
TATCCACCTC ATCAAAGACA ATGCTAGTCT TGCCTTCTTT ACGTGAAAAG GCAGACTTAA 
TGGCTAACAT GAGACGAGA? AATTCCCCTC CAGAAGCAAC CTTAACCAAG GGTTTAAAGT 
CTTCTCCAGG GTTGGTTGAA ATATAAAACT CAACCATTTT ATTTCCCTCA CGACTGAATT 
TTCCCTTACT AAAACG AAC C TGAAACTGGG CTTTTTCCAT ATAAAGATCT TCCAGTTCTT 
GTTTAATCTC AGCTTCGAGT TGCTGAGCCA AATTATGACG AGCACAAGCA AGTTGACCTG 
CCAAATTGAC AAGATTGACT TCCAACTTCT TAAGCTCTGC TTCCATGTCC TCAGACGAAA 



GATTATTGCC 

CATCAACAGT 

CCTGCATCAG 

TAATGTCTTC 

GATCATACTC 

AAAAATCTTC 

GGTTGAGGAG 

CTGCCTCAAT 

GGTTTTTCTT 

GATAGGTTTC 

GCAGTTGGGG 

GCCCAATAGC 

GAC C ATTTT G 

CATCAAAAAT 

CCTTTGGTGC 

TGGCATCAAT 

TTTCAAAATT 



TGTCAAGAGA TTGTATTCTT CCGTAATCTT GGCAAAATAA AGCAAAACAT 
CCCACCATAC TTACGAGTAA TAGTATGAAG GAGGTCCAAA CGATTCTCAA 
GCGATTGCCA TCAAAATCAA GGTCCTCAAT GATAGCTTCC AAACGTTTCC 

TGAAATTTCA CGGTATTCAG 
ACGAACATTG GCCAGACTTC 
ATTGTCCAAC ATACTGTAGG CATTGCTCAG TGTATCCGCA ATATTTTTGT 

TTCTCCACCC TGCAAGTTTG 



TAAAACATAG TAGGTCTCAG ACAGATACCT 
TTCGACACTT TCCATGTCAT TCATACCTGA 



TTTATCTCGC TCTTGATTGA GAGCCAAGTC 
CTCTGCCATT TGAAATTCCA ACATTTCGAT 
GACTTCCAGA ACCTGCTTGC GCATTTTCCG 
TTTCAAGTCC CAAAAAGCGG CATCACCAAA 
ACGCATTAAC TCCTCATGGT CATGCTGACC 



ACGTGCCTTG TGTTCCTGTT 
ATAGGCATCA AAACTCCTTT 
TTCATCCAAC ATCTGGATAT 
ATGAATATCT ACAAGATGTT 



TCGCAAAACA GACAGATTAA CCATCTGACC ATTTACACGG CTGATACTAC 

ACCTAATTCT AAACCTTGCT 
GAAAAGCCCC TCAATCTCTG 



CAAGATTTCC CGACGGATGA TAATTTCATC 
TTCCTGTAAA AGACGACTAT TCTCAACTGA 



ACCATGACGA ATAACATCTG TCGTCGCACG AGCTCCCAAC ATCATATTCA 
GATAATCGAC TTCCCTGCAC CCGTTTCACC AGTCAGGACA GTCATCCCCT 
GAGGGAAATA GCCTCAATAA TGGCAAAGTT TTTTATCGAA ATTTCAAGTA 



4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 
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ACATATAGAC CTACCAATTT TTTACTTGTT CAAAGATTTC CTCTGCTAGA CTTCCACTTC 5820 

TGGCAATGAC TAAAATCGAG CTATCATCAG TCAAACAGCT AAAAATCTTG TCTGCAAAAG 5880 

TCTCGATTAA CTGAGCTTTT ACAAAACCCG TATTTCCTGG AATAACTTGG AGATTGATCA 5940 

TCTTATCCAT CAATTCAGCC CATTCGATAT TGTCTTCAGC CAGTTGCAGA CTTTTTACGA 6000 

TTGATTTTGG CAATTCGTAG ACATAGGTGT TGTCTCTCAA AGGAATTTTG ACAATACCTA 6060 

ACTCTTTGAT ATCTCCGGAT ACCGTCGCCT GAGTGGCAGT GATACCTGCT TCTTTCAAAT 6120 

GTTCTACAAT TTCTTCTTCC GTGCCGATTT GATAATCTGT CACCAATCTT CTAATTTTTT 6180 

CAAGTCTCTC TTTTTTATTC ATTTTTAAAT TGACTATGCG CCCTCTCTAC TGCTTCTTTA 6240 

ATCTCAGCAA GAATCTGATT GCTTGCTGAC TTTTCTTTTT TCAAATACGC TAAAAATTCA 6300 

ATATTTCCAT GTCCACCTTG GATGGGAGAA AAGTCCAAGC CAAGGACTGA AAAACCTACC 6360 

TCTACTGCCA TAGCTGTTAC AGATTCAAGG ACATTCTGAT GAACCTTAGC ATCTCGAATA 6420 

ATTCCATTTT TCCCAATCTG CTCACGTCCT GCCTCAAACT GAGGTTTGAC AAGTGCTACC 64 BO 

ACCTGACCTT GATCAGCCAA GACACGGTGC AAGGCTGGCA AAATCAGACT AAGGGAAATG 6540 

AAACTCACAT CAATACTGGC AAAGCTCGGC TCCTGCTCGA AATCAGTCTT TTCAGCATAG 6600 

CGGAAATTGA ACTGCTCCAT GCTGACAACT CGTGGGTCTT GGCGTAATTT CCAAGCCAAC 6660 

TGATTGGTAC CAACATCGAC TGCAAACACC AACTTGGCAC TATTCTGTAC CATGACATCG 6720 

GTAAAACCTC CAGTAGAGGC CCCGATATCA ATCGTAGTCG CGCCATCCAC CGACAAATCA 6780 

AAGACCTGCA AGGCCTTTTC CACTTTCAAA CCACCACGGC TGACATACTT GAGTTTCTCC 6840 

CCCTTGACTT TTAATTCGGT GTCATCTGGA ATTTTCTCTC CTGGCTTGTC AAACCGTTCT 6900 

CCATTAAGGA CTGCTACGAC TAGGCCAGCC ATCACACCTC GCTTGGCCTG CTCTCTCGTT 6960 

TCAAACAACC CCTGTTTATA AGCTAGTACA TCCACTCTTT CCTTAGCCAT TGATTCTCAA 7020 

ACTTTCTACT ACACTTACAA TCGATTCTGT TTCAAAGGGA AGCTGCTGGC CAATTTCTTC 7080 

TAATTTTTCA TTAGCTTGAT CCAGGGTTTG GTTACAAAAG GCAATGGACT CTTCCAAGCC 7140 

CAACAGGGCA GGATAGGTTG ATTTTTCTGC CTGCAGATCC TTTTGAGGTG TCTTGCCGAT 7200 

TTCCTCAAAA CTAGCTGTCA CATCCAGTAC ATCATCTCTG ACTTGAAAAG CAAGTCCAAT 7260 

CAATTCACCC ACAGTTTTCA GCTTCACCTG CATTTCAGGT GACAATTCAG CTATAATAGC 7320 

TGCCGCTTGG AAGGGATAGG CTAGTAACTT CCCAGTCTTA TTGGCATGAA TAGTCTGAAG 7 380 

TTCTTCCAAA GACAAGTGCT GGTGTTCGCC CTCCATATCC AAAACTTGCC CTGCTACCAT 7440 

ACCCAGACTA CCTGAAGCAA GGGATAAGTT GGCAATGAAG TCCACCTTAA TCTGACTTGG 7500 

CAAATCTGCC TGCGCAATCA AGGCATATGA GTCTAAGAAT AAGGCATCTC CAGCCAAAAT 7560 
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GGCCATAGCT TCACCGAATT TCTTGTGATT GGTTAACCGC CCTCTTCGAT AATCGTCATC 7620 

ATCCATAGCA GGAAGGTCAT CGTGAATCAA GCTCCCTGTA TGAATCATCT CTAAGGCAGT 7680 

AGCTACCTGC GCGTGAGCAG GTTTGATGGT AACCTGCAAG GCTTCCAGAA CTTCTAACAA 7740 

GAGAAAAGGC CGAATACGCT TGCCACCAGC ATGAATAGAA TAGAGAACAG ACTCCCGTAA 7800 

ACTAGAGGCA AACTGCTGGT CTCCATAAAA ATCTTCCAAA GCCGACTCGA CAAGAGCTAA 7860 

TTTTTCTTGC TTTTTCATTC AAAATCACTT TCTGTTCCGT CTTCTTGCAT GACCTTGACC 7920 

AAGGTCTTTT CAGCCTTGTC CAGCGTAGCT TGGAGCTCTT TTGACAAGAC CATGCCCTTT 7980 

TCAAAGGCAG TAATCGCATC TTCCAGAGCA ATTTCACCAT TTTCCAAACT TTGGACAATG 8040 

GTTTCCAGTT CTGCTAGATT TTCCTCAAAT TTCTTTTGTT TTGACATCTT TAACCTCTAA 8100 

TTCTACTTGA CCATCTCGCA TCAAAAGCGT TACTTGGTCT TTTTTCTTCA AACTCTCAAC 8160 

CGAATCTACA ACGGACTCTT CTTTTTTGAC AATAGCATAA CCACGCGCCA CGATTCGGCT 8220 

AGTATCCAAC ATGAGCAAAG CTTCCGAAAG TCGCTTGGCC TCAGCAACCT TGGCGTCATA 8280 

AACTAACGCC ATTTGGCTAC CTAAGAGCTT GTCCAACTGT CCTAAACCCT CTTGATAGCG 8340 

TTGGATTTTG GTAACAGGTG ATAATTGTAC TAATTCATGA GTTCTTGCTT GAACTAATTG 8400 

TTTGTTATCA GAAATCCGAG TTCGCAAACT TTGTTTCAAA CGCAGTTGCA GTTGGTCCAA 8460 

GCGTTGCAAA TAACCGTCAT ACAAGCGCTC AGGTTGTCTA AAGATAACAG ACTGACTGCA 8520 

TTTTTTCAAA GCCTCTTGTT TCTTAGATAG AACATTTCGG ACTGCCGTTA CCATCCGTTT 8580 

TTCCTGATTT TGCAAATGAG CTAATACATC CAACTTGGTC ACAGGTGTTG CCAGTTCACC 8640 

CGCCGCTGTT GGCGTTGCAC CGCGTCGATC TGCCACAAAA TCTCCCAAGG TCACATCCCT 8700 

CTCATGCCCC ACACTAGAGA TAACTGGCAA ACGAGATTCA AAAATAGCTC GTACCACAAT 8760 

TTCTTCGTTA AAGGCCCAGA GATCCTCAAT AGAACCACCT CCACGACCAA TAATGAGCAA 8820 

ATCCAAATCG TCCCGTTGAT TACCACGCGC AATATTTCTA GCAATTTCCT CCGCAGCCCC 8880 

TTCACCTTGA ACCTTCGTCG GATAAAGAAG GATGTCAACA CCTGGGAATC CCCTGCTGAC 8940 

GGTCGTGATA ATATCTCGAA TAACGGCTCC ACTACGGCTG GTTACTACAC CAATTCTCTT 9000 

AGAAAATTGG GGCAGAGCTT GCTTGAAGCG TTCTTGAAAC AGGCCTTCTT CTGTCAATTT 9060 

TTTCTTAAGT TGTTCAAACT GAATCGCAAG CGCCCCAACC CCATCAGGCT CAGCTTTTTC 9120 

AATGATGATG GAGTAGCTAC CACTTGGTTC ATAGACCTGT ACACGCCCAA TCACATTGAT 9180 

CTTCATTCCT TCTTCCAGGT CAAACCCTAA TTTCTGATAA ATCCCAGACC AGATGGTCGC 9240 

TTGAATAACT GCATGGTCAT CCTTTAGGGA GAAATATTGG TGAGTAGGTC GTTTACGAAA 9300 
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GTTGGAAACT TGACCAGTTA AATAGACCCG TTCCAAGTAT GGGTCTTTAT 
TTTCAGATAC TTGGTCAAAG TTCTTACCGA TAAATACTTT TCCATCTCCA 
ATTTACTTGC TCTTTCATGG GTATTATTAT ACCAAAAATA TGCCTAAAAA 
TGTACCATTA TGAGGGAAAA ATAGAAAAAG GAGGCAAGGC CTCCACATGT 
TGTTTCGAGC TTCTTCCAAA ATCTTTGCAA TCTTGGTCGT CAACAGGTCG 
TATTGCTAAC CCCTTCAGGA ATGACGATAT CAGCATAACG CTTAGTTGAC 
GGTGGTACAT TGGTTTGACC ACACCTAAGT ACTGGTTAAT AACGCTATCA 
CACGCTCCTC CATATCACGC TTGATACGAC GAATAATGCG CACATCGTCA 
CAAAAATCTT GATATCCATC AAATCGCGCA GACGCTTGTC CTCCAAGACC 
CAACGATAAA GACATCTTGA GGTTCCTGAC GATAGGTCTT GCTACTCCGT 
TATAGTCGTA GGTCGGGATG TCCACCGGAC GCCCTGCCAA CAATTCCTTA 
TCATCAAGTC TGTATCAAAG GCAAAAGGAT GGTCATAGTT GGTTTTGACG 
{2) INFORMATION FOR SEQ ID NO: 77: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5365 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



CGAATTTCAT 

CCTACTATTC 

TCTCCATTTA 

GATTATTTGC 

ATAGCCACGG 

TCGATAAACT 

AGGCTACGGC 

TCCGTATCCA 

AAAATACCCT 

GTATGCTCTG 

ATCTGCTCGA 

G 



9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10011 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 



CGTGTGGTCT 


TAAAAATAGA 


AGACAAAGAA 


CAAACTGTTG 


GAGGCTTTGT 


CCTTGCAGGC 


60 


TCAGCCCAAG 


AAAAAACCAA 


AACAGCTCAA 


GTTGTGGCTA 


CTGCACAAGG 


TGTTCGTACC 


120 


TTGAACGGTG 


ACTTGGTTGC 


TCCAAGTGTT 


AAAACTGGAG 


ATCGTGTCTT 


AGTTGAAGCC 


L80 


CACGCAGGTC 


TTGATGTCAA 


AGATGGCGAT 


GAAAAGTACA 


TCATCGTAGG 


CGAcTAACAT 


240 


TTTGGCAATC 


ATTGAGGAAT 


AGAAGGAGAA 


AGTAAGTATG 


TCAAAAGAAA 


TTAAATTTTC 


300 


ATCAGATGCC 


CGTTCAGCCA 


TGGTTCGTGG 


TGTCGATATC 


CTTGCAGACA 


CTGTTAAAGT 


360 


AACCTTGGGA 


CCAAAAGGTC 


GCAATGTCGT 


TCTTGAAAAG 


TCATTCGGTT 


CACCCTTGAT 


420 


TACCAATGAC 


GGTGTGACCA 


TTGCCAAAGA 


AATCGAATTG 


GAAGACCATT 


TTGAAAATAT 


480 


GGGTGCTAAG 


TTAGTATCAG 


AAGTAGCTTC 


TAAAACCAAT 


GATATCGCAG 


GTGACGGAAC 


540 


TACGACTGCA 


ACAGTCTTGA 


CCCAAGCTAT 


CGTCCGTGAA 


GGAATCAAAA 


ACGTCACAGC 


600 


AGGTGCAAAT 


CCAATCGGTA 


TTCGTCGTGG 


GATTGAAACA 


GCAGTTGCCG 


CAGCAGTTGA 


660 


AGCTTTGAAA 


AACAACGCCA 


TCCCTGTTGC 


CAATAAAGAA 


GCTATCGCTC 


AAGTTGCAGC 


720 
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CGTATCTTCT CGTTCTGAAA AAGTTGGTGA GTACATCTCT GAAGCAATGG AAAAAGTTGG 780 
CAAAGACGGT GTCATCACCA TCGAAGAGTC ACGTGGTATG GAAACAGAGC TTGAAGTCGT 840 
AGAAGGAATG CAGTTTGACC GTGGTTACCT TTCACAGTAC ATGGTGACAG ATAGCGAAAA 900 
AATGGTGGCT GACCTTGAAA ATCCGTACAT TTTGATTACA GACAAGAAAA TTTCCAATAT 960 

CCAAGAAATC TTGCCACTTT TGGAAAGCAT TCTCCAAAGC AATCGTCCAC TCTTGATTAT 1020 

TGCGGATGAT GTGGATGGCG AGGCTCTTCC AACTCTTGTT TTGAACAAGA TTCGTGGAAC 1080 

CTTCAACGTA GTAGCAGTCA AGGCACCTGG TTTTGGTGAC CGTCGCAAAG CCATGCTTGA 1140 

AGATATCGCC ATCTTAACAG GCGGAACACT TATCACAGAA GACCTTGGTC TTGAGTTGAA 1200 

AGATGCGACA ATTGAAGCTC TTGGTCAAGC AGCGAGAGTG ACCGTGGACA AAGATAGCAC 1260 

GGTTATTGTA GAAGGTGCAG GAAATCCTGA AGCGATTTCT CACCGTGTTG CGGTTATCAA 1320 

GTCTCAAATC GAAACTACAA CTTCTGAATT TGACCGTGAA AAATTGCAAG AACGCTTGGC 1380 

CAAATTGTCA GGTGGTGTAG CGGTTATTAA CCTTCGACCC GCAACTGAAA CTGAGTTGAA 1440 

AGAAATGAAA CTCCGCATTG AAGATGCCCT CAACGCTACT CGTGCAGCTC TTGAAGAAGG 1500 

TATTGTTGCA GGTGGTGGAA CAGCTCTTGC CAATGTGATT CCAGCTGTTC CTACCTTGGA 1560 

ATTGACAGGA GATGAAGCAA CAGGACGTAA TATTGTTCTC CGTGCTTTGG AAGAACCCGT 1620 

TCGTCAAATT GCTCACAATG CAGGATTTGA AGGATCTATC GTTATCGATC GTTTGAAAAA 1680 

TGCTGAGCTT GGTATAGGAT TTAACGCAGC AACTGGCCAG TCCGTTAACA TGATTGATCA 1740 

AGGTATCATT GATCCAGTTA AAGTGAGTCG TTCAGCCCTA CAAAATGCAC CATCTGTAGC 1800 

CAGCTTGATT TTGACAACAG AAGCAGTCGT AGCCAATAAA CCAGAACCAG TAGCCCCAGC I860 

TCCAGCAATG GATCCAAGCA TGATGGGCGG CATGATGTAA GCTTTCTATA GAAAACAACT 1920 
TATAAAAAAC ACAAAAGGAG GGAATGACTA ACCCTTCTTT TTATAGGCTC TTTGTCAACT 1980 

CTACTGGGTT GAAGTCAGCT AAGCTCGAGA AAGGACAAAT TTCGTCCTTT CTTTTTTGAT 2040 
GTTCAAAGCG ATAAAAATCC GTTTTTTGAA GTTTTCAAAG TTTCGAAAAC CAAAGGCATT 2100 
GCGCTTGATA AGTTTGATGA GATTATTGGT CGCTTCCGGT TTGGCGTTAG AATAGTGTAG 2160 
TTGAAGGGCG TTGATAATCT TTTCTTTATC TTTGAGGAAG GTTTTAAAGA CAGTCTGAAA 2220 
AATAGGATGA ACTTGCTTAA GATTGTCCTC AATAAGTCCG AAAAATTTCT CCGGTTCCTT 2280 
ATTCTGAAAG TGAAACAGCA AGAGTTGATA GAGCTGATAG TGATGTTTCA AGTCTTGTGA 2340 
ATAGCTCAAA AGCTTGTCTA AAATCTCTTT ATTGGTTAAA TGCATACGAA AAGTAGGACG 2400 
ATAAAATCGC TTATCACTCA GTTTACGGCT ATCCTGTTGT ATGAGCTTCC AGTAGCGCTT 24 60 
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GATAGCCTTG TATTCATGGG ATTTTCGATC CAATTGGTTC ATAATTTGAA CACGCACACG 2520 

ACTCATAGCA CGCCTAAGAT GTTGTACAAT GTGAAAGCGA TCCAACACGA TTTTAGCATT 2580 

CGGGAGTGAA ACAGTCTGGG AGACTGTTTC AGCCTGAGCC TAGAAATTTG AAAGCGAAGC 2640 

TGTTTAGCCA AGTCATAGTA AGGACTAAAC ATATCCATCG TAATGATTTT CACTTGACAA 2700 

CGAACGGCTC TATCGTAGCG AAGAAAGTGA TTTCGGATGA CAGCTTGTGT TCTGCCTTCA 2760 

AGAACAGTGA TAATATTAAG ATTATCAAAA TCTTGCGCAA TGAAACTCAT CTTTCCCTTA 2820 

GTGAAGGCAT ACTCATCCCA AGACATAATC TTTGGAAGCC GAGAAAAATC ATCCTCAAAG 2880 

TGAAAGTCAT TGAGCTTGCG AATGACAGTT GAAGTTGAAA TGGCCAGCTG ATGGGCAATA 2940 

TCAGTCATAG AAATTTTTTC AATTAACTTT TGAGCAATCT TTTGGTTGAT GATACGAGGG 3000 

ATTTGGTGAT TTTTCTTTAC CAGGGGAGTC TCACCAACCA TCATTTTTGA ACAGTGATAG 3060 

CACTTGAAAC GACGCTTTCT AAGGAGAATT CTAGAAGGCA TACCAGTCGT TTCAAGATAA 3120 

GGAATTTTAG AAGGTTTTTG AAAGTCATAT TTCTTCAATT GGTTTCCGCA CTCAGGGCAA 3180 

GATGGGGCGT CGTAGTCCAG TTTGGCGATG ATTTCCTTGT GTGTATCCTT ATTGATGATG 3240 

TCTAAAATCT GGATATTAGG GTCTTTAATA TCGAGCAGTT TTGTGATAAA ATGTAATTGT 3300 

TCCATATGAA TCTTTCTAAT GAGTTGTTTT CTCGCTTTTC ATTATAGGTC ATATGGGACT 3360 

TTTTTTCTAC AACAAAATAG GCTCCATAAT ATCTATAAGG GATTTACCCA CTACAAATAT 3420 

TATAGAGCCG AAAATTCACA TCTAATATAT GCAGACTACT TTGAAATGAA ATTAAAAAAA 3480 

TTATTAAAGG ATGACACAAA AGTTTTTGAA AAATCTACAT TCAAATTTGT AGAAGGATAT 3540 

AAAATATACC TGACAGAATC TAAAGAATCT GGAATTAAAC AAATGGACAA TGTCATAAAA 3600 

TATTTTGAGT TTATTGAATC TAAAAGTATT GCTTTATATT TTCAAAAACG ATTAAATGAG 3660 

CTGATAGATT AAATAGCATT TTCTCTGTTG AGATATTGTT TTTAAAATAT TGTACTAAAT 3720 

GATTGATGCT ATGTGGAAAT ACAAAAAAAT GTTTTTGATA CGAAGTTGAC CTGTATTTTT 3780 

TATACTAATC ATTTTCGTAT TTTTTGTATT AAACGATATA AGTTTGTTGT AAACTTACAA 3840 

GGAATAAAGA CATTAAAAAA TAACAGTATA TCTATTTGTT TTATATATTT TACGAATTCT ;'900 

GCATAAATCT CTTTCTAGTA ATGTGTTGTA ACTCTGCTAT AATAGATTTA TTCCTTTTTG 3960 

TGTTTACACA ATTTATTTTA TAGTACCAAA AAAGGTCAGG ATTTTGTTCC TGACCTTTGA 4020 

CAACTTTACC GATTCTTTAG TTCTACATAG CGCTTGTACC AAATCTTTAC ATAGGCTTCT 4080 

GAGAAAGGAC CACGTCCATT GTTAATCCAA TCAACAAGAA TTTTGACATG TTCTTTTAAA 4140 

ATATAGTCCA AGTCATCAGA ATAATTCATT TTGCGTTTGT GACGCTCGTA CTCTTCAACG 4200 

TCCAAGAGAC GTTTTTCCCC ATCTGTAAAA ATTTTAACAT CCAAATCGTA ATCAATATAC 4 260 



WO 98/18931 



PCT/US97/19588 



631 

TTCAGTGCTT CTTCATCCAG ATAGTAGGGG CTAGCCATAT TGCAATAGTA AGAAGTTCCA 4 320 

TTATCACGAA TCATGGCAAT GATATTAAAC CAATATTTCT TGTGAAAGTA AACAATAGCC 4360 

GGTTCTCGAG TGACCCAACG ACGACCATCA CTTTCGGTAA CAAGTGTATG ATCGTTGACA 4440 

CCAATAATGC CGTTTTCTGT TGTTTTTAGT ACCATGGTGT CCCGCCAAGT TCGGTGGAGA 4500 

CTCCCATCAT GCTTATAACT TTGAATTGTA ATAAAGTCGC CTTCTTTTGG AAGCTTCATA 4 560 

ACTAACCAAC TTTCTACAAT TTATAAGTTT ATCATTTACT ATTGTACCAT AAAATTACCC 4620 

AAAATCTGTG AATTTCACTT GGAAATATTA AAGATATTCT CTAAGAGCGC TTGCTATATC 4680 

CGAAAAATCG TAGCCCTTTC GTGCTAAAAC TTGAGTTAAA CGCTGCTTCA GTTCGTATCC 4740 

TTCATACTTT CGGGCATACT TAGTATATTG CTTATCAAGT TCCTTGAAGA TGAGTTCCTG 4 800 

AGTCGTTTCT TCATCAACTT GACTATCCAA TTCGTCAAAG GCAATTTTAG CATCAAAATA 4860 

AGAGAAGCCC TTGTTAGTCA AGTTCTGGAT AATCTTATCT TGCAGGGCAC GAGCTGGAAG 4920 

TTTTCCCTCA TATTTTTTCA ATAGTTTATT GGCTACACGT TGAGCAACTT CCGAAAAATC 4980 

AAAATCATTC AAGATTTCTT CTATAGTAGA TTTTGAAATT CCTTTTTGTG CTAATTTCTG 5040 

AGTCAGTACA TAAGGTCCCT TGTCTCCTGA AAGTTGATTG GCATTGATGA TAGCATAAGC 5100 

GTACTGGCTA TCATTAATCC ACTTCTCTTC TTT^AGATTA GCAATGACTT GAGAAACGAT 5160 

GTTTTCATTA ATATCATATT TTTTCAGATA TTCTCTCACC TCTTTTTCAG TACGTGCTTT 5220 

AAAGGATAAG TGGTAGACGG CCAGATTCTT ACCATAAGAA AATTGAGCAA AGTCTTGAAT 5280 

CTCTTTCAAT TCCTCTTCGC TTATCACCTT ATCTCTCGAT AACATAAAAC GAACAATTGT 5340 

GTCTTCGGTG ATATAGCATT TGTCG 5365 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3636 base pairs 
IB) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

TTTCCAGAAA GAAGTTGAGT AAAGTCTTTA TCAAAGAGAA TGACTTCCGT ATTGGAACTG 60 

ACATTAGGTT TTATTTCTAC TTTACTAGCG TCCGCCCTAG CATTTTCTAA ATCTTTAATC 120 

TCTTCTGTTG CCCTATTTAT AGCCAGCTGA ATAACTGCTT GAGGATTTTC ACTCAGTCCA 180 

TGAAGCTTAT CGTCCACCGA AGTATAAAGA CTCGAATGCA TGACTTGTAA AATAATCAGA 240 



WO 98/18931 



PCTAJS97/19588 



GTCATTGTAG AAAAAATCAG GGTGAAGACA CCGAAGTTGC GGATAAAATA ACTAAAGTCA 300 

TCCGCATACC ATGTTTTTTT AAGTTTACTG AACATCTTTT AAAAGATACC CAACACTACG 360 

CAAAGTTTGC AAATTCTCTG CAAAACTGGT TCCCTTTAAT TTCTTACGGA CTTTTGAAAC 4 20 

ATAGACTTCG ACAACCGAAA TCGTTGTATC ACTATCAAAT CCCCATAGAC GGTCAAAAAT 4 80 

CTGCGTCTTA GGCAAAATCA CATTTTGATT TTGAAGGAAA TAAACTAGTA AATCGAACTC 540 

TTTCCCCAGC AATTCGACAG GAGTATCTTC AACTTTAACG GTATTGGTTG ATAAATTAAC 600 

CACGATATTC CCATAAGTCA AGGTGTTTTC ATTAAACTTC CCTGAACGTT TGAGAACCGC 660 

CTGAATCCGC ATTTTAAGTT CTTCTAGGTA GAAAGGTTTG GTCAGATAAT CATCCGCTCC 720 

CAGTTCAAAT CCATGTCCCT TGTCATCCAA ACTTTCCTTG GCAGTCATAA TCAGAACTGG 7 80 

TGTOGTAATT CCCTTTTCAC GCAATTCTTT TAAGACTTGG AAACCATTTT TTTCTGGCAA 840 

CATCAAATCC AGCAAAATCA AGTCATAGAC ACCACTCTCA GCTTCGTAGA GACCTTCTTC 900 

TCCATCAAAT ACCTGCATAA CATCCGCAAA ATCGTCTAAA AAGTCAAATA CTGAATTTCA 960 

CAGACCTAGG TCATCCTCAA CCAATAAGAT TTTTATCATG AGAAACTCCT CCTTATTAAA 1020 

ACTATTATAC CAAATTTGCC TTAAAAAAAA CTCAACTCTC TGCATTTTAC ATGAGATACC 1080 

TGAGTTTTCT TTTTATTTTA GGCTTATTTA TGCATTTCCG TATTGAAGAA CAACTGCTTC 1140 

GACTGCAGCT TTTTCACGGC TAATCAAGTC AACACGCGCT GCAATTTCCT TGATTCCCAT 1200 

ACCGATGTTA CGGCTAAGAG CAAGGTCAGA AAGTTGCGGT TCAAAGAACT CCTTGTATTC 1260 

CGCCAAGCGT TGCTCAGTCT TAAATACATG AGCACGAAGG ATAACAAAGC TATCAAAGCT 1320 

CATATCTCCT CCAAGGGCTG CCTTAATCCA AGCCCACTTT TCACGCGCCC AAGACCAAGC 1380 

TGTTTTCTGA GTTGCTTGAT GAGCTAGGAA TTGGTAATAC CAAGCAGACA AGTCCTGTGG 1440 

TTTGACCACA AATTTGTCCT TCCAAGAAGT AATCAGGTTT TGGATATTAT CCGCATCTGT 1500 

ACTGTATGCA AGAGCTGCTG CCAACTGGCC TTTAAAGACA GCATCTGTTG CGTGAGTATA 1560 

AGTATCAAGA TAAAGTGCTA ACAAGTCTTT AGTCTCATGA TGTTTCATCT CATTAATCAG 1620 

AACTTGTGAG CGAATAGCTG CTGGGAGTCC TGCAAGATTC TCCTTGTGTG TTGCGAAGAT 1680 

TTGGCTAGCG ACTTGACTAG CTTCTGCATC ATTTGAGCGA ATCATCATCG AAACAGCCAG 1740 

CTGACGAACC AATTCATCCT GATCTGATTC TCCGTCTTTA GCTTCAAAAC CAAGACGGTC 1800 

ATAGTTATGA CGAGCCAATT TAGCAACCAG TCCTTTGAAG GCTGTTTCAG CATCCGTTCC 1860 

TTCATCAATA AAGCGCTCAA GGGCTGAAAT CACTTGAGAA ACAGCTGAAA CCACCAGATA 1920 

AGACTCTTCC TTAGCAAGTT TATCAAGAAC TGGAAGCAAG TCTGCATAAG AAATGTGCCC 1980 

TGCCTCAGCC AACAAACGAC GTTCTTGAAC AATTTGCAGT TTGCTTGTGT TATCAAGTGT 2040 



WO 98/18931 



PCTYUS97/ 19588 



633 



CTCTAGCTCA GCAAGAACAG 


CTGCTAACAA 


GTCTCCTTGA 


TAGTCGGTAA 


TATAGTGGGC 


2100 


AGTATTTTCA GTGTTGAGAC GAAGAGCTCC TTCATTTTCA 


CCAAGAAGAG 


CTGCGTAGCC 


2160 


AGGGATTTCG 


ATACTTTCAG 


TTTCGAGTGT 


ATCAGGCAAG 


CCTTTCCAGT 


TGCTATTGAG 


2220 


GGGCACCACC 


CAGACACGGT 


TCTTGTCTTC 


GTTCTCACCG 


ATGAAGAATT 


GTTTTTGTGA 


22B0 


AATCTTCAAG 


ACATCATTTT 


CAACTTTAAC 


AGTAAGAACT 


GGGTAACCAG 


GCTGTTCCAA 


2340 


CCAAGAATCC 


ATGAAGGCTG 


CGACATCACG 


TCCTGACGCT 


TGACCAAGGG 


CATCCCAAAG 


2400 


GTCACTACCA 


ATGGTGTTGC 


TGTATTGGTG 


TTTTTCAAAG 


TAGGCGTGCA 


AACCTTTAGC 


2460 


AAAATCAGCA 


TCTCCTAGCC 


AACGGCGAAG 


CATGTGCATG 


AGACGGCTTC 


CTTTGGCATA 


2520 


GACGATAGCG 


CCGTCAAAGA 


GTGTATTGAT 


TTCATCTGGA 


TGTTTAACTT 


CGACGTGGAC 


2580 


AGACTGAACG 


CCATCAGTAG 


CGTCACGTTC 


AAGAGCAAGA 


GGTACTCCAC 


CTGTTTGGAA 


2640 


ATCTTCAAAG 


ATATTCCAGC 


TTGGTTCGAT 


GGTATCCACA 


CAGACGTATT 


CCATCATATT 


2700 


AGCGAAACTT 


TCATTGAGCC 


AAAGGTCATC 


CCACCATTTC 


ATAGTCACGA 


GGTTCCCAAA 


2760 


CCATTGGTGA 


GCCAATTCAT 


GGGCCACAAC 


AAGGGCAACT 


TGTTGACGGC 


TAGCAAATGT 


2820 


AGAGTTCTCA 


TCGACAACCA 


AGTAAACTTC 


ACCGTAGGTC 


ACAAGACCCC 


AGTTTTCCAT 


2880 


AGCACCAGCT 


GAGAAGTCAG 


GAAGGGCGAT 


GTCGAGAGAT 


TGAGGAATTG 


GGTACTTAAC 


2940 


TCCATAGTAA 


TCTTCGTAAA 


ACTCGATAGA 


GCGAACAGCG 


ATATCCAGTG 


AGAAATCAAC 


3000 


ATTTGAAAGT 


GGATGTGCTT 


TGGTTGAGTA 


GACACCTACC 


AGGGTACCAT 


TTTTAGTTTT 


3060 


AGCGGTCACC 


CCTTGCAAAT 


CACCAGCAAC 


AAAGGCCAAC 


AAGTAAGAAG 


ACATGCGAGG 


3120 


TGTTGTCTCA 


AACTTCCAGA 


TACCTGTTTC 


CTTACGGTTT 


TCAACATCCA 


TTTCTGGCAT 


3180 


GTTTGACAAG 


GCCAATTCAC 


CTTCTGCTTG 


GTCAAAGCCA 


AGAGAGAGGT 


CAAAAGTTGC 


3240 


TTTGGCTTCA 


GGCTCATCCA 


CACATGGGAA 


AGcTTCGCGC 


GCAAAATGGC 


TCTCGAACTG 


3300 


AGTAGACAAG 


ACCTCCTTCT 


TGACTCCATC 


AACTGTATAA 


TAAGAAGGCT 


AAATCCCTGT 


3360 


CATGTTGTCT 


GTAATTTTAC 


CAGAAAAGGC 


AAGAACCAAT 


TCAACTTGAC 


CAGCCTCAGC 


3420 


CAATTCGATA 


TGAAGGGCTT 


CATTGTCATG 


GTCAACTGTA 


AATGGACGAG 


CTTGACCTGC 


3480 


AACTTCTACA 


GAGGTGATTT 


CCAAATCTTT 


TTGGTGGAGG 


GAGATGCGGT 


CACTCTGTCC 


3540 


TTGACCAGTG 


ATGGTCACTT 


TCCCAGAAAA 


AGTCTTGGTC 


TCACGACTCA 


AATCTAAAAA 


3600 


TAAATCATAA 


TGTTCAGGAA 


CAAATTGCTT 


AATGGG 






3636 


(2) INFORMATION FOR SEQ ID NO: 79: 









(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5066 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



ATAGCGTGTA 


ATAATCGATT 


TTAGAGGTAC 


CATAAGCCAC - 


CTCCTACAAA 


TAGAAACCGA 


60 


TATAAATCAA 


TGCCTTCCAC 


CCTTAGACTT 


CCCTAGTTCC 


TGTCTCAAGC 


GAAACATTTC 


120 


TTTGAAACAG 


GAATAAGTTA 


ACCAATTCAT 


ACCAATAGCT 


AGCAGAATAA 


AAAGAAACCA 


180 


AATGCCCCAT 


AACTTGATAT 


CTGTCACATT 


TCTCAAGACG 


GTATTGAAAA 


ACAGAACTGA 


240 


AACAACTGTC 


CAAGCAAGGC 


TAAAAAGAGA 


ATAGAAGGGG 


ATGTAAAACC 


AGTAAAAATA 


300 


ATAAAAAATT 


GGAAAAAACT 


TACTATTTCT 


GTTGGCCTTT 


TCAATCCACT 


TATCAAAATA 


360 


AAAGTACGGT 


GCTAAAAGTA 


AGAATTTAAA 


CAAATGTTCC 


ATCACCGACA 


TCCCCCCTTC 


420 


TTTTGATAGC 


GTTTTCTATT 


ATTTTATTAT 


ATCAAAAAAA 


TCCGGAACTG 


TCATTCCAGA 


480 


TTCTACTTTT 


TTATTTGCGT 


TTTCTTGCGA 


TGAGATGAAT 


CGGTGTTCCC 


TCAAAAACAA 


S40 


AGGCCTTGCG 


GATTTGATTT 


TCCAAGAAAC 


GCAGGTAAGA 


AAAGTGCATG 


AGTTCTTCTT 


600 


CATTGACAAA 


GATGACAAAG 


GTTGGTGGTT 


TGGTTGCCAC 


TTGGCTCGCA 


TAGAAAATCT 


660 


TGAGACGTTT 


TCCTTTGTCT 


GTCGGTGTTG 


GGTTGATGGC 


AATGGCATCC 


ATGATGACAT 


720 


CGTTCAAGAC 


AGCTGATGGA 


ATACGTGTAT 


TTTGACTTTC 


GCTGATTTGC 


TTAATCATCT 


780 


CAGGAAGTTT 


GTGGAGACGT 


TGCTTGGTTA 


AAGCTGATAC 


AAAGATAATC 


GGTGCGTAAG 


840 


GCAGGTATTG 


GAACTGCTCA 


r^r* pats'? f^ f T lf T K 


Uii ^— v_ i h 


TTTCATAGTG 


TGGTTATCTT 


900 


TTTCAAGCGT 


ATCCCACTTG 


TTGACCACGA 


TAATCATCCC 


TTTACCAGCT 


TCATGGGCAA 


960 


ATCCTGCGAT 


ACGCTTGTCG 


TACTCACGAA 


TGCCTTCTTC 


CGCATTGATG 


ACCATCAAGA 


1020 


CCACATCTGA 


ACGGTCAATA 


GCACGCATGG 


CACGCATAAC 


AGAGTATTTC 


TCAGTATTTT 


1080 


CATAAACCTT 


ACCAGACTTA 


CGCATACCAG 


CCGTATCAAT 


CATGGTAAAC 


TCTTGACCAT 


1140 


CTGTATCTGT 


AAAGTGGGTA 


TCAATGGCAT 


CACGAGTTGT 


TCCAGCAACA 


GGACTAGCAA 


1200 


TAACACGGTC 


TTCTCCCAAG 


ATAGCATTGA 


TCAAGCTTGA 


TTTTCCAACG 


TTAGGACGAC 


1260 


CAATCAAGCT 


AAACTTAATG 


ACATCTGGAT 


TTTCTTCCTC 


ATATTCATTT 


GGAAGATTTT 


1320 


CTACGATCGC 


ATCTAGCACA 


TCCCCTGTAC 


CGATTCCATG 


GACAGATGAG 


ATAGGCAATG 


1380 


GTTCACCCAA 


ACCGAGAGCA 


TAGAAATCAT 


ATATATCATT 


TCTCATCTCA 


GGGTTGTCCA 


1440 


CCTTGTTGAC 


TGCGAGGATA 


ACTGGTTTGT 


GGGTCTTATA 


AAGCTTACGA 


GCTACGTATT 


1500 


CGTCTGCATC 


AGTAATTCCT 


TCCTTACCAG 


1 ACACGACAAA 


. AACGATAACA 


> TCTGCTTCTT 


1560 



WO 98/18931 



PCT/US97/19588 



635 

CCATGGCAAT TTCTGCCTGG TGCTTGATTT GTTCCATGAA AGCAGCATCG ACATCATCAA 1620 

TTCCTCCTGT ATCAATCATG CTAAAAGAAC GATTGAGCCA CTCACCCGTT GCATAAATAC 1680 

GGTCACGTGT CACTCCTTCG ACATCTTCTA CAATGGAGAT TCGCTCACCA GCGATCCGAT 1740 

TAAATAGGGT TGATTTCCCA ACATTGGGAC GTCCTACAAT GGCAATAGTT GGTAGGGCCA 1800 

TAATTTCTCA CTTTCTACAA TAATTTCTTC TGTTCAAGAT TTTTTCTAGT TGAGCTTGGT 1860 

TCAGCTTGAC CAAACTGTTC TGCTAGGCGC TGACTCCAGC TTGTGGTCCC ACGCGCCCCA 1920 

GCATAGTCAG CCTGAACACG GTCATAAGCT TGGATTGCCT CAGTTGACTG TTCTTGGTAT 1980 

TCTTCCTCAA AGACAACATT CTCTAGTGGC AGTCTCGGTT TCATATCATG ATGTTGATTT 2040 

GGCACACCCA GTCCCATCCC AAAGACAGAA TAGGTGTAGT CAGGTAGGTT AAAGAGCTCT 2100 

GCCACTTCTT CAGACTTGTA TCGAACCAAA CCGATAATCA CACCACCATA GCCCAAGCTT 2160 

TCAGCTGCCA ACAAGGCCTT TTGTCCAGCA AGAGCTGCAT CGACCGAACT AATCAAGAGA 2220 

4 CCTTCCACAC CTTGGGGTTG GAAGGTGTCG GTATGAAGTC GGGCTCCCTT TTCTGCTCGG 2280 

TTCAAATCTC CGACAAAGAG AAGGAAAACA GCAGACTGGC GAATGGCTTC TTGAGGTACC 2 340 

AATTCATACA AGGCATCTTT CTTCTCTTGA CTTCGTACCA CAATCACAGA GTAGGATTGG 2400 

AAATTCTTCC AAGATGATGC CATCTGGGCT GCTGTCAAAA TCTCATTTAA GTCTACTTGG 2460 

GGAATTTCTT GCTCTTTAAA CCTGCGCACT GAAGTATGAC CCTTCATCAA TTTAATGGTT 2520 

TCTGTCATCG ACGGTTTACT CCTTCTAAAC GAGTCTCCTC AGCCAAATAA CGGATGCGTT 2580 

CCATGACCCG TCTGGCTTCC CAGGTTTCGT CATTTCCATG TTTCACTTTC GCAAAATGCT 2640 

TCTCCAAATC TTCAAAGTTG AAGTTGGATG TGAAAAAGGT CGGTAAATTT TCCTGCATCC 2700 

GATATTGGAG AATGACCTGC AGGATTTCGT CACGCACCCA AACCGTTGAT TGCTCGGCGC 2760 

CAATATCATC TAAAATCAGG ACCTCAGACA GCTTAATCTC ATCCACCAAG GTCTTAACAT 2820 

TGCCATCACT GATAGCATTT TTGACATCAA TGACAAAGCT AGGATAGTGG AGGAGAGTTG 2880 

ATGAAACACC ACGTTTTTCT GATAAATCAT GAGCTAAGGC CGCCACCATG AAACTTTTAC 2940 

CCACACCAAA GTCTCCATAT AAGTAAAGAC CTTTTCGAAT ACCTGCATAT TGCTCCACGA 3000 

AGGCTAGTAG CTTTTCAAAA ACTCGTAAGC GCCCCAAATC ATCCAAGTCA ACTTGAGCCA 3060 

AACTAGCTTT CTTGAGACTG GCTGGTAGAT TGATTAACTT GAGACGGTTC TTAATAGCCG 3120 

CTTCTTTTTC AGCCGCGATT AGCTCAGGAG TTTCTTCATA TGAAACATCT GCATAACCAT 3190 

GATTCTTAAC CAAAATCGGC TTGTAGCCTT TGGCAATATA AT CCGTATCC CCACGGAGAA 3240 

ACTTGTCACG CTCGGTGATG TACTGATTAA ACTTGGAGAT ACTGCGATTT AATTCCTTTG 3300 
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GAGTTAACGA TTCTTGCTGG ATAAAGGCCG CAACATCAGG GTCCTTCATG ATTTTCTGGA 3 360 

CCAAATCTTG ATAATAAAAA CGGCTGGGTT GACGTTTGAG TACGTCTCCG ACACTTTCCA 3420 

TCTAATCTCC TCCTTTTTCT AATCGAGCTA ATAGTTCTTG CTTCTTACGT TCTAGTTCCA 3480 

GACGAGTTTC CTCGCTGGTT TCATTCTTAT ATTCAGGATT ACTCCATTTA GGAACATTGG 3540 

TTTTTTCTGG GGCAGTCTGA TTCTGTTTTT GTGTTTTTGC TTTCTGCCCT CGATCACGAA 3600 

TTCGTAAAAC GGCCTCTTCT GCCGAATGAA TCTTTTGATA GGCATAGTCA TTGGCTACCT 3660 

TCATGGCATA TTTCTCATTG ATATTTGCCG AATCCACCTT ATTAAAGGTC AATAAGAGAA 3720 

TAATATTGAT GACTTCGTCC AGTAAGCCCA AGCCAGCCAT CTGTTGCAAG AGTTCTCTTT 3780 

CTGTTTGGGT AATGGTTCCC TTGCGTGTTT GCTTGATTTC TGCTAAGAAC TGCAGGGCAG 3840 

TTTTACTTTT AGCTTCTTTG ATAATGGTCG CTTCCTTAAG ACTAAAGTCA GAGGAAACTG 3900 

GTTTTTGAGC AATTTTTTCA CGCATGCGTT TGGTTGAAAT AACCTGGGAA ACAGCTGTTG 3960 

ACTTGGCCAA TTGATAGGTT TCAAACCAAG TCCATTTCTT CTCCTCGGCA ATAGCAAAGA 4020 

GGTTTAAGAC ATCGGACTGC TCATCCGCAA AACGAAGTCC ATCTCGAGCC ATCAGCTGGC 4080 

GAAAATGTTC CAAGTCAAAA TCATTGGCCA CTTTCTTCTT GAGACCAAGG TCTTCTTGAC 4140 

TGCCTAGTTC TGCCAATTCT GGAAAGACTT GATTCACTGA GACAGGTATT TCTTCACCAT 4200 

CAGCACTTTC AACTTTCAAA TCCTCCACAG CTACATCGCC AATCTTTTTC TCTAAGAGTC 4260 

TCCGATAAAC AGGATGCCCC AAGAAGTCTT G ACT AG AT AG AGGAGCATGG AGGGCTACCT 4320 

GATAAACATC ACCCTTTTGA TAGAGGGTCA AGAGATTAAA AGCAGATAAG ATTTTCAATC 4 380 

ATTTTATCAG TCTATCCATC CCAAAGTTGA GATGGTTGAG AATGCTTGAA AAAAGATATT 4440 

CCTTTCTACC ATTATCCCAA AAACTGATTG TATAAAGATA AAGGCTCAGT GCCTCCTGAC 4500 

CGATAATCGG GAGGTAGCAC TGTACCAGAG ATGAGGTATC TTGCGACACC CGATTATTCT 4560 

TTAGATAAGA AAAACGGTCA ATTGGCTTCA TTTATCTTTC CTTTTTCTTT TTAGAGGACT 4620 

GGGTGATTTG TTGGAGCAAG CTCTCTAACT CACTGACATC CTTAAAACTA CGATAGACAC 4680 

TAGCAAAACG TACATAGGTA ATCTCGTCCA ATTCAGCCAA CTCCTCCATG ACGAGTGAAC 4740 

CAATGTCCTC ACTTTGAATT TCATTTTCAT TTCGACCACG GAGTTTCTGT TCGATACGAT 4800 

TGACTACCAT GTTGATTTCA TCACTTGACA CAGGACGTTT CTGGGCTGAG CGGATAATCC 4860 

CATTAAAGAT TTTATCTCTG GAGAATTGTT CCCGTGTGCC ATCTTTTTTA ACAACCACTA 4920 

AGGTTCTTTC TTCTACTCGT TCGTAGGTTG TAAAACGGTG TTGGCATTCG TCGCACTCAC 4980 

GTCTTCTACG AATGGTGTTC CCTTCTTCTG CTTGGCGACT ATCGATAACA CTTGACTTGG 5040 

TAGCCCCACA TTTTGGACAG GGTACC 5066 
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(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9607 base pairs 
(Bl TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 



CACTTGAAGT 


ATTTGAAACA 


GCTATGGAAA 


ACATCATGCC 


TGTACTTGAA 


GTACGTGCAC 


60 


GTCCTGTTGG 


TGGTTCTAAC 


TACCAAGTCC 


CAGTTGAAGT 


TCGTCCAGAA 


CGTCGTACAA 


120 


CACTTGGACT 


TCGTTGGTTG 


GTAACAATCG 


CTCGTCTTCG 


TGGTGAACAC 


ACAATGCAAG 


180 


ACCGTCTTGC 


AAAAGAAATC 


TTGGATGCTG 


CTAACAACAC 


TGGTGCAGCA 


GTTAAGAAAC 


240 


GTGAAGATAC 


TCACCGTATG 


GCTGAAGCTA 


ACCCTGCATT 


CGCACACTTC 


CGTTGGTAAG 


300 


ATAGGATGCG 


AAAGCGTTAA 


GAAAGTCCCA 


GAGAAAATAG 


GGAATCGAAG 


CAGGTTGCGG 


360 


TTGCAACCAA 


TGAGATTCAT 


CTTTTTCTCC 


AGACTTTTAG 


CTTGACCTCA 


ACTAAATCAT 


420 


GATGCTAGCA 


ACGGTAAGGA 


TGCAAGGTAA 


AAATAGGAAA 


CTGACGCAGT 


ATTCGACGAA 


480 


TACAAGGAGT 


TTTATCTTTT 


TCACGCAGCA 


TCCCGTTCCA 


GCTCACATCG 


GCTAACTAAC 


540 


TTTAGCCCGG 


GTTCAAATTA 


GCTAAATCGA 


TTAGTATTAG 


CTATAACTCA 


GCTTACCATC 


600 


TCGTAAGTTG 


AAACCAACAA 


TAGCATGAAA 


ACATTGACAA 


CCGGTAGGTC 


CTGCCTATCC 


660 


GTTTTTATTA 


AAATCGTGTT 


ATAATAGAAT 


AGAAATCAAA 


AATAAATAGG 


AGAAACAAAC 


720 


CTCATGGCAC 


GCGAATTTTC 


ACTTGAAAAA 


ACTCGTAATA 


TCGGTATCAT 


GGCTCACGTC 


780 


GATGCCGGTA 


AAACAACAAC 


TACTGAGCGT 


ATTCTTTACT 


ACACTGGTAA 


AATCCACAAA 


640 


ATCGGTGAAA 


CTCACGAAGG 


TGCGTCACAA 


ATGGACTGGA 


TGGAGCAAGA 


GCAAGAACGT 


900 


GGTATCACGA 


TCACATCTGC 


TGCGACGACA 


GCTCAATGGA 


ACAACCACCG 


CGTAAACATC 


960 


ATCGACACAC 


CAGGACACGT 


GGACTTCACA 


ATCGAAGTAC 


AACGTTCTCT 


TCGTGTATTG 


1020 


GATGGTGCGG 


TTACCGTTCT 


TGACTCACAA 


TCAGGTGTTG 


AGCCTCAAAC 


TGAAACAGTT 


1080 


TGGCGTCAAG 


CAACTGAGTA 


CGGAGTTCCA 


CGTATCGTAT 


TTGCCAACAA 


AATGCACAAA 


1140 


ATCGGTGCTG 


ACTTCCTTTA 


CTCTCTAAGC 


ACACTTCACG 


ATCGTCTTCA 


AGCAAATGCA 


1200 


CACCCAATCC 


AATTGCCAAT 


CGGTTCTGAA GATGACTTCC 


GTGGTATCAT 


TGACTTGATC 


1260 


AAGATGAAAG 


CTGAAATCTA 


TACTAACGAC 


CTTGGTACGG 


ATATCCTTGA 


AGAAGACATC 


1320 


CCAGCTGAAT 


ACCTTGACCA 


AGCTCAAGAA 


TACCGTGAAA 


AATTGATTGA 


AGCAGTTGCT 


1380 
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GAAACTGACG AAGAATTGAT GATGAAATAC CTCGAAGGTG AAGAAATCAC TAACGAAGAA 1440 

TTGAAAGCTG GTATCCGTAA AGCGACTATC AACGTTGAAT TCTTCCCACT ATTGTGTGGT 1500 

TCAGCCTTCA AAAACAAAGG TGTTCAATTG ATGCTTGATG CGGTTATCGA CTACCTTCCA 1560 

AGCCCACTTG ACATCCCAGC AATCAAAGGT ATTAACCCAG ATACAGACGC TGAAGAAATT 1620 

CGTCCAGCAT CTGACGAAGA GCCATTTGCA GCTCTTGCCT TCAAGATCAT GACTGACCCA 1680 

TTCGTAGGTC GTTTGACATT CTTCCGTGTT TACTCAGGTG TTCTTCAATC AGGTTCATAC 1740 

GTATTGAATA CTTCTAAAGG TAAACGTGAA CGTATCGGAC GTATCCTTCA AATGCACGCT 1800 

AACAGCCGTC AAGAAATCGA CACTGTTTAC TCAGGTGATA TCGCTGCTGC CGTTGGTTTG 1860 

AAAGATACTA CAACTGGTGA CTCATTGACA GATGAAAAAG CTAAAATCAT CCTTGAGTCA 1920 

ATCAACGTTC CAGAACCAGT TATCCAATTG ATGGTTGAGC CAAAATCTAA AGCTGACCAA 1980 

GACAAGATGG GTATCGCCCT TCAAAAATTG GCTGAAGAAG ATCCAACATT CCGCGTTGAA 2040 

ACAAACGTTG AAACTGGTGA AACAGTTATC TCAGGTATGG GTGAACTTCA CCTTGACGTC 2100 

CTTGTTGATC GTATGCGTCG TCAGTTCAAA GTTGAAGCCA ACGTAGGTGC TCCTCAAGTA 2160 

TCTTACCGTG AAACATTCCG CGCTTCTACT CAAGCACGTG GATTCTTCAA ACGTCAGTCT 2220 

GGTGGTAAAG GTCAATTCGG TGATGTATGG ATTGAATTTA CTCCAAACGA AGAAGGTAAA 2280 

GGATTCCAAT TCGAAAACGC AATCGTCGGT GGTGTGGTTC CTCGTGAATT TATCCCAGCC 2340 

GTTGAAAAAG GTTTGGTAGA ATCTATGGCT AACGGTGTTC TTGCAGGTTA CCCAATGGTT 2400 

GACGTTAAAG CTAAGCTTTA TGATGGTTCA TATCACGATG TCGACTCATC TGAAACTCCC 2460 

TTCAAGATTG CGGCTTCACT TTCCCTTAAA GAAGCTGCTA AATCAGCACA ACCAGCTATC 2520 

CTTGAACCAA TGATGCTTGT AACAATCACT GTTCCAGAAG AAAACCTTGG TGATGTTATG 2580 

GGTCACGTAA CTGCTCGTCG TGGACGTGTA GATGGTATGG AAGCACACGG TAACAGCCAA 2640 

ATCGTTCGTG CTTACGTTCC ACTTGCTGAA ATGTTCGCTT ACGCAACACT TCTTCGTTCT 2700 

CCATCTCAAG GACGTGGTAC ATTCATGATG GTATTTGACC ACTACGAAGA TGTACCTAAG 2760 

TCAGTACAAG AAGAAATTAT TAAGAAAAAT AAAGGTGAAG ACTAATCCGT CCTCACTCTA 2820 

GAAGGAAGTC ACTTAGTGGC TTCCTTTTCT CTTTAGAAAA TACCTCTAAA TATGGTAAAA 2880 

TAGTAGAAGA ATAATGTGAG GAAAATGAAT GTCAAATAGT TTTGAAATTT TGATGAATCA 2940 

ATTGGGGATG CCTGCTGAAA TGAGACAGGC TCCTGCTTTA GCACAGGCCA ATATTGAGCG 3000 

AGTTGTGGTT CATAAAATTA GTAAGGTATG GGAGTTTCAT TTCGTATTTT CTAATATTTT 3060 

ACCGATTGAA ATCTTTTTAG AATTAAAGAA AGGTTTGAGC GAAGAATTTT CTAAGACAGG 3120 

CAATAAAGCT GTTTTTGAAA TTAAGGCTCG GTCTCAAGAA TTTTCAAATC AGCTCTTGCA 3180 
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GTCCTACTAT AGGGAGGCTT TCTCTGAAGG TCCATGTGCT AGTCAAGGTT TTAAGTCCCT 3240 

TTATCAAAAT TTGCAAGTTC GTGCTGAGGG TAATCAGCTA TTTATTGAAG GATCTGAAGC 3300 

GATTGATAAG GAACATTTTA AGAAGAATCA TCTTCCTAAT TTAGCCAAAC AACTTGAAAA 3360 

GTTTGGTTTT CCAACTTTTA ACTGTCAAGT CGAGAAGAAT GATGTCCTGA CCCAAGAGCA 3420 

GGAAGAGGCC TTTCATGCTG AAAATGAGCA GATTGTTCAA GCTGCCAATG AGGAAGCGCT 3480 

CCGTGCTATG GAACAACTGG AGCAGATGGC ACCTCCTCCA GCGGAAGAGA AACCAGCCTT 3540 

TGATTTTCAA GCGAAAAAAG CTGCAGCTAA ACCCAAGCTG GATAAGGCGG AGATTACTCC 3600 

TATGATCGAA GTGACGACAG AGGAAAATCG TCTGGTATTT GAAGGGGTTG TTTTTGATGT 3660 

GGAGCAAAAA GTGACTAGAA CAGGTCGTGT TTTAATCAAC TTTAAAATGA CGCACTATAC 3720 

TTCAAGTTTT TCTATGCAAA AGTGGGTTAA AAACGAGGAA GAGGCCCAGA AGTTTGACCT 3780 

CATCAAGAAG AATTCTTGGC TCCGAGTTCG AGGGAATGTG GAGATGAATA ACTTCACACG 3840 

CGATTTGACT ATGAACGTAC AGGATCTGCA GGAAGTTGTT CACTATGAGC GGAAGGATTT 3900 

GATGCCAGAA GGTGAGCGTC GGGTTGAGTT TCATGCTCAT ACTAACATGT CGACTATGGA 3960 

TGCTTTGCCA GAGGTCGAAG AGATTGTTGC AACAGCTGCT AAGTGGGGAC ACAAGGCGGT 4020 

TGCTATCACG GACCATGGGA ATGTCCAGTC CTTTCCACAT GGCTATAAGG CGGCTAAGAA 4080 

AGCGGGAATC CAGCTGATCT ATGGGATGGA AGCCAATATC GTGGAGGACC GTGTCCCTAT 4140 

CGTCTATAAC GAAGTGGAGA TGGACTTGTC AGAACCAACC TACCTGGTCT TTGACGTGGA 4200 

AACGACGGGA CTTTCAGCTA TCTATAATGA CTTGATTCAG GTTGCGGCTT CTAAGATGTA 4260 

CAAGGGGAAT GTTATTGCTG AATTTGATGA ATTTATCAAT CCTGGCCATC CCTTGTCAGC 4320 

CTTTACTACA GAGTTAACTG GAATTACAGA TGATCATGTC AAAAATGCCA AACCACTAGA 4380 

ACAAGTTTTG CAAGAATTCC AAGAATTTTG CAAGGATACG CTCCTAGTTG CCCACAATGC 4440 

TACCTTTGAC GTTGGCTTTA TGAATGCTAA TTATGAGCGG CATGATCTTC CAAAGATTAC 4 500 

TCAGCCAGTT ATTGATACGC TGGAGTTTGC TAGAAACCTC TATCCTGAGT ATAAACGCCA 4560 

TGGTTTGGGG CCTTTGACCA AGCGTTTTGG TGTGGCCTTG GAACATCACC ACATGGCCAA 4 620 

CTACGATGCG GAAGCGACTG GTCGTCTGCT TTTCATCTTT ATCAAAGAGG TAGCAGAAAA 4680 

ACATGGTGTG ACCGATTTAG CTAGACTCAA CATTGATCTA ATCAGTCCAG ATTCTTACAA 474 0 

AAAAGCTCGG ATCAAGCATG CGACCATCTA TGTCAAGAAT CAGGTAGGTC TAAAAAATAT 4800 

CTTTAAGCTG GTTTCCTTGT CTAATACCAA GTATTTTGAA GGAGTGCCAC GGATTCCGAG 4860 

AACGGTTCTA GATGCCCATC GAGAGGGCTT GATTTTAGGT TCAGCCTGTT CAGAGGGTGA 4 920 
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AGTTTTTGAC GTGGTCCTTT CTCAAGGTGT GGATGCGGCG GTTGAGGTGG CCAAGTATTA 4980 

TCATTTTATC GAGGTCATGC CACCGGCTAT CTATGCACCC TTGATTGCCA AAGAGCAGGT 5040 

CAAGGATATG GAGGAACTCC AGACCATTAT CAAGAGTTTG ATAGAGGTTG GAGACCGCCT 5100 

TGGCAAGCCT GTTCTGGCTA CCGGAAATGT TCACTATATC GAACCCGAAG AAGAGATTTA 5160 

TCGTGAAATT ATCGTCCGTA GTTTGGGACA GGGTGCGATG ATTAATCGAA CTATCGGTCA 5220 

TGGTGAACAT GCCCAACCAG CACCACTTCC AAAGGCTCAT TTTCGAACGA CTAATGAGAT 5280 

GTTGGATGAA TTTGCCTTTT TGGGAGAGGA ACTGGCTCGT AAACTCGTTA TTGAAAACAC 5340 

CAATGCCTTG GCAGAAATAT TTGAATCCGT TGAAGTCGTT AAGGGTGACT TGTATACGCC 5400 

TTTCATCGAC AAGGCTGAAG AAACAGTTGC TGAGTTGACC TATAAGAAAG CTTTTGAGAT 54 60 

TTATGGAAAT CCGCTGCCAG ATATTGTTGA TTTGCGGATT GAAAAAGAAT TAACATCCAT 5520 

ACTGGGGAAT GGATTTGCTG TGATTTATCT GGCATCGCAG ATCCTGGTGC AACGTTCTAA 5580 

TGAACGGGGT TATTTGGTTG GTTCTCGTGG GTCTGTCGGA TCTAGTTTCG TTGCGACCAT 5640 

GATTGGGATT ACGGAGGTCA ATCCTCTCTC TCCTCACTAT GTCTGTGGTC AGTGTCAGTA 5700 

CAGTGAGTTT ATCACAGATG GTTCCTACGG TTCAGGATTT GATATGCCCC ATAAGGACTG 5760 

TCCAAACTGT GGTCACAAAC TCAGTAAAAA CGGACAGGAT ATTCCGTTTG AGACCTTCCT 5820 

TGGTTTTGAT GGGGATAAGG TTCCTGATAT TGACTTGAAC TTCTCGGGAG AAGATCAGCC 5880 

TAGCGCCCAC TTGGATGTGC GTGATATCTT TGGTGAAGAA TATGCCTTCC GTGCGGGAAC 5940 

GGTTGGTACG GTAGCTGCCA AGACTGCCTA TGGATTTGTC AAAGGTTACG AGCGAGATTA 6000 

TGGCAAGTTT TATCGTGATG CAGAAGTAGA ACGCCTCGCT CAAGGAGCGG CGGGTGTCAA 6060 

GCGGACAACA GGCCAACACC CGGGGGGAAT CGTTGTTATT CCGAACTACA TGGATGTCTA 6120 

CGATTTTACG CCTGTCCAGT ATCCAGCAGA TGATGTCACG GCTGAATGGC AGACCACTCA 6180 

CTTTAACTTC CACGATATCG ATGAGAACGT CCTCAAACTC GATCTACTGG GACATGATGA 6240 

TCCGACTATG ATTCGAAAAC TTCAGGATTT GTCTGGTATT GACCCTAATA AAATTCCTAT 6300 

GGATGACGAA GGCGTGATGG CACTCTTTTC TGGGACTGAT GTGCTAGGGG TAACACCTGA 6360 

ACAAATTGGA ACGCCTACGG GTATGTTGGG GATTCCAGAG TTTGGAACAA AXTTCGTACG 6420 

TGGAATGGTA GACGAAACCC ATCCGACAAC CTTTGCGGAA TTGCTTCAGC TGTCTGGTCT 6480 

GTCCCACGGT ACTGATGTTT GGTTGGGGAA TGCTCAGGAT CTGATTAAGC AAGGAATAGC 6540 

GGACCTATCG ACTGTTATCG GTTGTCGGGA CGACATCATG GTTTACCTCA TGCATGCGGG 6600 

TCTGGAACCT AAGATGGCCT TTACCATTAT GGAACGGGTA CGTAAGGGTT TGTGGCTAAA 6660 

GATTTCAGAA GAGGAGAGAA ATGGCTATAT CGAAGCAATG AAGGCTAATA AGGTGCCAGA 6720 
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GTGGTATATC GAATCCTGTG CGAAAATTAA GTACATGTTC CCTAAGGCCC ATGCGGCAGC 6780 

CTACGTTATG ATGGCCTTGC GTGTAGCTTA CTTCAAGGTT CACCATCCTA TTTATTACTA 6840 

CTGTGCTTAC TTCTCCATTC GTGCTAAGGC TTTTGATATC AAGACCATGG GTGCGGGCTT 6900 

GGAGGTCATC AAGCGCAGAA TGGAAGAAAT CTCTGAAAAA CGGAAGAACA ATGAAGCCTC 6960 

TAATGTGGAA ATCGATCTCT ATACAACTCT TGAGATTGTC AATGAGATGT GGGAACGAGG 7020 

TTTCAAGTTT GGTAAATTAG ATCTCTACTG TAGTCAGGCG ACAGAGTTCC TCATCGACGG 7080 

GGATACCCTT ATCCCACCAT TTGTAGCAAT GGATGGTCTG GGAGAGAACG TTGCCAAGCA 7140 

ACTGGTGCGG GCGCGTGAAG AGGGAGAATT CCTCTCTAAA ACAGAACTAC GCAAGCGTGG 7200 

TGGACTCTCA TCAACCTTGG TTGAAAAGAT GGATGAGATG GGTATTCTTG GAAATATGCC 7260 

AGAGGATAAC CAGTTGAGTT TGTTTGATGA GTTGTTTTAA AAAATTGCTT AATAATCTAT 7320 

TAAAAGAGGC TAACGTATAT CCAATAGATT TACATTAGCT TTCTTTTTTG TTAAAATAGT 7380 

CTATGGAAAG AGGGTGAGAG TATGTCAAAG ATGAGTATAA GCATCCGTCT GCATAGTGAG 74 40 

GTTAAGGAGC AGGCCCAACA C C TGTTTAGT AATCTGGGAA TGGATATGAC AACAGCTATT 7500 

AATATTTTCC TTCGTCAGGC AATTCAATAT CAGGGATTAC CTTTTGATGT TAGACTAGAC 7560 

GAAAATCGGA AGTTGCTCCA AGCGTTAACG GATTTAGACC AAAATCGTAA TATGAGCCAG 7620 

TCTTTTGAAT CAGTCTCAGA TTTGATGGAG GACTTACGTG CTTAAGATTC GTTATCATAA 7 680 

ACAGTTTAAA AAAGATTTTA AGTTGGCTAT GAAGCGTGGT TTGAAGGCAG AATTATTAGA 7740 

AGAAGTTTTG AATTTTCTGG TTCAAGAAAA AGAACATCCT GCCAGAAATC GTGATCATTC 7800 

ATTGACGGCA TCCAAGCATT TTCAAGGAGT TCGTGAATCC CATACCCACC CAGATTGGCT 7860 

TTTGGTTTAT AAAGTAGACA AGTCGGAATT GATTTTAAAT TTGCTGAGGA CAGGCAGTCA 7920 

CAGTGATTTA TTTTAATCTA TTTTAAGGGG GTTCTCATGA AACTAAGAAT ATTTGCGGAA 7980 

GATAAGCCGG CTAAGAAGGT ATTTGAATAT CAATTAGAAC TTGCTGATCG TACAATTCTT 8040 

CTATCGACAG CACTCTTGTC AGGTGCTATT GCTTTAGCAG GAATCTTTTC TGCTTTGAAA 8100 

GAAAAATAAA AATAGAAAAG AGAAAACAGA ATGGTTTTAC CAAATTTTAA AGAAAATCTA 8160 

GAAAAATATG CGAAATTGTT GGTTGCGAAC GGAATTAACG TGCAACCTGG TCACACTTTG 8220 

GCTCTCTCTA TTGATGTGGA GCAACGTGAA TTGGCACATC TAATCGTGAA AGAAGCTTAT 8280 

GCCTTGGGTG CGCATGAGGT CATCGTTCAG TGGACAGATG ATGTGATTAA CCGTGAGAAA 8340 

TTCCTCCATG CCCCGATGGA GCGTTTGGAC AATGTGCCAG AATACAAGAT TGCTGAGATG 8400 

AACTATCTCT TGGAGAATAA GGCTAGCCGT CTTGGAGTTC GTTCATCTGA TCCAGGTGCC 8460 
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TTGAACGGAG TGCACGCTGA CAAGCTTTCA GCTTCTGCTA AAGCTATGGG ACTTGCCATG 8520 

AAGCCTATGC GTATCGCAAC TCAATCTAAC AAGGTTAGCT GGACTGTAGC AGCTGCAGCA 8580 

GGACTTGAGT GGGCTAAGAA AGTCTTCCCA AATGCTGCGA GCGACGAAGA AGCAGTTGAT 8640 

TTCCTTTGGG ACCAAATTTT CAAAACTTGC CGTGTCTACG AAGCAGATCC TGTTAAGGCT 8700 

TGGGAGGAAC ATGCAGCCAT TCTCAAGAGC AAGGCCGATA TGCTTAATAA GGAGCAATTT 8760 

TCAGCCCTTC ACTACACAGC GCCAGGAACA GATTTAACAC TTGGTTTGCC AAAGAACCAC 8820 

GTTTGGGAAT CAGCTGGTGC TGTCAATGCA CAGGGCGAAG AATTCTTGCC AAATATGCCA 8880 

ACAGAAGAGG TCTTCACAGC GCCTGACTTC CGTCGTGCAG ATGGTTATGT CACTTCTACA 8940 

AAACCGCTTA GCTACAACGG AAATATCATT GAAGGCATTA AGGTGACCTT TAAGGATGGA 9000 

CAAATCGTAG ATATCACTGC TGAGAAGGGT GATCAGGTTA TGAAAGACCT TGTCTTTGAA 9060 

AATGCGGGTG CGCGTGCCTT GGGTGAATGT GCCTTGGTAC CAGATCCAAG TCCAATTTCT 9120 

CAGTCAGGCA TTACCTTCTT TAACACCCTT TTCGATGAAA ATGCCTCAAA CCACTTGGCT 9180 

ATCGGTGCAG CCTATCCCAC TAGCGTTGTT GATGGAGCGG AGATGAGCGA AGAGGAGCTT 9240 

GAAGCTGCAG GGCTTAACCG TTCAGATGTT CACGTAGACT TTATGATTGG TTCTAACCAA 9300 

ATGGATATCG ATGGTATTCG TGAGGATCGA ACGCGGGTAC CTCTTTTCCC TAATGGGAAT 9360 

TGGGCAAATT AAGGAGATAA TATGTTAGGA AGTATGTTCG TTGGTCTCCT AGTGGGATTT 9420 

TTAGCAGGTG CTATGACCAA TCGTGGAGAG CGAATGGGAT GTTTTGGAAA AATGTTTCTC 9480 

GGTTGGATCG GAGCCTTTCT AGGTCACTTG CTCTTTGGAA CTTGGGGCCC AGTTTTATCA 9540 

GGAACAGCTA TTATCCCAGC GATTTTAGGA GCCATGATTG TTTTAGCTAT TTTTTGGAGA 9600 

CGAGGAA 9607 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i» SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14231 base pairs 
(BJ TYPE: nucleic acid 
(C) STRANDEDNESS : double 
ID> TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

CTACAAGATA ATTCCAGCTA TAACATCCGC TATAATAGTA AGAGCGAGCT CTATGATAAG 60 

GCTCATTAGT TTCACCTCCT CTCACGAACC CATAGGAACG TAATCGGTAA CCGATGACAA 120 

AAATAGTATA CCACAATACA TTTAGATCAT CAAGGTCACT TAATTCTTGA AATATCAGAT 180 

CTAAGAGAAA AATCTTTAAA ATCAGAAAAA CGCATAATAT CAGGTGTGCA AAAACTTGAT 240 
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ACTATCCGTT TTATTGTGGG AAGGTTTACT CCATTTTCTC CTGAAATTGA GTTTTTGTCC 300 

AGCCTCTGTT TTTAGGGTTG CTAAGAAAAT AATGTCATGT CCTGAATATT TGTAAATCAG 360 

TCAGCAGACA GAACGATACT CTTCGAAAAT CTCTTCACAT CATGTCAGCT TCGTCTTTCC 420 

GTATATATGT GACTGACTTC ATCAGTTCTA TCTACAACCT CAAAACAGTG TTTCGAGCTG 480 

ACTTGATCAA TTTTCAAATC TGTACTTTGA GCAAGCTGAG ACTAGCTTCC TATTTGATTT 540 

TCATTGAATA TCAGAAACCC ATTCTCCATC AAATAATTCG ACTGCGTCTA ATAATTTTTG 600 

ATCTGGCACG GTGTCTGAAA TAAAGGTTGT GTATTTGGAG AGGGGATTAA TTTTAAAAAA 660 

TCCAGTCTTG TAAAATTTAG AACTATCAAT CAGTAAGATG GTTTCATGGG CTTTGTCAAT 720 

AATATTCTTT TTTGAAATAG CTTGGCTGAG AGAAGCTTCA TAAACATATT GGTCATCAAT 780 

ACCTCTTGCT GAACAAAATG CTAAATCGAT ATTAAAATGA TCTAATAAAG AATTTTCCTT 840 

ATCATAGTTG ACCACGGAAC AGGATTGATG TTTGACCTCG CCAGATGTGA TAAACATTTT 900 

GGAGCTATCT TTAACAGTTT CAGATAGGGT TTGTGCAGTA TGTAAACCAT TTGTAAAAAT 960 

AATCAAATTA TCAAGTTCAG AAAGATAGGG ACAGAGTTCG TAGACAGTAG TACTAGAATC 1020 

TAGATAGATA CACATACCAG ACCGAATAAA GTCTTTAGCG AGACTAGCGA TTAGTCTTTT 1080 

TTGCCTAGTA CTTTCTCCTT CACGTATTTG ATGAGAAAGT TCAATTGTGT TCATAGAGGA 1140 

CAGGGTCACG TATCCGTGCT TTCTTTTGAT AAGACCTTGA TTTTCTAAGA AAATTAAATC 1200 

ACGACGTAAG GTACTTGTGC TGGACAAAGT GATTTCTGCC AGCTCTTTTA CGCCAATTCT 1260 

TTTTTTCTTT TTGATAATTT CAATCAATTC AAGTACACGT TCATCTTTTA TCATAAGCTC 1320 

CTCCTAATTT ATCATTTCAA CTATATTATA GCACAAATTG GAGGAATTTG AATTATTTTT 1380 

ATGAATATTG GGTTAACATT TGAACATTAT TCAAGTAACC GTTCACATAT TGAAAAAATA 1440 

AAACGTGGGG ATTATAATAA AGTTAATCmA GGACGAAGAG AGAAGAAAAA TGGAAGCGGT 1500 

TTTAGCAATA GATTTAGGTG CGACTTCTGG AAGACCAATC GTTGGTTACC TTTCTGAAAA 1560 

TAAACTAGTA ATGGAAGAAA TAAATCGCTT TTCTAATCTA CCTATTAGAG TAAAAGGGCA 1620 

TTTATCTTGG GATATTGACT TTCTACTAGC TAAAATTCTT GAAAGTATCC GCTTGGCTAA 1680 

TACTAGTTAC AAGATTTTAT CTATCGGTAT TGACACATGG GGAGTTGATT TTGGACTGAT 1740 

TGATAATGAA GGTAAGCTGT TATTACAACC TGTTCATTAT CGTGATGAAA GAACAAAGGG 1800 

AGTGTTAAAG GAAATATCTG AAATGACTGA ATTAGAAAAA CTGTATTCAG AGACAGGAAA 1860 

TCAGATTATG GAGATAAATA CCTTGTTTCA ACTCTTTAAG GCACGTCAAG AATCTCCTGA 1920 

CTCTTTCTAT AAGACCAATA AGATTCTTTT AATGCCAGAT TTGTTTAATT ATCTCTTGAC 1980 
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AGGTAAGTTT GCTACAGAAA AAAGCATTGC TTCAACAACT CAATTATTTC ATCCTAGGAG 2040 

TCAAAATTGG AATCAGAATA TCTTAAAACT ATTTGAATTG GATTCATCTT TACTTCCTGA 2100 

AATTGTTTCA GAGGGAAATG TTCTTGGAAG GATAAAAGAG GAGTATGGTT TAGGCGATAT 2160 

TCCTGTTGTG AATGTTTGTA GTCATGATAC AGCAAGCGCG ATTGTCTCAG TACCTAAGAC 2220 

AGAAGGTAGT TTATTTATTT CATCAGGTAC TTGGTCTTTG GTTGGAGTGG AACTTACTTC 2280 

ACCGATTCTT ACTACCGAAT CCTTCAGTTA TGGATTTACA AATGAAGTCG GTAAAGATGG 2340 

AGTGATTACA TTTCTGAAGA ATTGTACAGG GTTGTGGATC ATAGAGGAAC TAAGACGTTC 2400 

ATTTGAACGA AGAGGGAAAG CCTATTCTTT TGATGATATT AGGACAATGG TGGAGAAAGA 2460 

AAAAGAAAAT CTTCCTCTGA TTGATACTGA ATCAACTGAA TTTGCAACAG AATCTGATAT 2520 

GCACAAGACT TTGACAGAAT ATCTAGCTTA TCATCATGAA ACTAGAGAGT GGACAGATGG 2580 

ACAACTATTT AAGATTGTTT ATGAAAGCCT AGCTGAAACG TATAGGAAAG CCATAGAGTT 2640 

ACTAGAAGAA CTAACTCATA AGGTTTATAA GAGGATATAT GTGATTGCAG GAGGTGCTAG 2700 

AGCCAGTTAC TTTAACCAAA TGATTGCTGA TAGAACTGGT AAAGAGGTTC TTACAGGTTT 2760 

GACTGAGGCT ACAGCTGTGG GGAATATTGT TGTGCAGCTC ATAGCTATGG GACAATTAAA 2820 

AGGGATGGAA GAGGCTCACC ATGTTATTGA GGAGTTTCTA CAATTAGAGA GTTATTACTC 2880 

CCAAAAGAAT TAAAAAGATT GAGAGTTTGT AAATTTGCCT CCCTCCCCCT TCTTAGCTTT 2940 

TGTGCAGGAA GGGGGGATAA TTGGTGAATT GAAAAATATT TAGTGTTTTG ATATGAGGAG 3000 

GACAAGGATG TCAGATGTAA AACAAGAATT AATTAAATAT GGTAAGAAGC TAGTAGAAAC 3060 

AGATTTGACG AAAGGAACAG GTGGGAATCT CAGCGTTTTC GATCGTGAAA AACAATTGAT 3120 

GGCAATTACC CCGTCGGGTA TTGATTTCTT TGAAATCAAA GAATCCGATA TTCTAGTGAT 3180 

GGATATTAAT GGAAATGTTG TAGAGGGAGA ACGCTTGCCA TCTAGCGAAT GGTATATGCA 3240 

TTTGATTCAA TATCAAACTC GTGATGATAT CGATGCAATT ATCCATGCTC ATACAACTTA 3300 

TGCAACAGTA TTAGCTTGTC TCAGAGAACC ACTTCCAGCG AGTCATTATA TGATTGCAGT 3360 

GGCAGGGAAA GATGTTCGGG TAGCTGAGTA TGCAACATAT GGCACGAAAG AATTGGCTGT 3420 

GAATGCAGCT AAAGCAATGG AAGGTCGTAG AGCAGTTTTA CTAGCGAATC ATGGAATTTT 3480 

AGCAGGTGCA CAAAATTTAT TGAATGCATT TAATATTGTT GAAGAAGTTG AATATTGTGC 3540 

AAAAATTTAT TGTTTAGCTA AGAATTTTGG AGAGCCAGTA GTTCTTCCTG ATGAGGAGAT 3600 

GGAATTGATG GCAGAAAAAT TTAAAACATA CGGTCAGAGA AAATAGGGAG GATATTAATG 3660 

TTAAAACATA TACCGAAAAA TATTTCTCCA GATTTATTGA AGACTTTAAT GGAAATGGGA 3720 

CATGGAGATG AAATAGTATT AGCTGACGCG AATTATCCTT CTGCCTCATG TGCAAATAAG 3780 
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CTAATTCGTT 


GTGATGGTGT 


AAATATTCCA 


GAATTATTAG 


ATTCCATTCT 


GTATTTAATG 


3840 


CCATTAGATA 


GTTACGTCGA 


TAGTTCAATT 


CAGTTTATGA 


ACGTTGTTTC 


GGGTGATGAT 


3900 


ATTCCTAAGA 


TATGGGGTAC 


CTATAGACAG 


ATGATTGAAG 


GTCATGGTAC 


AGATCTTAAA 


3960 


ACGATTACTT 


ATCTTAGAAG 


AGAAGACTTT 


TATGAACGTA 


GTAAGAAAGC 


TTATGCTATT 


4020 


GTTGCTACAG 


GAGAAACTTC 


ACTTTATGCT 


AATATTATCC 


TTAAGAAAGG 


AGTAGTTGTT 


4030 


GAAAGAGAAA 


ATGTTCAATA 


GAGGAATTTT 


ACTTCCCAGT 


CATGGTAATT 


TTGCTAGCGG 


4140 


AGCTCTCATG 


ACCGCAGAAA 


TGTTTGTTGG 


TGAGACAACA 


AATGATAGAG 


TTAGGACATT 


4200 


AGGTTTGATG 


CCTGGAGAGA 


ATATTGTAGA 


GTTTGAGCAT 


TATTTTAAAA 


ATCAAGTGGA 


4260 


TGAACTGTTA 


GACTCAAATC 


AAGAGGTTAT 


CGTTTTGACT 


GACTTGATTG 


GAGGAAGTCC 


4320 


TAATAATGTG 


GCTTTGTCAC 


GGTTTTTAAA 


TTTGGATTCA 


GTTGATATTG 


TAACAGGGTT 


4380 


TAATATCCCT 


CTCCTAGTGG 


AATTAATATC 


AAGTTATGAT 


TCAAAAATCA 


ATTTAGAAGA 


4440 


AATTGTTCAC 


AATGCTCAAA 


ATAGTTTGTT 


TAATGTTAAA 


CAACAACTTA 


ACGTAGAGGA 


4500 


GGAAGAAGAT 


TTATGTCTAT 


AGAGTTTGTT 


CGTATTGATG 


ACCCTCTGGT 


ACATGGTCAA 


4560 


GTTGTCACTA 


CGTGGCTAAA 


AAAGTATGAT 


ATTGAGCAAG 


TTATCATTCT 


TAATGATCGC 


4620 


ATCTCAGAAG 


ATAAAACACG 


ACAATCTATT 


TTAAAGATTT 


CTGCACCGGT 


AGGTTTAAAA 


4680 


ATTGTTTTCT 


TTAGTGTAAA 


ACGGTTTGTG 


GAAGTTTTAA 


ACTCTGTGCC 


AATAAAAAAG 


4740 


AGAACAATGC 


TGATATATAC 


AAATCCAAAA 


GATGTGTATG 


ATTCTATTGA 


AGGAAATTTA 


4800 


AAATTGGAGT 


ACCTCAATGT 


AGGACAGATG 


AGTAAAACGG 


AGGAAAATCA 


AAAGGTAACG 


4860 


GGACGTGTAG 


CTCTAGGTGA 


AGAAGACAAA 


TATTATTTTA 


AGAAAATAGT 


TGATAAGGGA 


4920 


ACGAGAGTTG 


AAATTCAAAT 


GGTTCCTAAT 


GATAAAGTTA 


CAATGTTGCA 


AAAATTTTTA 


4980 


TAAAAATAAT 


TTAAGGAGCT 


ACAGTATATG 


CTATTCACAC 


AAGCATTACT 


GGTGACATTA 


5040 


GTTCGGATTA 


TTGCCACTAT 


TGACTATAAT 


GGACCGTTAT 


TTATGATTCA 


CCGTCCGTTA 


5100 


GTTACAAGTG 


CAATGGTTGG 


CTTAGTATTA 


GGAGATTTCA 


CCCAAGCTGT 


TCTTATTGGT 


5160 


TCAGCTCTTG 


AATTAACTTG 


GCTCGGTCTA 


ACAGGTATTG 


GAGGTTATAC 


TCCACCAGAT 


5220 


ACTATTTCAG 


GTGCGATTAT 


TGGTACTGCA 


TTTGGTATTT 


TATCTGGTCA 


AGGAGAAACT 


5280 


GCTGGTATCG 


CTATAGCAGT 


TCCAATTGCA 


GTTGCTACCC 


AACAGTTGGA 


TGTTCTTGCA 


5340 


AAAACTTTAG 


ATGTTTATTT 


TGTGAAAAAA 


GCTGATAATG 


ATGCTAAAAA 


CGGAGATTAT 


5400 


TCAAAGATCG 


GTTTTTATCA 


TTATTCAAGT 


TTGGTTTTAA 


TCACGTTATT 


TAAAATTGTA 


5460 


CCAATTTTCC 


TAGCTATTAT 


GCTTGGAGGG 


GAATATGTGG 


CAGACTTGTT 


TGCTAAGGTT 


5520 
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CCACCAATCG 


TTATGCAGGG 


ACTTAACTCT 


GCAGGTGCTT 


TACTACCTTC 


AATTGGTTTT 


5580 


GGTATGCTTT 


TAAATATGAT 


GCTCAAGAAA 


AATATGTGGG 


TATTCTTGTT 


GATTGGATTC 


5640 


ATTTGTTCTG 


TGTATGGAGG 


AATGTCAACC 


ATTGGGATCT 


CACTAGTTGG 


TATTGCGGTA 


5700 


GCATACTTCT 


ACGATATGAT 


TGGAAGCAAA 


CCACAAGAAA 


CAACTTCAAG 


TAGTGATGTT 


5760 


GAGGAGGATC 


TTGATCTATG 


ATGAATAATA 


AAGTAACTAA 


AGTTGAACTT 


AAAAAAGTTT 


5820 


TCAAACGAAG 


TTTTATGTAT 


GGTTCTTCAT 


GGAACTATGA 


GAGAATGCAG 


AACCTAGGTT 


5880 


TTCTATATAC 


AATTCTTCCA 


GTATTGAAAA 


AACTATACCC 


AGACAAAGAT 


TCAGCTTCTC 


5940 


CTGCAATGAA 


ACGTCACCTT 


GAGTTTTTCA 


ATACTCATCA 


AACAGCGGCA 


CCATTTATTC 


6000 


TTGGAGTTAC 


TTCCGCTATG 


GAAGAACAAG 


AAGGAAATGA 


AGGTGCAGCT 


TCAATTACTG 


6060 


GTATTAAAGT 


TGGCTTGATG 


GGGCCACTGG 


CTGGTCTAGG 


AGATAGTTTG 


TTCTGGCTGA 


6120 


CACTAGTTCC 


TATCTGTTTT 


AGTATTGGTG 


CGTCTTATTC 


TAAAGACGGC 


GGTGCTTTAG 


6180 


GTATCTTTAT 


CGCCTTAATA 


TTGTTTAATA 


TTATTAATAT 


TCCTGTTAAA 


TATTTCGGTT 


6240 


TGAAATATGG 


GTATACTAAG 


GGTTCTAGTC 


TTATCCAAGA 


AAATAATACA 


AAAGGAACAT 


6300 


TGAATCGCGT 


TACGAGTATG 


GCGACAGCAT 


TAGGGCTAGT 


ACTAGTGGGT GGTTTGATTC 


6360 


CATCAATGGT 


TGGTATTAAT 


TTTGGATTAG 


AATTTAAGCA 


GGGGGAACTT 


GTTATTTCTG 


6420 


TTCAAGAAAT 


GATTACAAAA 


TTAATTCCAG 


GATTTATCCC 


TATGCCTTTG 


ACTTTATTAA 


6480 


TGTGTAAATT 


AATTAGAAAA 


GGAAAGAATC 


CGGTTCTACT 


AATCTTTAGT 


GTTATGGCTA 


6540 


TTGGAGTTAT 


TCTAGTTGTT 


TTAGGAATTT 


TGAAGTAGTA 


GAAAGTGTGG 


AGGTGGTATT 


6600 


TGGGATATCA 


CCTCCATTTT 


GGAAGAGAGG 


TAAAGAGTGA 


AATTATGGTA 


TAAGAAAGCT 


6660 


GCCGCAAATT 


GGAATGAAGC 


CTTGCCGATT 


GGGAACGGTC 


ATTTAGGTGG 


TATGATTTAT 


6720 


GGTTCAGCTA 


CAAAAGAATG 


TATTCAACTA 


AACGATGAGA 


CTATTTGGTA 


TAGAGGAAAG 


6780 


TCAGATAGAA 


ATAATCCAGA 


CTCACTATTG 


CATCTTAAAA 


AAATTCGGGA 


ATATCTTTTA 


6840 


GATGGAGAAA 


TTCAGAAAG*- 




ATAAAfTTTAA 


CAGTGTTTGC 


TACCCCAAGA 


6900 


GATCAAAGCC 


ACTATGAATT 


ACTTGGGGAA 


CTTTACATTG 


AGCATATAGA 


TATTCAGTCT 


6960 


TGTGCTCTTT 


CATTGTATGA 


AAGAGAGCTA 


GATTTAGATA 


CACCTATTTC 


TAATGTTGTG 


7020 


TTTCAGCCTA 


ATAGTTGTAA 


TTTACAAATA 


AAAAGAGAAT 


ATTTTACGAG 


TTTTAATAAG 


7080 


AATATTTTAT 


GTTGCCGTAT 


AGTGTCATCA 


GTTCAAAACA 


CATTAAATTT 


AAACATTAAT 


7140 


TTGGGTAGAA 


ATAAACGGTT 


TAATGACGAA 


GTATCTAAAC 


TGGATTCAAG 


TACAATTTTA 


7200 


ATGTCGGCCT 


CTGCTGGAGG 


TAGAAAAGCT 


GTTCAGTTTA 


AAGTAGTATG 


TCATTCTAAG 


7260 


GTTACGGATG 


GTGAAGTAAG 


TGTATTGGGA 


GAGACAATAG 


TTATTCGGAA 


TGCTACAGAG 


7320 
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GTATTTCTTT 


ATCTCAAATC 


AATGACGGAT 


TATTGGGGAA 


ATATAGATAT 


TTCTTCTCTT 


7380 


CAGGGAGAAT 


TTAGTAGTAT 


TGATTACTTT 


ACAGAAAAAG 


ATGAACATCT 


AAAAAAATAT 


7440 


CAGGAGCAAT 


TTAATAGAGT 


TGATTTTAAA 


CTAGACTATA 


GTAAAGGTTG 


TCTTAGCATT 


7500 


CCAACGAATC 


TACTTCTTGA 


AAACACTAAA 


AAGTATAGTA 


ACTACTTGAC 


TAACTTGTTA 


7560 


TTTCATTATG 


GAAGATATCT 


GTTAATATCG 


TCTAGTCAAC 


CGAATGGTTT 


ACCTGCCAAT 


7620 


CTTCAAGGAA 


TATGGTGTGA 


TGAATTAAAT 


CCAATTTGGG 


GTTCTAAATA 


TACGATTAAT 


7680 


ATTAATACTC 


AAATGAATTA 


TTGGATGGTA 


GGTCCATGTG 


ATTTACCAGA 


AGTAGAATAT 


7740 


CCATTATTTG 


ATATGCTCGA 


AAGAATGAGA 


GAACCGGGAA 


GACTAACCGC 


TAAGAAAATG 


7800 


TATGGAGCTA 


GAGGTTTTAC 


AGCACATCAT 


AATACGGATG GTTTTGGCGA 


TACGGCTCCC 


7860 


CAATCTCATG 


CCATGGGGGC 


TGCAATTTGG 


GTATTAACTA 


TTCCATGGTT 


ATGTACTCAT 


7920 


ATTTGGGAAC 


ACTATTTATA 


TTTCCAAGAT 


GAGCGTATTC 


TTACGGAACA 


TTTTGAAATG 


7980 


ATAAAAGAAG 


CATTTCTTTT 


CTTTGAAGAT 


TATTTATTTG 


AGCTGGATGG 


CTACTTGATG 


8040 


ACAGGTCCAA 


GTGTCTCACC 


GGAAAATAAA 


TATCGCTTAA 


AAAATGGTAT 


TGAAGGAAAT 


8100 


GCTTGTCTAT 


CATCTACAAT 


TGATAATCAA 


ATTCTAAGAT 


ATTTTTGTGA 


TPCATGCATT 


8160 


GGCATTGCAA 


AACAATTAGG 


AGACAATTCG 


GATTTTATTA 


GTCGTGTGAA 


GGAGTTAAAA 


8220 


AAGAAACTAC 


CTAAAACAAA 


AATAGGTAGT 


AATGGGCAAA 


TCCAACAATG 


GTTAGAAGAT 


8280 


TATGAAGAAG 


TAGAGCCTGG 


GCATAGACAC 


ATTTCACCTC 


TATTTGGGCT 


TTATCCTTAT 


6340 


AATGAGATTG 


ATATTCATAA 


AACTCCGGAA 


TTAGCAGAAG 


CAGCTAAAAT 


CACTATCAAT 


8400 


AGGAGATTAT 


CAAACGCTAA 


TTTTTTATCT 


TCACAGCAGA 


GGGAGCAAGC 


GATTAATAAT 


8460 


TGGTTAGTAA 


GTGGTTTGCA 


TGCTAGTACA 


CAAACAGGTT 


GGAGTGCTGC 


ATGGCTGATT 


8520 


CATTTTTTTG 


CGAGACTATA 


TCAAGGTGAA 


CCTGCTTATA 


ACCAGATTAA 


TGGTTTGTTA 


8580 


AATAATGCGA 


CTCTTGGCAA 


TTTATTTCTT 


GACCATCCAC 


CATTTCAAAT 


TGATCGTAAT 


8640 


TTAGGTTTGG 


TGAGTGGAAT 


TTGTGAATTA 


TTAGTACAGA GCCATCATAA 


TTGGTTATCA 


8700 


CTAATTCCAG 


CTTTACCTTC 


TGCTTGGTCA 


GAAGGAGAAG 


TGAAAGGTTT 


CAGAGTAAGA 


8760 


GGAGGATATA 


AGGTATCGTT 


TGCTTGGAAA 


AATGGGGATA 


TAACATTCCT 


AAAATTGGAA 


8820 


GGAGGAAACA 


AAGATCAAAA 


AGTAAGAGTA 


AGAATATATG 


GCAAAAATAC 


TGATGTACAA 


8880 


AATATTGAAT 


TGGTATTTAA 


TTCAGAAAAA 


ATTATTGAGT 


TAAATTTTTA 


GGTATAAGTC 


8940 


ATGAATAAAG 


AAAAAATAAA 


AAGAAAATTA 


ATCACAATAT 


TGTTTGTATG 


TATTGGGATG 


9000 


TTATGTTTTG 


GATTGTTAGC 


AGGAGTTAAG 


GCTGATAATC 


GTGTTCAAAT 


GAGAACGACG 


9060 
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ATTAATAATG AATCGCCATT GTTGCTTTCT CCGTTGTATG GCAATGATAA TGGTAACGGA 9120 

TTATGGTGGG GGAACACATT GAAGGGAGCA TGGGAAGCTA TTCCTCAAGA TGTAAAGCCA 9180 

TATGCAGCGA TTGAACTTCA TCCTGCAAAA GTCTGTAAAC CAACAAGTTG TATTCCACGA 9240 

GATACGAAAG AATTGAGAGA ATGGTATGTC AAGATGTTGG AGGAAGCTCA AAGTCTAAAC 9300 

ATTCCAGTTT TCTTGGTTAT TATGTCGGCT GGAGAGCGTA ATACAGTTCC TCCAGAGTGG 9360 

TTAGATGAAC AATTCCAAAA GTATAGTGTG TTAAAAGGTG TTTTAAATAT TGAGAATTAT 9420 

TGGATTTACA ATAACCAGTT AGCTCCGCAT AGTGCTAAAT ATTTGGAAGT TTGTGCCAAA 9480 

TATGGAGCGC ATTTTATCTG GCATGATCAT GAAAAATGGT TCTGGGAAAC TATTATGAAT 9540 

GATCCGACAT TCTTTGAAGC GAGTCAAAAA TATCATAAAA ATTTGGTGTT GGCAACTAAA 9600 

AATACGCCAA TAAGAGATGA TGCGGGTACA GATTCTATCG TTAGTGGATT TTGGTTGAGT 9660 

GGCTTATGTG ATAACTGGGG CTCATCAACA GATACATGGA AATGGTGGGA AAAACATTAT 9720 

ACAAACACAT TTGAAACTGG AAGAGCTAGG GATATGAGAT CCTATGCATC GGAACCAGAA 9780 

TCAATGATTG CTATGGAAAT GATGAATGTA TATACTGGGG GAGGCACAGT TTATAATTTC 9840 

GAATGTGCCG CGTATACATT TATGACAAAT GATGTACCAA CTCCAGCATT TACTAAAGGT 9900 

ATTATTCCTT TCTTTAGACA TGCTATACAA AATCCAGCTC CAAGTAAGGA AGAAGTTGTA 9960 

AATAGAACAA AAGCTGTATT TTGGAATGGA GAAGGTAGGA TTAGTTCATT AAACGGATTT 10020 

TATCAACGAC TTTATTCGAA TGATGAAACA ATGCCTTTAT ATAATAATGG GAGATATCAT 10080 

ATTCTTCCTG TAATACATGA GAAAATTGAT AAGGAAAAGA TTTCATCTAT ATTCCCTAAT 10140 

GCAAAAATTT TGACTAAAAA TAGTGAGGAA TTGTCTAGTA AAGTCAACTA TTTAAACTCG 10200 

CTTTATCCAA AACTTTATGA AGGAGATGGG TATGCTCAGC GTGTAGGTAA TTCCTGGTAT 10260 

ATTTATAATA GTAATGCTAA TATCAATAAA AATCAGCAAG TAATGTTGCC TATGTATACT 10320 

AATAATACAA AGTCGTTATC GTTAGATTTG ACGCCACATA CTTACGCTGT TGTTAAAGAA 10380 

AATCCAAATA ATTTACATAT TTTATTGAAT AATTACAGGA CAGATAAGAC AGCTATGTGG 10440 

GCATTATCAG GAAATTTTGA TGCATCAAAA AGTTGGAAGA AAGAAGAATT AGAGTTAGCG 10500 

AACTGGATAA GCAAAAATTA TTCCATCAAT CCTGTAGATA ATGACTTTAG GACAACAACA 10560 

CTTACATTAA AAGGGCATAC TGGTCATAAA CCTCAGATAA ATATAAGTGG CGATAAAAAT 10620 

CATTATACTT ATACAGAAAA TTGGGATGAG AATACCCATG TTTATACCAT TACGGTTAAT 10680 

CATAATGGAA TGGTAGAGAT GTCTATAAAT ACTGAGGGGA CAGGTCCAGT CTCTTTCCCA 10740 

ACACCAGATA AATTTAATGA TGGTAATTTG AATATAGCAT ATGCAAAACC AACAACACAA 10800 

AGTTCTGTAG ATTACAATGG AGACCCTAAT AGAGCTGTGG ATGGTAACAG AAATGGTAAT 10860 
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TTTAACTCTG GTTCGGTAAC ACACACTAGG GCAGATAATC CCTCTTGGTG GGAAGTCGAT 10920 

TTGAAAAAAA TGGATAAAGT TGGGCTTGTT AAAATTTATA ATCGCACAGA TGCTGAGACT 10980 

CAACGTCTAT CTAATTTTGA TGTGATTCTA TATGACAATA ATAGAAACGA AGTTGCTAAG 11040 

AAACATGTTA ATAATTTGTC GGGTGAATCT GTTAGTCTAG ATTTCAAAGA AAAAGGAGCA 11100 

AGGTATATTA AAGTTAAATT ACTAACGAGT GGAGTGCCTT TGAGTTTAGC AGAAGTAGAG 11160 

GTTTTTAGAG AATCAGATGG TAAGCAATCT GAAGAGGATA TAGATAAAAT AACAGAAGAT 11220 

AAAGTAGTCT CTACAAATAA GGTAGCTACT CAAAGTTCAA CCAATTATGA GGGTGTAGCT 11280 

GCTTTAGCAG TTGATGGTAA TAAAGATGGA GATTACGGAC ATCATTCGGT GACTCATACT 11340 

AAGGCAGATT CTAACGCTTG GTGGCAGGTC GATCTGGGAG AAGAGTTTAC GGTTTCTAAA 114 00 

GTTGATATTT ATAATAGAAC AGATGCCGAA CCTCAGCGTT TATCTAATTT TGATGTTATT 114 60 

TTTCTATCTT CATCAGGAGA AGAAGTTTTT AGAAGACATT TTGATAAAGT AGTTGATGGT 11520 

TTGTTATCTT TAAAAGTACC TTCTGTAGGG GCTAAGCTAG TCAAAATAGA ATTAAAATCA 11580 

GCAGCTATTC CGTTAAGTTT AGCGGAAGTT GAACTCTATG GTTCAAAGAG AACTCCGAAG 11640 

AAACTTTCTA ATATTGCATT AACAAAAGAA ACTCGACAGA GTTCAACGGA TTACAATCGT 11700 

TTTTCTCGTC TAGCAGTTGA TGGAAATAAA AACGGAGATT ATGGTCATCA TTCAGTGACT 11760 

CATACCAAAG AAGATTCTCC TTCATGGTGG GAGATAGATT TAGCACAAAC CGAAGAATTA 11820 

GAAAAGTTAA TTATTTATAA TAGAACAGAT GCTGAAATTC AGAGATTATC AAATTTTGAT 11880 

ATTATTATAT ATGATTCAAA TGATTATGAA GTTTTTACAC AACATATTGA CAGTTTAGAA 11940 

AGCAATAATC TATCCATAGA CTTAAAAGGA CTGAAGGGAA AAAAGCTTAG AATTTCTTTC 12000 

AGAAGCGCAG GAATTCCTTT AAGTTTAGCA GAGGTAGAGG TTTATACTTA TAAGTAATTT 12060 

TAAAAATTAT CACCCAGGCT ACCGTAAATA TAATGGAGAT GGTAGTATGA AAGAAACAGA 12120 

AAAATAAGAG GAAAATAGTA TGATTCAACA TCCACGTATT GGGATTCGTC CGACTATTGA 12180 

TGGTCGTCGT CAAGGTGTAC GCGAATCACT TGAAGTGCAA ACAATGAACA TGGCTAAAAG 12240 

TGTGGCAGAT TTGATTTCAA GCACATTGAA ATATCCAGAT GGGGAACCTG TGGAATCCGT 12300 

GATTTCTCCA TCTACTATTG GCCGTGTACC AGAGGCTGCA GCTTCCCATG AGTTGTTTAA 12360 

AAAATCAAAT GTTTGCGCAA CAATTACAGT TACACCATGC TGGTGTTATG GTAGTGAAAC 12420 

TATGGATATG TCTCCAGATA TTCCTCATGC TATTTGGGGA TTTAATGGGA CAGAACGCCC 12480 

AGGAGCTGTC TATCTTGCAG CTGTACTAGC TTCACATGCT CAAAAAGGGA TTCCAGCCTT 12540 

TGGGATTTAT GGAAGAGATG TTCAGGAAGC TAGTGACACA GATATTCCAG AAGATGTCAA 12600 



WO 98/18931 



PCT/US97/19588 



650 



AGAAAAACTT 


TTACGCTATG 


CGCGTGCAGC 


TCTTGCAACT 


ggcttgatga 


CAGACACTGC 


12660 


TTACCTATCA 


ATGGGTAGTG 


TTTCGATGGG 


GATTGGTGGT 


TCTATTGTAA 


ATCCGGATTT 


12720 


CTTCCAAGAA 


TACTTAGGAA 


TGCGAAATGA 


ATCGGTAGAT 


ATGACGGAGT 


TCACGCGCCG 


12780 


TATGGACCGT 


GGTATTTACG 


ACCCTGAAGA 


GTTCGAACGT 


GCGCTCAAAT 


GGGTGAAAGA 


12840 


AAACGTAAAA 


GAAGGATTCG 


ACCATAACCG 


TGAAGACCTT 


GTTTTAAGCC 


GTGAAGAAAA 


12900 


AGATAGACAA 


TGGGAATTTG 


TTATTAAGAT 


GTTCATGATT 


GGACGTGACT 


TAATGGTTGG 


12960 


TAACCCAAGA 


CTTGCTGAAC 


TTGGTTTTGA 


GGAAGAAGCG 


GTTGGTCACC 


ATGCTTTAGT 


13020 


AGCTGGTTTC 


CAAGGTCAAC 


GTCAGTGGAC 


AGACCATTTT 


CCAAATGGGG 


ACTTTATGGA 


13080 


AACTTTCCTC 


AATACTCAGT 


TTGACTGGAA 


TGGTATTCGA 


AAACCATTTG 


TATTTGCGAC 


13140 


AGAGAATGAT 


TCACTAAATG 


GTGTGTCTAT 


GCTCTTTAAT 


TATCTATTAA 


CAAATACTCC 


13200 


ACAAATCTTT 


GCTGATGTGC 


GTACTTATTG 


GAGCCCAGAG 


GCTGTTAAAC 


GTGTAACGGG 


13260 


ACATACTTTA 


GAGGGTCGTG 


CTGCAGCTGG 


CTTCTTACAT 


CTAATCAACT 


CTGGTTCTTG 


13320 


TACATTGGAT 


GGTACAGGTC 


AAGCTACTCG 


AGATGGCAAA 


CCTATTATGA 


AACCATTCTG 


13380 


GGAGTTGGAA 


GAAAGTGAAG 


TGCAGGCTAT 


CCTTGAAAAT 


ACAGACTTCC 


CACCAGCAAA 


13440 


CCGCGAATAC 


TTCCGTGGAG 


GAGGATTCTC 


AACTCGTTTC 


TTGACGAAGG 


GGGATATGCC 


13500 


AGTAACAATG 


GTACGTCTCA ATCTTCTAAA 


AGGGGTTGGT 


CCAGTGCTAC 


AAATTGCAGA 


13560 


AGGTTACACA 


CTTGAACTTC 


CTGAAGATGT 


TCACCATACT 


TTAGATAATC 


GTACAGATCC 


13620 


AGGATGGCCA 


ACTACTTGGT 


TTGCTCCACG 


TTTGACAGGA 


AAAGGTGCTT 


TCAACTCTGT 


13680 


CTATGACGTC 


ATGAATAATT 


GGGGAGCTAA 


TCACGGAGCC 


ATAACATATG 


GACACATTGG 


13740 


AGCAGACTTG 


ATTACCTTGG 


CTTCTATGTT 


GAGAATTCCT 


GTCAATATGC 


ATAATGTACC 


13800 


TGAGGAAGAT 


ATCTTTAGAC 


CTAAAAATTG 


GTCCTTATTT 


GGAACAGAAG 


ATCTAGAATT 


1 IB&ft 
x J oov 


AGCAGACTAT 


CGTGCATCTC 


AGTTGTTGGG 


GCCACTACAT 


AAATAAAACT 


TGTTTATATA 


13920 


GGAGGTGAAC 


TTACGTCCCT 


CCTATCCTTT 


TAAAAAGATT 


TGTTAAACAA 


TTCACAAA1A 


13980 


ATTGAAAACG 


AATACAAAAA 


GTAATATAAT 


GATGTTAAAT 


AGATAGCCCG 


GAGGCGCAGG 


11040 


AGGAAAATTA 


TATGGCTATA 


TTTTATGTTC 


CGGCAGTCAA 


CCTTATTGGA 


AAAGGTGTTG 


14100 


TAAATGAAGT 


GGGTCCTTAT 


ATCAAGGAAC 


TTGGCTATAA 


AAAGGCACTT 


TTGGTGACAG 


14160 


ATAAGTACAT 


CGAAGGCAGT 


GATATTTTAC 


CTAAGACTTT 


AAAACCACTG 


GATACAGAAG 


14220 


GAATCGAATA 


V 










14231 



(21 INFORMATION FOR SEQ ID NO: 82: 



C i > SEQUENCE CHARACTERISTICS: 



WO 98/18931 



PCT/US97/19588 



651 

(A) LENGTH: 16995 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xii SEQUENCE DESCRIPTION: SEQ ID NO: 82: 



AGTTCTCTTA 


ACTTTTTTAG 


GATGGCATTC 


TCCGCTCTCA 


GGTACTCATT 


TTCTGCTgAA 


60 


GACGTTCTAA 


TTCTGTCCTC 


TCTTCAGGTC 


TCGTTTTTCG 


CTTACGTCCC 


ATTTTAGGTA 


120 


CTCTCCCTCT 


TGTTTTCTCA 


ACAATAGTAT 


ACCCGTTTTT 


CCTGTATTGT 


GCTAGCCAGT 


180 


TAAGAAGTAT 


CGTACGACTT 


GGGAGACCGT 


ATTCAAGAGA 


AACTCTATCT 


TTAGTCCAGC 


240 


CTTCATGTCA 


GACTTTATTA 


CTCATTTCTT 


GTTTTAAATC 


AGGAGAATAG 


TAACGATTTT 


300 


TTCCTTTTTT 


GACGAACTCT 


ATTCCGTAAC 


GATCAATCAA 


TTTAATCATG 


TACCTAATAT 


360 


TAGAATTGCT 


TATCCCAAAT 


TTATTTGAAA 


GCTTCTCTAA 


GCTATATCCT 


TGTTTTCTAA 


420 


GTTCATAGAT 


CTGAACTTTA 


TCATCATAAG 


TTAGTTTCAT 


AATAAAAACA 


CCCCAAAAGT 


480 


TAGATTTTTT 


CTGTCTAACT 


TTTGGGGTGT 


AGTTCATGTA 


CACCTGATAT 


GATGCGTTTT 


540 


ATAATTTTTA 


AGCCTTTTTG 


CCCAGCCTCG TCAAAAGTAA TCTTTTGACA 


CAAAATCTGT 


600 


GACAAAACTT 


TAGTTTTAAA 


GGTTTTTAAC 


TTTGTATATA 


CTACTTTTAA 


GAAAAGGAGG 


660 


ATGATCTAAT 


GGAAGAAAAA 


GTATCATTGA 


AAGTCAGGGT 


TCAAAAACTA 


GGGACATCGC 


720 


TTTCAAATAT 


GGTTATGCCC 


AATATTGGAG 


CATTTATTGC 


TTGGGGAGTA 


TTGACTGCCC 


780 


TCTTTATCGC 


TGATGGCTAT 


CTGCCAAATG 


AACAGTTAGC 


TACTGTTGTT 


GGTCCTATGT 


840 


TAACGTATTT 


ATTGCCAATC 


CTGATTGGTT 


ACACAGGTGG 


ATATATGATC 


CATGGCCAAC 


900 


GTGGTGCCGT 


TGTAGGAGCT 


ATTGCTACTG 


TTGGTGCAAT 


CACAGGTTCT 


ACTGTTCCTA 


960 


TGTTTATCGG 


AGCTATGGTA 


ATGGGCCCAC 


TGGGAGGATG 


GACTATCAAG 


AAATTTGATG 


1020 


AGAAGTTCCA 


GGAAAAAATT 


CGTCCCGGAT 


TTGAAATGTT 


AGTTAATAAC 


TTCTCAGCTG 


1060 


GTCTCGTTGG 


TTTTGCATTA 


TTGCTTTTGG 


CTTTCTACGC 


AATCGGTCCA 


GTCGTATCGA 


1140 


CTCTTACTGG 


AGCTGTTGGG 


AATGGTGTTG 


AGGCTATTGT 


CAATGCTCGC 


CTCCTTCCTA 


1200 


TGGCTAATAT 


TATCATCGAA 


CCGGCTAAAG 


TCCTTTTCCT 


CAATAATGCC 


CTCAATCATG 


1260 


GCATTTTTAC 


TCCTCTGGGA 


GTAGAACAGG 


TAGCTCAAGC 


TGGTAAGTCA 


ATTCTCTTCC 


1320 


TATTGGAAGC 


TAATCCTGGA 


CCAGGTCTGG 


GAATTCTATT 


AGCTTATGCT 


GTATTCGGTA 


1380 


AAGGTTCTGC TAAATCTTCT 


TCTTGGGGGG 


CAATGGTTAT 


TCATTTCTTC 


GGAGGGATTC 


1440 


ATGAAATTTA 


CTTTCCTTAT 


GTTATGATGA 


AGCCTACTCT 


ATTTTTAGCT 


GCTATGGCAG 


1500 
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GAGGTATCTC TCGAACTTTT ACTTTTCAAC TCTTAGACGC TGGTCTTAAA TCTCCAGCTT 1560 

CACCAGGTTC TATTATTGCG ATTATAGCTA CGGCGCCAAA AGGTGTTTGG CCCCATCTAA 1620 

ATGTTCTTTT AGGTGTTTTA GTGGCAGCAG TTGTTTCTTT CCTTGTAGCA GCCCTTATTC 1680 

TTCATGCAGA CAAGTCAACT GAGGATTCCC TCGAAGCTGC TCAGGCGGCT ACCCAAGCAG 1740 

CTAAGGCTCA GTCTAAAGGT CAGTTAGTAT CAACTTCTCT TGATGCAGTT GTTTCGACAG 1800 

ACTCAGTGGA AAAAATCATT TTCGCCTGCG ATGCTGGTAT GGGAAGCTCT GCTATGGGAG 1860 

CTAGTATTCT TCGAGATAAG GTTAAAAAAG CAGGTCTAGA GATTCCACTA TCTAATCAGG 1920 

CAATCTCAAA TTTGCTTGAT ACACCAAAAA CATTAATTGT TACTCAGGAA GAACTGACAC 1980 

CAAGAGCTAA AGACAAGAGT CCAAGTGCTA TTCATGTTTC TGTTGATAAT TTCTTAGCGT 2040 

CCTCTCGTTA TGATGAAATT GTAGCTTCAT TAACAGGAGC TTCTCCAATA GCAGAAATTG 2100 

AAGGAGATAT ACCAACTTCA GCACCAGTAG ATAGTCAGGA AAGTGACCTT AACCATATTG 2160 

ATGCTGTAGT AGTTGCTTAT GGTAAAGCAC AGGGAACTGC AACTATGCCC TGTGAAACGA 2220 

TTCGGGCTAT TTTTAGAAAC AAGAATATTC GTATTCCACT TTCTACTGCC AAAATTTCAG 2280 

AATTAGGTGA ATTTAATTCT AAAAACATAA TGATTGTAAC AACTATTTCT TTACAGGCAG 2340 

AAGTGCAGCA AGCAGCACCG AATTCTCAAT TTCTTATTGT GGATAGTTTA GTAACAACAC 2400 

CAGAATATGA CAAAATGGCT GCTAGAATGT ACAAATAGAA CTAGAGGTTT CTAAATTACG 2460 

AATGCTATTA ACCAAACGAG AAGAACAATT ATTGAAGCCT TTCCTACATG TACGGAAGCT 2520 

TTCAATGCAA GATATGACTG AAATCTTACA GGTTTCATCT AGAACAATTT ATCGAACTTT 2580 

ATCAGATTTG ACAGATAGCA TGGAGCAATA TGGAATCGAA ATAACGAAGC ATGGGAAATA 2640 

CTATATTTTG ACTGGAGAGT TGGATGATTT GCCGACAGAA CTTGAAGTGT TAGTTGAGTA 2700 

TAGTCCCCAA GAAAGACAAG AGTTGATTAC CTATCGCCTT CTGACTGAGA GTGGTTTTGT 2760 

CACCAATGAA GCATTGCAAG AGTGCACGAA AGTCAGTAAT GTAACTATTA TTCAGGATAT 2820 

TTCAGATATT GATAAGCGTC TTTTAGACTT TGATC^GAAA ATTGAACGAC AAAAAGGTTA 2880 

TCGGATTTCT GGTGATTCAG TTGGTAAGAG AAGATTTTTG GCTATTTTAC TGACAAACTG 2940 

TATCTCAGTA GCAGATTTTT CAACCGGTAA TTTTGGGAGC TTTGATATTT TAGAAGCAGA 3000 

TAGAACTGGG CTGGCCAGTC AGATTGTTAA TAAGCAACTG TCAGGTTTTC CAGATATGGA 3060 

TGCTAGGATG AAGATGTTTT TTGCGATCTT GTTATCTCTT ATAGGTCAGG AGCAAAACAT 3120 

TGAAAATTCA CCTAATACTA GTAAGCAGGC TTTGGAAATT TCTCAAAAAA TTTTTCAAGC 3180 

TTACTCTAAG CAGACTGCAC AATTTTATAG TATTCAGGAA ATTATCTATT TTGCGAGCAT 3240 

CTTGGATGAA TTAATCATTA AACGTCAGGA CAATCCGCTC TTTACGGAGA AATTTGATGG 3300 
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TGAATTTTTC TACAATATTT CAAATCTGAT TGATACGGTT TCCATGTATA CCAAGATTGA 3360 

CTTTTTTAAG GACAAGGTTT TATTCAATTT TCTTTTCCAT CATATTCGGC TCAGTTTAGG 3420 

CGTCCCTATC CTTTTTCAGG GTGAAAATTT GCCAGAATCT ATCCAGATTT TAGTTGAAAG 3480 

GAATAAATTT CTTTATACAG TCATCAGTCT TTTAGTGAAT GATATTTTTC CGAAATATCT 3540 

TCATACAGAG TATGAGTATG GCATGATTGC CCTACATTTT ATCTCTAGCT TAGGCCGTAG 3600 

TCCAGAGATT TATCCAGTCC GTGTTTTGCT TTTAACGGAT GAACGTCGGG TCACTAGAGA 3660 

TTTATTAGTC AGTAAAATTA AGAGTGTTGC TCCTTTTGTA GAGTTGATAG ATATTCAATC 3720 

TCTAGTAGAT TACCACAGTA TTGATCTCAG TCAGTATGAT TATATTTTAT CTACCAAGCC 3780 

GCTGACTAAT CAGGAAATCG ATGTAATTTC TAGTTTTCCA ACCGTCAAAG AATTGCTTGA 3840 

ATTACAGGAA CGACTTCAGT ATGTACAGGC ACATCGTACA ATTGTCGCGC GTGATGCTAT 3900 

CGCTCCAGAG AAAAGTTATG ACTTGCAAGA TTATTTAATA TCTAGTAGTC AGCTTTTGAG 3960 

TCAATTCGAG TTGGTTCAAT TGGAGAATAA TCAATCATTT GAGCACACGG TAGAACAAAT 4020 

CATCCAATAT CAGAAGAATG TGAGTGACAG AGCTTACCTA ACAAGAAAAT TGTTATCTCA 4080 

CTTCCAGAAT AGTCCTATGG CTATTCCTAA TACTGGTCTG GTGCTTTTAC ATAGTCAGTC 4140 

TAGCAAAGTA ACAACAAATA GTTTTACTAT GTTTGAACTC AAACTACCTA TCTCCGCATT 4200 

GTCAATGAAA CGAGAGGAAG AAGAGGTCAA AAGGTGTCTG CTAATGCTAA TGTCTAAAGA 4260 

AGCTAGCGAG GAAGCTAGAG ATTTAATGAC AGCTATTACT CAGTCGATTA TTGAAAATCA 4320 

TCTTTATACA GAGATTTACA AGACGGGAAA TCAATCCATT ATTTATCAGA TGCTAAATAC 4380 

TATTTTTAAC GAAAAAATTA AGAAATTGGA GAACTAATAT GAAACTTGAA AAACATTTGA 4440 

TTAAGCTTAA TAAACAATTT TCTAACAAGG AGGAAGCTAT TTGTTATTGT GGGCAAGTTC 4 500 

TTTATGAGGG TGGATATGTT AATGAAGACT ATATTGAAGC CATGATTGAG CGAGATAAAG 4 560 

AGCTATCTGT TTACATGGGT AACTTTATCG CCATACCGCA TGGAACAGAT GCAGCAAAAA 4 620 

ATGATGTCCT CAAGTCTGGT ATTACAGTCG TTCAAGTCCC TAGAGGGGTT GATTTTGGGA 4 680 

ATGTATCTAA CCCTCAAGTG GCAACGGTTC TTTTTGGTAT TGCTGGTATT GGTAATGAAC 4740 

ACTTAGAAAT TATTCAGAAA ATTTCTATCT TCTGTGCACA TGTAGATAAT GTTCTTAAAC 4800 

TAGCAGATGC TCAGTCAAAA GAGGAAGTAT TGCGCTTATT TGATCCTGTT GAATAATTGA 4860 

ATTTAGTCAT TTGTCATCTA GTATATATGT CCCTCAAATA GGAAAAGGAG AAATTGAATG 4920 

AAACATTCTG TTCATTTTGG TGCCGGTAAT ATCGGTCGTG GTTTTATAGG TGAAATTCTA 4980 

TTTAAAAATG GTTTCCATAT TGATTTTGTG GATGTCAATA ATCAGATAAT TCATCCTCTG 5040 
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AATGAAAAGG 


GCAAGTATGA 


AATTGAAATT 


GCACAGAAAG 


GACAGTCTCG 


TATAGAAGTA 


5100 


ACTAATGTGC 


CTGGCATTAA 


TAGCAAAGAA 


CATCCTGAGC 


AAGTCATTGA 


AGCGATTCAA 


5160 


AAGACGGATA 


TTATTACTAC 


TGCAATCGGA 


CCTAATATAC 


TCCCTTTTAT 


CGCCGAACTT 


5220 


CTAGCCAAAG 


GAATCGAAGC 


TCGCCGAGTT 


GCAGGAAATA 


CACAGGCATT 


GGATGTTATG 


5280 


GCCTGTGAAA 


ATATGATTGG 


CGGGTCTCAA 


rnvrrrATC 


AAGAAGTCAA 


GAAATATTTA 


5340 


AGTCCGGAAG 


GTTTGACATT 


TGCTGATAAC 


TACATAGGTT 


TTCCAAATGC 


TGCAGTAGAC 


5400 


AGGATTGTTC 


CAGCACAAAG 


TCACGAAGAT 


TcccrrnTG 


TTGTGGTCGA 


GCCCTTTAAT 


5460 


GAATGGGTCG 


TGGAAACCAA 


GCGTCTTAAA 


AATCCAGATT 


TACGTCTAAA 


AGATGTGCAT 


5520 


TATGAAGAAG 


ATTTAGAACC 


CTTTATTGAG 


CGAAAACTTT 


TTTCAGTCAA 


TTCTGGACAT 


5580 


GCAACTTCAG 


CTTACATTGG 


TGCGCATTAT 


GGTGCCAAGA 


CAATTTTGGA 


AGCTCTTCAA 


5640 


AATCCTAATA 


TTAAATCTCG 


GATTGAATCT 


GTATTAGCTG 


AAATTCGGAG 


TCTCTTGATT 


5700 


CCCAAATGGA 


ACTTTGATAA 


AAAAGAATTG 


GAGAATTATC 


ACAAAGTCAT 


TATACAACGA 


5760 


CTTGAAAACC 


CTTTCATAGT 


GGACGAGCTT 


AGTCGCGTAG 


CTCGTACTCC 


AATCCGAAAA 


5820 


TTAGGCTATA 


ATGAACGATT 


CATCCGGCCG 


ATACGTGAAT 


TGAAAGAACT 


CAGTTTGTCA 


5880 


TATAAAAACC 


TACTTAAAAC 


AGTTGGCTAT 


GTCTTTGACT 


ATCGCGATGT 


AAATGATGAA 


5940 


GAAAGTATTC 


GATTAGGTGA 


ATTGTTGGCT 


AAACAATCAG 


TCAAAGATGT 


TGTTATACAA 


6000 


GTTACAGGTT 


TAGACGACCA 


AGAATTGATT 


GAGCAAATTG 


TAGAGTATAT 


TTAATCTTTT 


6060 


TCGAAAATCT 


CTTCAAATCA 


GGTTAGCATC 


GCTTTGTCTT 


AGGCATATGT 


TGTTCTATCT 


6120 


ACAACCTCAA 


AGCAGTGCTT 


TGAGCTGACT 


CCGTCAGTCT 


TATCTGCAAT 


CTCAAAACAC 


6180 


TGTTTGAGTT 


ATCTGCGGTA 


ATCTTTCTAG 


CTTGTCTTTG 


ATTTTTGTTG 


TTATTTATAA 


6240 


GGTAAAAGAA 


GCTGGACAAA 


AAGTCTTCAA 


AATCGGGAAA 


AGGCAGCCTA 


TCGGGTGTTC 


6300 


AAAAATCTTG 


ATAGGATGTC 


CTTTATTATG 


GAAAGCCTTA 


TTGGATTTTC 


TCCTCAGATT 


6360 


GAGTTTTTGA 


TCAGCTTTAT 


GAGATAGGTC 


TTGCTAGAGA 


TGTAGCCCAT 


CATGTTATTT 


6420 


TTATGGACAG 


TGGGAAAATT 


GTTGAAAAAA 


ATAATGCCCA 


TCAATTCTTT AGTCGTCCAA 


6480 


GAGAAGAACG 


AACCAAGCAA 


TTTTGGAACG 


AATTCTTTCG 


AATGCGATCT 


ATATAGTAAA 


6540 


ATGAAACAAG AACAGGACAA ATCGATCAGG ACAGTCAAAT 


CGATTTCTAA 


AAATGTTTTA 


6600 


GAAGTAGAGG 


TGTACTATTC 


TAGTTTCAAT 


CTACTATATA 


ACTGAAAAAT 


TAGATAAATT 


6660 


AGTTTTGGAA 


AATGACTAAC 


CAAAAGATAT 


CCAAAGTAGT 


CTAAAATTGT 


CTATACTTTA 


6720 


TGAGTGTTTT 


AGTTAGGAAA 


AAGGCTTGTT 


GTCTATAATT 


GTCTGCATTA 


GTCTAGATTT 


6780 


TATTTATAGA 


AAATGTTATA 


ATAGACTGTA 


TTTAAAAAAT 


TTTAAGGAGA 


AATGACAGAA 


6840 



i 
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TGTCTGTATC ATTTGAAAAC AAAGAAACAA ACCGTGGTGT CTTGACTTTC ACTATCTCTC 6900 

AAGACCAAAT CAAACCAGAA TTGGACCGTG TCTTCAAGTC AGTGAAGAAA TCTCTTAATG 6960 

TTCCAGGTTT CCGTAAAGGT CACCTTCCAC GCCCTATCTT CGACCAAAAA TTTGGTGAAG 7020 

AAGCTCTTTA TCAAGATGCA ATGAACGCAC TTTTGCCAAA CGCTTATGAA GCAGCTGTAA 7080 

AAGAAGCTGG TCTTGAAGTG GTTGCCCAAC CAAAAATTGA CGTAACTTCA ATGGAAAAAG 7140 

GTCAAGACTG GGTTATCACT GCTGAAGTCG TTACAAAACC TGAAGTAAAA TTGGGTGACT 7200 

ACAAAAACCT TGAAGTATCA GTTGATGTAG AAAAAGAAGT AACTGACGCT GATGTCGAAG 7260 

AGCGTATCGA ACGCGAACGC AACAACCTGG CTGAATTGGT TATCAAGGAA GCTGCTGCTG 7320 
AAAACGGCGA CACTGTTGTG ATCGACTTCG TTGGTTCTAT CGACGGTGTT GAATTTGACG : 7380 

GTGGAAAAGG TGAAAACTTC TCACTTGGAC TTGGTTCAGG TCAATTCATC CCTGGTTTCC 7440 

AAGACCAATT GGTAGGTCAC TCAGCTGGCG AAACCGTTGA TGTTATCGTA ACATTCCCAG 7500 

AAGACTACCA AGCAGAAGAC CTTGCAGGTA AAGAACCTAA ATTCGTGACA ACTATCCACG 7560 

AAGTAAAAGC TAAAGAAGTT CCGGCTCTTG ACGATGAACT TGCAAAAGAC ATTGATGAAG 7620 

AAGTTGAAAC ACTTGCTGAC TTGAAAGAAA AATACAGCAA AGAATTGGCT GCTGCTAAAG 7 680 

AAGAAGCTTA CAAAGATGCA GTTGAAGGTG CAGCAATTGA TACAGCTGTA GAAAATGCTG 7740 

AAATCGTAGA ACTTCCAGAA GAAATGATCC ATGAAGAAGT TCACCGTTCA GTAAATGAAT 7800 

TCCTTGGGAA TTTGCAACGT CAAGGGATCA ACCCTGACAT GTACTTCCAA ATCACTGGAA 7860 

CTACTCAAGA AGACCTTCAC AACCAATACC AAGCAGAAGC TGAGTCACGT ACTAAGACTA 7920 

ACCTTGTTAT CGAAGCAGTT GCCAAAGCTG AAGGATTTGA TGCTTCAGAA GAAGAAATCC 7980 

AAAAAGAAGT TGAGCAATTG GCAGCAGACT ACAACATGGA AGTTGCACAA GTTCAAAACT 8040 

TGCTTTCAGC TGACATGTTG AAACATGATA TCACTATCAA AAAAGCTGTT GAATTGATCA 8100 

CAAGCACAGC AACAGTAAAA TAATCTTAAT AAACAGAAAA CCCACCTGAA TTGGTGGGTT 8160 

TTCTGATGCA CTATTTTCCA AAAATCTCTT TGAGGTCTCT GTCTGTAATC CCAATCATGG 8220 

CTGGGATGCG GTCCCAGTTT TCTTCGGTTA GGATGTAGGA TTGTTCAGAG GCACTTGATG 8280 

TGACTGTTTC AGAGACAGCT TGTTGCTTTT CTTCAACATT CTCCAGTAGA TCACTGAAGC 8340 

GTTCAATCAG ATAGGTTTTT CGGGCAGTTC CGATGTGTTG GGTAGCATAG TCGAAGGCTT 8400 

GTAATTCGCC TAGTAAGATG AGTTTGCTTT TGCCACGTGT AATGGCTGTG TAGATGAGAT 84 60 

TTCGCTCCAG CATACGTCGG CTAGCACTAG TAATCGGTAG GATGACAACT GGGAACTCAC 8520 

TTCCCTGAGA CTTATGAATA CTCATGGCAT AGGCCAAGCG AATCTTGTAC CATTCGTTAC 8580 
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GGGGGTAAGA GACTTCATTA CCATCAAAAT CAATGACAAT 
TGTATTTACC AGGAATCAGG TCTGTGATAG CTCCTAAATC 
CAGCATCGTT AACCAAATGA ATGACCCTGT CTCTCTTACG 
CAAAACTGAG TTGATCTTTT TGTGGGGGAT TGAGCAGGTC 
CATCAATCCC TGCCGTCCCT CGGTACATAG GAGCCAGAAC 
TACCATTTCT GAGGGCGGCA CCTAAGATTT TTTCAATGGT 
CAATTTCAAA GTAGGAACGG TCAGCTTTTT TTTGGGTGAA 



GTCGAATCTG 
CCAAGCGAGT 
TGACAGAAGG 
TGGAGAAGAG 
AGTCAGCATC 
GGCGATGTAT 
CAGTTGGAGC 
AAAGGGCATA 
CTGTCAGGAT 
CATACTCAAT 
CATGACTCTT 
CAGCGAAAAA 
CGATCAGGTA 
ACTCAAGGAG 
TTTCCATACA 
GACTTTCGAT 
TATCCTCAAC 
AGTCTTGAAT 
TCATCTCCGT 
AGAGTCCTGC 
CGCCATAGGT 



ACTAGCTAGG 
CTGAGGAATC 
TAGCTGATCA 
TTGATTGGCC 
TAGGTAATCT 
GGTCGCGCTA 
AGCAAGAAGA 
AACAGCAATG 
AAAGACCTTA 
TCCCAGTTCT 
CTGTTTTCCT 
GAGGCTGTTG 
GGAGAGCTCT 
AGTAAGGGTT 
GGCCTGAAAA 
GCCTAGTTCC 
CAGCTGGTAG 
CTGAAAGGCT 
TCCGTAGTTG 
GATGCCTTCT 
ATCCACGATT 



GTGACGATGG 
AAAGGAATAT 
CTGTCACCTA 
ACCCAAGTAT 
TCCAGATGAC 
GGCAAACCTG 
ATGGGCAGAT 
ATTCCATTGA 
TTCTGGATAG 
TGCTCGACAG 
TTTTCAAGGA 
TCAAAGATCT 
TGGGCAACTT 
TGTTCCAGCA 
AGACTGTGAA 
TCAGCTAGTT 



TTGATTCTTT 
GAAGTAGATC 
CGATGAGGAT 
CTACCATAGA 
TGGTATCATC 
TCAATTCATT 



CTCGTCTTGT TTCGATTCGG 
CCCATTAAAG ACATTGATTT 
ATAGTGACAC TGAGGAGCTT 
TTGCATGAGC TGATTGATAG 
TTGGATATCA CGGGCGGGAA 
GGCAGGAATA TGGCCACTAC 
ATCAGCTGGC AAGATGCCCT 
GCTTTGTCGA TAAATTTTTT 
CGCTAGAACC TGTCCAGGAC 
CTTACTGTTA GAAGAGATAT 
GAATTCATCC ACGATGATAA 
GTCACCTCTC ATTCCCAAGT 
CATGCGACGA GCAGCTCGAC 



TGCTTTTCTT CCTGAAGTCA AGTCCTTCTA 
TAACAGTTGT CTTACCAGTA CCAGGCCCAC 
CATCACAGAT AGCCTGTTTT TGAATGTTAT 



TAGTGATATG TTTTTGAATG 



TACGAACCAA 
TGGTATCAAT 
GGCTGGGGTC 
AATCCCGTGC 
CTAGACCGGC 
GGTCAGCAAT 



GTGACTGCGG 
CTGCTGAACC 
TAGTTCCACG 
TTCAACATAG 
GCGGAAGCGT 
GGTAAAGCCC 



GTTTCTAAAT 
ATGCCTTCCT 
TTGTCTTCTT 
GGACGGGAAG 
GTGTCCCCTG 
TCAGGAGCCT 
AAACCCTTGA 



GGATAATTTT CAACCACATC AAGGGTTTCT TCCTTGTAAA 
AGTTTGTTGG GAATGCCGTA GTTGGCTAGT TTGGCCAAAA 
AGACGGAGAG TGGAGACGAA AGCCTCGCGA TTTTTGGCAG 
AACTTTTCTG GGTGTTGCAA AATTTCGTCA ATGGTATTTT 
TTCTGAGCTG TCTTGAGACC AATCCCCTTG AAATGGCTAC 



TTGAAAAGTA CTTGACCAAG CCCTTACTAG TTGGTTTTGC GCGATCATAA CGACTGATTT 
GCAGTTGTTC TCCATACTTG GAGTGCTGGA CAATTTGCCC CCAAAAAGTA TAGTCTTCGC 



8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 
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9300 

9360 

9420 

9460 

9540 

9600 

9660 
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9340 

9900 

9960 
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10080 

10140 
10200 
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CCTCAATTAC ATCAGCCATG GTTCCTGTGA CAATGATTTC AAAATCATCA AAATCCTCTG 10440 

CCTCCGTATC GTCGATTTCT AGGAGGAGGA TGCGATAAAA ATTGCTCGGA TTTTCAAAAA 10500 

TAATCCGTTC AATAGTTCCT GAAAAATAAA CTTCCATAAA ATTCCTTTGC ATGAATAGGT 10560 

GAGAGTTGGG ATTGTTTTTA TTTTATACTC TTCGAAAATA TCTTCAAACC ACGTCAGCTT 10620 

CCATCTGCAA CCTCAAAACA GTATTTTGAG CTGACTTCGT CAGTTCTATC CACAACCTCA 10680 

AAACACTGTT TTAAGCAGCC TACGGCTAGC TTCCTAGTTT GTTCTTTGAT TTTCATTGAG 10740 

TATTTGTAAA TAAACAATCA CTTCTCACGA TAGAAGAAGA GGCTGAGATT GGTGATTCTC 10800 

TGCCTCTTAG GTTTCTTAAA ATGTTCCGAT ACGGGTGATT GGCCATAAGC GGAATTTAGC 10860 

TTCCCCTGTG ATATCTTTTG CTTTGAAGGT ACCTACGTGG CGGCTGTCGC TCGAAACCAA 10920 

GCGGTCATCT CCGAGGAGAA GGTATTCTCC TTCTGGAACA GTAAAGCTAA AGTTGGTGTT 10980 

GTAGTTGACA TCAACTGTGA AGGCTTGAGC TTTTTGAGCG ATACTTCTAA AGAAAGTTCC 11040 

TTTATTTCCT TCAAAGCCCT TGCCTGAGTA AGTGCTTTGG AGTTTCTCAT CCTTGAAGCG 11100 

TTTGATATAG TCTGCTAGAT AAGGCTCGTC CGTTTCTTTG TCATTGATGT AGAGTTTATC 11160 

ATTTTCGTAA CGAATGGTGT CGCCAGGCAT TCCAATCACG CGCTTGACGA TGTCCTTATT 11220 

GCCATCTTCC TCATGGGCCA CCACGATATC AAAACGGTCA ATAGGAAGGT GTTTTACAAC 11280 

GAAGAGAATT TCGCCATCCG CTAGGGTCGG ATCCATGGAA TGTCCTTCTA CGCGAACATT 11340 

GCTCCAAAAA AAGATACGAC TTAAAGCTAG TAATGACAGA ATTAGGAGGA ACAATCCCCA 11400 

CTCTTTTAAG AAATTTTTAA ATGAATTCAT AACTTACCTT TCTAAGCGTT TTTTCGCTTT 11460 

TTCAGTGTTT TTAAAGTGCA ATTTGGCGCA GAAGCTGAGT CCCTGCATAC CATAGGCTTG 11520 

CAAAATCTGG CTAGCCACCT TGTCAGAAGC CGTTCCAGCT CCACTTGGGA GCTGATAACC 11580 

CAGTTCTCGT CCCAAATTTT CAAGATTTTC CAGAAAGAGA TCACGCGCAA TGACAGAAGA 11640 

AACTGCGACA GACAAGTATT TGCCCTCAGC CTTTTCTTCT AAGCTGATAG GATTGCTGAA 11700 

ACGATTGGCC TCTTGTGCCA AGTACTTGTC ATAATTTTTA GCACTGGTAA AGGCATCAAT 11760 

CACAATTTTC TCAGGCTGAA CACCTTTTTG AAGGAGGAGA TAGATAGCCT GATTATGGAG 11820 

GGCAACCTTA ACCGAAACAG CGTTGTAGCG GTCTCCGATG ACCTCGTTGT ACTTGCTGGG 11880 

TGAGAGAAGG AGTGCCTGGT GCTGAATTTT TTCCTTGAGA ATAGGAGTAA TCTGACGGAT 11940 

CTTTTCGTCG GTCAGAGTCT TAGAATCCCC CACACCGAGT TTTCGTAAAA AGTCGTGCTG 12000 

GTCAGGTGTG ACAAAGGCAG CCACAACTGC AAGCCCACCA AAGTAGGAAC CATTTCCCAC 12060 

CTCATCTGTC CCAATTAAAG GAAGATTTTG TCCGCTGGTT TGCTCTACAG CTTGATAGCC 12120 
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AAAGAAACTG 


GCGTATTTTT 


CAGCCCCTTC 


ACCCTGAAGC 


AAGATTTTTC 


CAGAAGTATA 


12180 


GATAGAAACC 


GTTGCTTGAG 


GTAGTTTCAA 


AAAGTAGCGG 


ATATAGGGAT 


TCTTGCTAGG 


12240 


AGCCAGACTG 


GTTTGATAGT 


GTTCAAGAAA 


AGCCTGAATA 


TCCTTTTCGC 


TTGGTGTGAG 


12300 


TGTGATACTT 


GCCATAGTTT 


CTATTGTACC 


ACAAAAGCAG 


TAAAATTTGT 


AAAAACTGAC 


12360 


AAAATTAGCG 


AATTTTGGTA 


TAATATCGTG 


AGGTGAATTT 


TATGGCAAAT 


CTAAATCGAT 


12420 


TCAAATTTAC 


ATTCGGGAAA 


AAATCGTTAA 


CCTTGACAAG 


CGAACATGAC 


AACCTTTTTA 


12480 


TGGAGGAAAT 


CGCTAAGGTT 


GCGACAGAAA 


AATACCAAGC 


AATTAAAGAA 


CAAATGCCTA 


12540 


GCGCAGATGA 


TGAAACAATC 


GCTCTTTTGT 


TGGCAGTCAA 


CTGTTTATCA 


ACTCAGCTCA 


12600 


GCCGTGAGAT 


TGAATTTGAC 


GATAAGGAGC 


AAGAGCTAGA 


AGAACTCCGT 


CACAAGCTTG 


12660 


TGACTTGTAA 


GCAAGAACAG 


AGCAAGATTG 


AGGATTCCTT 


ATGATTTCAT 


TCCTTCTTCT 


12720 


ATTGGTCTTG 


GTTTGGGGAT 


TTTATATCGG 


CTATCGGAGA 


GGCCTGCTCT 


TACAGGTTTA 


12780 


TTACCTGATT 


TCAGCCATGG 


CATCGGCTTT 


TATGGCTGGC 


CAGTTTTATA 


AGGGGCTTGG 


12840 


AGAGCAATTC 


CATTTATTGC 


TCCCTTATGC 


AAATTCGCAG 


GAAGGTCAGG 


GGACTTTCTT 


12900 


TTTCCCATCG 


GATCAACTCT 


TTCAGCTGGA 


TAAGGTCTTT 


TATGCACCTA 


TCGGCTACTT 


12960 


GCTTGTATTT 


GGGATTGTCT 


ATAGCATTGG 


TCGTTTACTT 


GGTCTTCTCT 


TACACTTGAT 


13020 


TCCTAGCAAA 


AAACTGGGTG 


GTAAGTTGTT 


CCAAGTTTCA 


GCAGGTATCT 


TGTCCATGTT 


13080 


GGTGACCTTA 


TTTGTCTTGC 


AAATGGCCTT 


GACAATCTTG 


GCCACCATCC 


CCATGGCAGT 


13140 


TATACAAAAT 


CCTCTTGAAA AGAGTATCGT 


CGCAAAACAC 


ATCATCCAGA 


GCATACCGGT 


13200 


AACAACCACT 


TGGCTCAAAC 


AAATCTGGGT 


GACAAATTTA 


ATCGGATAAA 


AAGGGCAGGA 


13260 


GTTTTCCTAG 


CCCTTTGTTT 


ACAGATTTGA 


CTCGAATCTA 


TCAGAATGTA 


AAAAGCTACC 


13320 


ACACCTAGAC 


ATTCAAAGAC 


AAGGAAATAA 


AGATGAATAA 


GAAAATATTA 


GAAACATTAG 


13380 


AGTTCGATAA 


GGTCAAGGCC 


TTGTTTGAGC 


CTCATTTGTT 


GACCGAGCAG 


GGCTTGGAGC 


13440 


AATTGAGACA 


ACTGGCTCCG 


ACTGCCAAAG 


f* h *p h. & & ilt 
Lnun I nAfln I 




TTTGCTGAGA 


13500 


TGAAGGAAAT 


GCAGGCTCTT 


TTCGTCGAGC 


AACCCCATTT 


TACTATTCTC 


TCAACTAAGG 


i3560 


AAATTGCAGG 


AGTCTGCAAG 


AGGTTGGAGA 


TGGGAGCGGA 


TCTCAATATC 


GAGGAGTTCC 


13620 


TACTCTTGAA 


ACGCGTGCTT 


CTTGCCAGCC 


GAGAACTTCA 


AAATTTTTAC 


ACCAATCTGG 


13680 


AAAATGTCAG 


CTTGGAAGAA 


TTAGCCCTTT 


GGTTTGAGAA 


ATTACATGAT 


TTTCCGCAAT 


13740 


TACAAGGAAA 


TCTTCAGGCC 


TTTAATGATG 


CGGGTTTCAT 


TGAAAATTTT 


GCCAGTGAAG 


13800 


AATTGGCGCG 


AATCCGTCGA 


AAAATACATG 


ATAGCGAGAG 


TCAGGTACGC 


GATGTTTTAC 


13860 


AAGACTTGCT 


CAAGCAAAAA 


GCGCAGCTGT 


TGACGGAAGG 


AATTGTTGCT 


AGCAGAAATG 


13920 
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CCCGTCAGGT TTTACCACTC AAAAACACCT ACCGCAATAA GATTGCAGGT GTCGTTCATG 13980 

ATATTTCTGC TAGTGGAAAC ACCGTCTATA TCGAACCCCG TGAGGTAGTC AAACTGAGCG 14040 

AAGAAATTGC TAGTCTGCGA GCAGATGAGC GCTATGAAAT GCTTCGCATT CTCCAAGAAA 14100 

TTTCTGAGCG TGTCCGCCCT CATGCGGCTG AGATTGCTAA TGACGCTTGG ATTATCGGTC 14160 

ATCTGGACTT GATTCGTGCC AAGGTTCGAT TTATCCAAGA AAGACAAGCA GTCGTGCCTC 14220 

AGCTCTCAGA AAATCAAGAG ATTCAACTGC TCCATGTCTG CCATCCTTTG GTCAAAAATG 14280 

CCGTCGCAAA TGATGTCTAT TTTGGTCAAG ATTTAACAGC TATTGTCATT ACAGGTCCCA 14340 

ATACAGGTGG GAAGACCATC ATGCTCAAAA CTCTGGGCTT GACACAGGTC ATGGCCCAGT 14400 

CAGGATTGCC GATTTTAGCA GACAAGGGAA GTCCTGTTGG TATTTTTGAA GAAATCTTTG 144 60 

CTGATATTGG AGATGAGCAG TCTATTGAGC AGAGCTTGTC TACCTTCTCT AGTCATATGA 14520 

CCAATATCGT GGATATTCTT GGCAAGGTCA ACCAACATTC ACTCTTACTT TTGGATGAGT 14580 

TGGGGGCTGG TACTGATCCC CAAGAGGGAG CAGCCCTTGC CATGGCTATT CTGGAGGACC 14 640 

TTCGCCTGCG TCAAATCAAG ACCATGGCGA CGACCCACTA TCCAGAACTC AAGGCCTACG 14700 

CTATTGAGAC AGCCTTTGTG CAAAATGCCA GTATGGAGTT TGATACTGCA ACTCTTCGCC 14760 

CGACCTATCG CTTTATGCAG GGTGTTCCTG GCCGAAGTAA TGCCTTTGAA ATTGCCAAAC 14820 

GTCTAGGCCT ATCTGAAGTT ATCGTAGGAG ATGCCAGTCA GCAGATCGAT CAGGACAATG 14880 

ACGTCAATCG TATCATTGAG CAATTAGAAG AGCAGACGCT GGAAAGCCGC AAACCTTTGG 14940 

ACAATATCCG TGAGGTGGAG CAAGAAAATC TCAAGATGAA CCGTGCGCTA AAAAAACTCT 15000 

ACAACGAGCT TAATCGTGAA AAGGAAACCG AGCTTAACAA GGCGCGTGAA CAGGCTGCTG 15060 

AGATTGTGGA TATGGCCCTA AGTGAAAGTG ACCAGATTCT CAAAAATCTC CACAGTAAAT 15120 

CCCAACTCAA GCCCCACGAA ATCATTGAAG CCAAGGCCAA GTTGAAAAAA TTGCCTCCTG 15180 

AAAAAGTGGA CTTGTCTAAA AATAAGGTCC TTCAAAAGGC CAAGAAAAAA CGAGCTCCAA 15240 

AGGTGGGAGA TGATATCGTG GTTCTCAGTT ATGGTCAGCG TGGTACCTTG ACCAGTCAAC 15300 

TCAAGGACGG TCGCTGGGAA GCCCAAGTTG GCTTGATTAA GATGACCTTG GAAGAGAAAG 15360 

AGTTTGATCT TGTTCAAGCC CAGCAAGAAA AACCAGTCAA GAAGAAACAG GTCAATGTTG 15420 

TGAAACGAAC TTCTGGGCGA GGACCTCAAG CTAGACTGGA TCTTCGAGGC AAGCGCTATG 15480 

AAGAAGCCAT GAATGAGCTA GATACCTTCA TCGACCAAGC CTTGCTTAAC AATATGGCTC 15540 

AAGTTGATAT CATCCATGGT ATCGGAACAG GAGTCATCCG TGAAGGAGTT ACCAAATACT 15600 

TGCAAAGAAA CAAACATGTC AAGAGTTTCG GCTATGCCCC ACAAAATGCT GGAGGCAGTG 15660 
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GTGCGACTAT 


TGTCACTTTT 


AAAGGATAGC 


AGTATTCTGG 


ACTTTATAAA 


GTAAAAACTG 


15720 


TTGAACTAAT 


TTTTACTAAT 


AAACACATTG 


ACAAAAGCCA 


ACATTTTTTG 


TAAAATTAGA 


15780 


ATCAATTAAA 


TACCAACACC 


GAATGAAGTT 


TAATAGAAGT 


GGGGAATCGT 


TTGATTTTCC 


15840 


ATGACTGTAA 


ATGGACGGAA 


CTCTGGAGAG 


ACCGTAAAGG 


CACCGAAGGG 


CAAGGCAGGC 


15900 


AACTGCTCAA 


ACTCTCAGGT 


AAAAGGACAG 


AGCTAGGATA 


GATXGCTTTT 


TAGCATTTAT 


15960 


CTAAGCATTC 


CAGAGTACAT 


GTATCTTGCA 


TGTGCTCTTT 


CTTTTGGGGT 


TGAAACGATA 


16020 


GGAGAAGGAA 


ATGTTAGAAT 


TGCTTAAATC 


AATCGATGCT 


TTTGCTTGGG 


GACCGCCCCT 


16080 


CTTGATTTTA 


TTGGTCGGAA 


CAGGGATTTA 


CCTAACTATT 


CGGCTAGGAC 


TCTTGCAGGT 


16140 


TTTGCGTCTA 


CCCAAGGCCT 


TTCAGCTTAT 


TTTTATCGAG 


GATAAGGGAC 


ATGGTGATGT 


16200 


ATCCAGTTTT 


GCAGCTCTGT 


GTACAGCCTT 


GGCATCAACT 


GTTGGAACAG 


GAAATATCAT 


16260 


AGGAGTTGCG 


ACGGCTATCA 


AGGTTGGTGG 


ACCAGGAGCT 


CTATTTTGGA 


TGTGGATGGC 


16320 


GGCTTTCTTT 


GGAATGGCTA 


CCAAGTATGC 


GGAAGGACTC 


TTGGCCATCA 


AATACCGCAC 


16380 


CAAGGACGAC 


CATGGTGCAG 


TAGCGGGAGG 


TCCCATGCAT 


TATATCCTTC 


TAGGGATGGG 


16440 


AGAAAAGTGG 


CGACCACTTG 


CTGTTTTGTT 


TGCAGTAGCA 


GGAGTATTGG 


TTGCTCTCTT 


16500 


GGGAATCGGA 


ACCTTCACCC 


AAGTCAACTC 


GATTGCAGAA 


TCTATCCAAA 


ATACAACGAC 


16560 


GATTTCGCCA 


GCCATCACAG 


CTCTCGTCTT 


GTCTGTCTTT 


GTAGCGATTG 


CAGTCTTTGG 


16620 


TGGACTCAAG 


TCTATTTCTA 


AGGTTTCAAC 


TACTGTTGTT 


CCTTTTATGG 


CCATCATTTA 


16680 


TATCTTAGGA 


ACTCTTACAG 


TTATTTTCTT 


TAATATCGGA 


AAAATCCCTG 


GCACAATCGC 


16740 


TTTAGTCTTT 


ACCTCAGCTT 


TTAGTCCCCT 


TGCTGCGGTA 


GGTGGATTTG 


CTGGTGCTAG 


16800 


CGTTCGGATG 


GCTATTCAAA 


ATGGTGTGGC 


GCGTGGTGTG 


TTCTCAAACG 


AATCTGGTCT 


16860 


GGGTTCTGCT 


CCTATTGCAG 


CTGCAGCTGC 


CAAGACAAAT 


GAACCAGTAG 


AGCAAGGTTT 


16920 


GATTTCCATG 


ACAGGAACCT 


TTATTGATAC 


CCTCATCATT 


TGTACTCTAA 


CTGGTTTGAC 


16980 


CATCTTGGTA 


ACTGG 










16995 


(2) INFORKATION FOR SEQ ID NO: 83: 









(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 28473 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
CCGGGGCTTT TGTAGTATAA TAGAGATACG TTTTGAAAGT AGGAGGTATC TATGGACTTA 
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ACTAAGCGCT TTAATAAACA GTTAGATAAA ATTCAAGTTT CGTTGATTCG TCAGTTTGAC 120 

CAGGCTATTT CGGAGATTCC TGGGGTCTTG CGTTTGACCT TGGGGGAACC TGATTTTACA 180 

ACGCCAGACC ATGTCAAGGA GGCGGGCAAG CGAGCGATTG ATCAGAACCA ATCCTACTAT 240 

ACAGGGATGA GTGGTCTGCT GACTCTACGT CAGGCAGCCA GTGACTTTGT TAAGGAAAAG 300 

TACCAACTGG ACTATGCTCC TGAAAATGAA ATCTTGGTTA CAATTGGGGC GACAGAGGCT 360 

TTATCTGCGA CTTTGACGGC TATTTTGGAA GAGGGAGACA AGGTACTTTT GCCAGCTCCT 420 

GCTTATCCAG GCTATGAACC GATTGTTAAC TTAGTTGGGG CAGAAATTCT TGAGATTGAT 480 

ACGACTGAAA ATGGTTTTGT CTTGACTCCT GAGATGTTGG AGAAGGCCAT TTTGGAGCAG 540 

GGTGATAAGC TCAAGGCGGT TATTCTCAAC TATCCAGCCA ATCCGACAGG AATTACCTAC 600 

AGTCGAGAGC AGTTAGAGGC CTTGGCAGCT GTTTTACGCA AGTACGAAAT TTTTGTTGTC 660 

TGTGATGAGG TTTACTCAGA ATTGACCTAC ACAGGCGAAG CCATGTGTCT CTAGGAACGA 720 

TGTTGAGAGA CCAGGCTATT ATTATCAATG GTTTGTCTAA ATCGCATGCC ATGACAGGTT 780 

GGCGTTTGGG GCTGATTTTC GCTCCTGCGA CCTTCACAGC CCAGTTAATC AAGAGTCACC 840 

AGTACTTGGT CACTGCCGCA AATACCATGG CGCAACATGC TGCGGTAGAA GCCTTGACGG 900 

CTGGTAAAAA CGATGCGGAC CCATGAAGAA GGAATATATC CAACGTCGGG ACTATATCAT 960 

CGAAAAAATG ACTGCTCTTG GTTTTGAGAT TATCAAACCA GACGGTGCCT TCTATATTTT 1020 

TGCTAAAATT CCAGCGGGCT ACAATCAAGA CTCCTTTGCT TTTCTGAAGG ATTTTGCTCA 1080 

GAAGAAGGCC GTTGCCTTTA TCCCTGGTGC AGCCTTTGGA CGTTACGGGG AAGGCTACGT 1140 

CCGCCTATCT TATGCAGCCA GCATGGAGAC TATCAAAGAA GCCATGAAAC GACTTGAGGA 1200 

GTACATGAGA GAAGCATGAT TCAGTCTATC ACGAGTCAAG GCTTGGTGCT TTACAATCGC 1260 

AATTTTCGTG AGGATGACAA GCTCGTCAAA ATTTTTACAG AGCAGGTTGG CAAACGCATG 1320 

TTTTTTGTCA AACACGCTGG TCAGTCTAAG CTGGCGCCTG TTATTCAGCC CTTGGTGCTG 1380 

GCACGATTTC TCTTGCGAAT CAATGATGAC GGACTCAGTT ACATCGAAGA CTATCATGAG 1440 

GTCATGACTT TTCCCAAGAT TAATAGTGAC CTCTTTGTCA TGGCCTATGC GACCTATGTG 1500 

GCAGCTCTTG CAGATGCTAG TTTGCAGGAC AATCAGCAGG ATGCTCCCTT GTTTGCTTTT 1560 

TTGCAAAAGA CTTTGGAGTT GATGGAAGCA GGCPTGGATT ATCACGTTTT GACCAATATT 1620 

TTTGAAATTC AAATTTTGAC TCGATTTGGA ATCAGCCTCA ATTTTAATGA GTGTGTCTTC 1680 

TGCCATCGGG TTGGTCAGGC TTTTGACTTT TCTTTCAAAT ATGGAGCCTG CCTCTGTCCA 1740 

GAGCATTATC ATGAGGATAA GAGACGTTGT CATCTCAATC CCAATATCCC CTATCTGCTC 1800 
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AATCAATTTC AAGCTATTGA TTTTGAGACT TTGGAGACCA TTTCGCTCAA GCCTGGAATC 1860 

AAGCAAGAGC TACGCCAATT TATGGATCAA TTATATGAAG AGTACGTTGG GATTCACCTA 1920 

AAATCAAAGA AATTTATTGA TTCCCTAGCA GACTGGGGAC AATTACTAAA AGAGGAAAAG 1980 

AAATGAAAAA AATCGCAGTA GATGCCATGG GGGGCGATTA CGCACCTCAG GCCATTGTTG 2040 

AGGGTGTCAA TCAAGCCCTA TCTGACTTTT CAGATATCGA GGTTCAACTT TACGGAGATG 2100 

AAGCTAAAAT CAAGCAATAT CTGACAGCGA CAGAGCGCGT CAGCATTATC CATACGGATG 2160 

AGAAGATTGA TTCGGATGAT GAACCTACGA GAGCTATTCG GAATAAGAAA AATGCCAGTA 2220 

TGGTATTGGC AGCCAAGGCT GTCAAAGATG GTGAAGCAGA CGCTGTCCTT TCGGCTGGGA 2280 

ATACAGGTGC CTTGTTGGCA GCAGGATTCT TCATCGTGGG TCGTATCAAG AATATCGACC 2340 

GTCCTGGACT CATGTCTACC TTGCCTACCG TTGATGGAAA AGGTTTTGAC ATGCTAGACC 2400 

TTGGTGCCAA TCCAGAAAAT ACAGCCCAGC ACCTCCATCA ATATGCGGTT CTAGGTTCCT 2460 

TCTATGCTAA AAATGTCCGT GGCATTGCGC AACCACGCGT TGGTTTGCTC AACAACGGAA 2520 

CAGAGAGTAG CAAGGGCGAC CCGCTTCGTA AGGAAACTTA TGAATTACTG GCGGCTGATG 2580 

AAAGTTTGAA CTTTATCGGA AACGTGGAAG CGCGTGATTT GATGAATGGC GTTGCAGATG 2640 

TTGTTGTGGC AGATGGTTTC ACGGGAAACG CTGTGCTCAA ATCCATCGAA GGGACAGCTA 2700 

TGGGAATCAT GGGCTTGCTC AAGACAGCTA TTACAGGTGG TGGTCTTCGA GCGAAACTAG 2760 

GTGCCCTCCT TCTCAAGGAC AGCCTCAGTG GTTTGAAAAA ACAGCTCAAT TATTCAGATG 2820 

TTGGTGGAGC GGTCTTGTTT GGTGTTAAGG CACCTGTTGT CAAGACTCAT GGCTCAAGCG 2880 

ATGCCAAGGC TGTTTATAGT ACGATTCGTC AGATCCGTAC CATGCTAGAA ACAGACGTGG 2940 

TTGCCCAGAC TGCGCGTGAA TTTTCAGGAG AATAAAAGAG ATGACAGAAA AAGAAATTTT 3000 

TGACCCTATT GTGACCATTA TCCAAGAGCG ACAGGGAGAG GACTTTGTCG TGACAGAATC 3060 

CTTGAGTCTG AAAGACGATT TGGATGCGGA TTCTGTTGAC TTGATGGAGT TTATCTTGAC 3120 

TCTGGAAGAT GAATTTAGTA TCGAAATCAG CGATGAAGAA ATTGACCAAC TCCAAAACC 7 3L80 

AGGAGATGTG GTTAAAATCA TTCAAGGAAA ATAGCAATCG GAGTTCCAAG TCAACGGAAG 1240 

TAGATGGTTT TTAGAAATGA GAAATATCGG ACAAGCTGGT AAAATCTTGG CTGACAGTGC 3300 

TTATCAAGGG CTCATGAAGA TATATCCTCA AGCACAAACT CCACGTAAAT CCAGCAAACT 3360 

CAAGCCGCTA ACAGTTGAAG ATAAAGCCTG TAATCATGCG CTATCTAAGG AGATAAGCAA 3420 

GGTTGAGAAT ATCTTTGCCA AAGTAAAAAC GTTTAAAATG TTTTCAACAA CCTATCGAAA 3480 

TCATCGTAAA CGCTTCGGAT TACGAATGAA TTTGATTGCT GGTATTATCA ATCATGAACT 3540 

AGGATTCTAG TTTTGCAGGA AGTCTAATAG TAAAAAAGTG ATTAGAAAAC ATCTTTTTTA 3600 
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AAAATAGAGA TGATTTTCAA ACAAAAAAGC TAATTCAAGA CGTTTCGATG CCAATTCAAG 3660 

ATTTGGATGA AAAAAATTAA TAGATACTGT TATACTAAAC TTGTCAAGTT TGTAACAAGA 3720 

CAAATATTAA AAATAAAAAA GAGGTATTCG TTATGAATAC AAAAACGATG TCACAATTTG 3780 

AAATTATGGA TACTGAGATG CTTGCTTGCG TTGAAGGTGG CGCATGCAAT TGGGGAGATT 3840 

TTGCCAAAGC AGGTGTTGGA GGAGGAGCAG CACGAGGTCT TCAGCTAGGA ATTAAAACAA 3900 

GAACATGGCA AGGTGCAGCA ACTGGTGCTG TGGGAGGAGC TATACTTGGA GGTGTGGCCT 3960 

ATGCAGCGAC ATGTTGGTGG TAATTATGGA TTTTAAAAGT TTTATTATTG GTTTAGTAGT 4 020 

TGGTATATTT GGTCCTTATA TGGATGATTT AATTAGAAAA AAATTTTTAA AGTCTTCGGA 4080 

GAAGAAAACA GAAAAATCTG TTAAAAAATA ATCAAAACTA TAAATGATGA ATCTGAATCA 4140 

AAATTATTTT GCGCATGTAA AGAGGAGTCT TATAGTAACG AGTCAAAAAA GGAGTAACTA 4200 

TGAATCGTAA TTTAGAACGG TGTTATCTAT TCTGACTACG AATAGATCAT ACCAGAGGTA 4260 

CCTTAGAAAT AGCAGAGACA TTAGAAATTG AAGTAATAAA TAGGATGTCG TAAGTGTTAC 4320 

TATCAATGAT TTATTTGTTT CAAGCTTGCC TAGGGTGACA CTAAAAAATC AATTTCCTTT 4380 

CAATAGCATA TTTTTAGTGG GCAGGACTCT TGTTCTGCCT ATTTTTTTAT CCAAAAAGTG 4440 

CAGTTGGGAG GGAGATAGGC TCATTTGGGA AGGAAGTCCA GTTTTT G TTT AGTGATTGGG 4500 

GTAAGATAGT TGTTATCAGA TGAGTTAATA CTCTTCGAAA ATCAAATTCA AACCACGTCA 4560 

ACGTCGCCTT GCCGTATATA TGTGACTGAC TTCGTCAGTC CTATCTACAA CCTCAAAACA 4620 

GTGTTTTGAG CAGCCTACGG CTAGTTTCCT AGTTTGCTCT TTGATTTTCA TTGAGTATTA 4680 

GGGAAAAGGA GATGAATATG AAATTTGGGA AACGTCATTA TCGTCCGCAG GTGGATCAGA 4740 

TGGACTGCGG TGTAGCTTCA TTAGCCATGG TTTTTGGCTA CTATGGTAGT TATTATTTTT 4800 

TGGCTCACTT GCGAGAATTG GCTAAGACGA CCATGGATGC GACGACGGCT TTGGGCTTGG 4860 

TCAAGGTGGC AGAGGAGATT GGTTTTGAGA CGCGAGCCAT TAAGGCAGAT ATGACGCTTT 4920 

TTGACTTGCC GGATTTAACT TTTCCTTTTG TTGCCCATGT GCTTAAGGAA GGGAAATTGC 4980 

TCCACTACTA TGTGGTGACT GGGCAGGATA AGGATAGCAT TCATATTGCC GATCCAGATC 5040 

CCGGGGTGAA GTTGACTAAA CTGCCACGTG AGCGTTTTGA GGAAGAATGG ACAGGAGTGA 5100 

CTCTTTTTAT GGCACCTAGT CCAGACTATA AGCCTCATAA GGAACAAAAA AATGGTCTGC 5160 

TCTCTTTTAT CCCTATATTA GTGAAGCAGC GTGGCTTGAT TGCCAATATC GTTTTGGCAA 5220 

CACTCTTGGT AACCGTGATT AACATTGTGG GTTCTTATTA TCTGCAGTCT ATCATTGATA 5280 

CCTATGTGCC AGATCAGATG CGTTCGACAC TAGGGATTAT TTCTATTGGG CTAGTCATCG 5340 
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TCTACATCTT CCAGCAAATC TTGTCTTACG CTCAGGAGTA TCTCTTGCTT GTTTTGGGGC 5400 

AACCCTTCTC GATTGACGTG ATTTTGTCCT ATATCAAGCA TGTTTTTCAC CTCCCTATGT 5460 

CCTTCTTTGC GACACGCAGG ACAGGGGAGA TCGTGTCTCG TTTTACAGAT GCTAACAGTA 5520 

TCATCGATGC GCTGGCTTCG ACCATCCTTT CGATTTTCCT AGATGTGTCA ACGGTTGTCA 5580 

TTATTTCCCT TGTTCTATTT TCACAAAATA CCAATCTCTT TTTCATGACT TTATTGGCGC 5640 

TTCCTATCTA CACAGTGATT ATCTTTGCCT TTATGAAGCC GTTTGAAAAG ATGAATCGGG 5700 

ATACCATGGA AGCCAATGCG GTTCTGTCTT CTTCTATCAT TGAGGACATC AACGGTATTG 5760 

AGACTATCAA GTCCTTGACC AGTGAAAGTC AGCGTTACCA AAAAATTGAC AAGGAATTTG 5820 

TGGATTATCT GAAGAAATCC TTTACCTATA GTCGAGCAGA GAGTCAGCAA AAGGCTCTGA 5880 

AAAAGGTTGC CCATCTCTTG CTTAATGTCG GCATTCTCTG GATGGGGGCT GTTCTGGTCA 5940 

TGGATGGCAA GATGAGTTTG GGGCAGTTGA TTACCTATAA TACCTTGCTG GTTTACTTTA 6000 

CTAATCCTTT GGAAAATATC ATCAATCTGC AAACCAAGCT TCAGACAGCG CAGGTTGCCA 6060 

ATAACCGTCT AAATGAAGTG TAT CT ACT AG CTTCTGAGTT TGAGGAGAAG AAAACAGTTG 6120 

AGGATTTGAG CTTGATGAAG GGAGATATGA CCTTCAAGCA GGTTCATTAC AAGTATGGCT 6180 

ATGGTCGAGA TGTCTTATCG GATATCAATT TAACCGTTCC CCAAGGGTCT AAGGTGGCTT 6240 

TTGTGGGGAT TTCAGGGTCA GGTAAGACGA CTTTGGCCAA GATGATGGTT AATTTTTACG 6300 

ACCCAAGTCA AGGGGAGATT AGTCTGGGTA GTGTCAATCT CAATCAGATT GATAAAAAAG 63 60 

CCCTGCGCCA GTACATCAAC TATCTGTCTC AACAGCCCTA TGTCTTTAAC GGAACGATTT 6420 

TGGAGAATCT TCTTTTGGGA GCCAAGGAGG GGACGACACA GGAAGATATC TTACGGGCGG 64 80 

TCGAATTGGC AGAGATTCGA GAGGATATCG AGCGCATGCC ACTGAATTAC CAGACAGAAT 6540 

TGACTTCGGA TGGGGCAGGG ATTTCAGGTG GTCAACGTCA GAGAATCGCT TTGGCGCGTG 6600 

CTCTCTTGAC AGATGCGCCG GTCTTGATTT TGGATGAGGC GACTAGCAGT TTGGATATTT 6660 

TGACAGAGAA GCGGATTGTC GATAATCTCA TTGCTTTGGA CAAGACCTTG ATTTTCATTG 6720 

CTCACCGCTT GACTATTGCT GAGCGGACAG AGAAGGTAGT TGTCTTGGAT CAGGGCAAGA 6780 

TTGTCGAAGA AGGAAAGCAT GCTGATTTGC TTGCACAGGG TGGCTTTTAC CCCCATTTGG 6840 

TCAATAGCTA GAAAGAGGAG AGGATGAAAC CAGAATTTTT AGAAAGTGCG GAGTTTTATA 6900 

ATCGTCGTTA CCATAATTTT TCCAGTAGTG TGATTGTACC CATGGCCCTT CTGCTTGTGT 6960 

TTTTACTTGG CTTTGCAACT GTTGCAGAGA AGGAGATGAG TTTGTCCACT AGAGCTACTG 7020 

TCGAACCTAG TCGTATCCTT GCAAATATCC AGTCAACTAG CAACAATCGT ATTCTTGTCA 7080 

ATCATTTGGA AGAAAATAAG CTGGTTAAGA AGGGGGATCT TTTGGTTCAA TACCAAGAAG 7140 



i 
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GGGCAGAGCG TGTCCAAGCG GAGTCCTATG CCAGTCAGTT GGACATGCTA AAGGATCAAA 7200 

AAAAGCAATT GGAGTATCTG CAAAAGAGCC TGCAAGAAGG GGAGAACCAC TTTCCAGAGG 7260 

AGGATAAGTT TGGCTACCAA GCCACCTTTC GCGACTACAT CAGTCAAGCA GGCAGTCTTA 7320 

GGGCTAGTAC ATCGCAACAA AATGAGACCA TCGCGTCCCA GAATGCAGCA GCTAGCCAAA 7380 

CCCAAGCCGA AATCGGCAAC CTCATCAGTC AAACAGAGGC TAAAATTCCC GATTACCAGA 74 40 

CAGCTAAGTC AGCTATTGAA ACAGGTGCTT CCTTGGCCGG TCAGAATCTA GCCTACTCTC 7500 

TTTACCAGTC CTACAAGTCT CAGGGCGAGG AAAATCCCCA AACTAAGGTT CAGGCAGTTG 7 560 

CACAGGTTGA AGCACAGATT TCTCAGTTAG AATCTAGTCT TGCTACTTAC CGTGTCCAGT 7 620 

ATGCAGGTTC AGGTACCCAG CAAGCCTATG CCTCAGGGTT AAGCACTCAA TTGGAATCCC 7 680 

TTAAATCCCA ACACTTGGCA AAGGTTGGTC AGGAATTGAC CCTTCTAGCC CAGAAAATTT 7740 

TGGAGGCAGA GTCAGGTAAG AAGGTACAGG GAAATCTTTT AGACAAGGGG AAAGTTACGG 7800 

CGAGTGAGGA TGGGGTGCTT CATCTTAATC CTGAGACCAG TGATTCTACC ATGGTTGCAG 7860 

AAGGTGCCCT ACTAGCCCAA CTTTATCCAT CTTTGGAAAG AGAAGGGAAA GCCAAACTCA 7920 

CAGCTTATCT AAGTTCAAAA TATGTAGCAA GAATCAAGGT CGGTGATTCT GTTCGCTATA 7980 

CTACGACTCA TGATGCCGGG AATCAACTTT TCCTAGATTC TACTATTACA AGTATTGATG 8040 

CGACAGCTAC TAAGACTGAG AAAGGGAATT TCTTTAAAAT CGAGCCGGAC ACTAATCTAA 8100 

CTTCGGAGCA GGCTGAAAAA CTTAGGTACG GGGTGGAAGG CCGCTTGCAG ATGATTACGG SI 60 

GCAAGAAAAG TTACCTACGT TATTATTTGG ATCAATTTTT GAACAAAGAG TAATGTTCGT 8220 

CTTTTTAGAG TTAAATAATT TTTAAACTGT GAGAAAGATT CTTCTTGCAG TTTTTTCTTT 8280 

ACAATTTTTG AAAAACATCT ACTATTTATT CGGTTAAATT CTTGTGTTTT TTGGTTTTTT 8 340 

GTGGTAAAAT GTGCTCAAGT AATACGAAAG GCGAACTTTA AAATCTCAAA ACAATTGATC 8400 

TATTCGGGAA AAGCTAAAGA TATCTATACA ACTGAGGATG AAAATCTTAT TATTTCAACT 8460 

TACAAGGACC AGGCGACTGC TTTCAACGGT GTCAAGAAGG AGCAGATTGC AGGTAAGGGA 8520 

GTCTTGAATA ATCAGATCTC ATCTTTTATT TTTGAGAAAT TAAATGTGGC TGGTGTGGCG 8580 

ACTCACTTTG TGGAGAAACT TTCAGACACG GAACAACTCA ATAAAAAGGT TAAGATTATT 8640 

CCTTTGGAAG TCGTGCTCCG CAACTATACT GCTGGTTCCT TTTCAAAACG TTTTGGTGTG 8700 

GATGAGGGAA TCGCCTTGGA GACTCCGATT GTCGAATTTT ACTACAAAAA TGATGATTTG 8760 

GATGATCCAT TTATCAATGA TGAGCATGTG AAATTCCTAC AGATTGCGGG TGACCAGCAG 8820 

ATTGCCTACT TGAAGGAAGA AACGCGTCGT ATCAATGAAC TATTGAAAGT CTCGTTTGCT 8880 
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GAGATTGGGC TTAAATTGAT TGACTTTAAG CTAGAGTTCG GTTTTGACAA GGATGGCAAG 8940 

ATTATCTTGG CAGACGAATT TTCACCAGAT AACTGCCGCT TGTGGGACGC TGATGGCAAC 9000 

CACATGGATA AGGATGTTTT CCCTAGAGGA TTGGGAGAAC TAACCGACGT TTATGAGATT 9060 

GTTTGGGAAA AGTTGCAGGA ATTGAAATAA TCTGTTTGCA ACGGAAAACC TTCGTCTCTC 9120 

AACTAAAAGG ACTCAGGCTG AAAAGGTCCC CCAGACCTTT TCACTCTGTA GAGAACTAGG 9180 

TGAACTAACA GATGTTTACG AAATTGTCTG GGAAAAGTTG CAGGGTTTAA AATAACAACC 9240 

TCAAGGCTGT TTGGGAATAT TGCAAGAGCT GAAATAAAGG AATAAGAATT GATGGATAAA 9300 

CGTATTTTTG TTGAAAAAAA GGCTGATTTT CAGGTCAAGT CAGAGAGTTT GGTTAGAGAG 9360 

CTCCAGCACA ACTTGGGACT GTCAAGCTTG AAAAGTATTC GTATTGTGCA AGTATATGAT 9420 

GTATTTGACT TGGCTGAGGA CTTGTTTGCA CCTGCAGAGA AGCACATTTT CTCTGAGCAG 9480 

GTAACCGACC ATGTTTTAGA TGAAGTATCT GTGCAGGCGG ATCTTGCTAA CTATGCTTTC 9540 

TTTGCCATTG AAAGTCTGCC AGGGCAGTTT GACCAGCGTG CAGCTTCCTC ACAGGAAGCC 9600 

TTGCTTTTGT TGGGAAGTTC GAGTGACGTG ACAGTCAACA CAGCCCAACT TTACTTGGTG 9660 

AATAAAGATA TTGATCCGAC TGAGTTGGAA GCTGTCAAAA ACTACCTGCT CAATCCAGTT 9720 

GATTCTCGTT TCAAGGATAT CACGACAGGG ATTGCCAAGC ACGAGTTTTC AGAGTCAGAC 9780 

AAGACCATTC CCAAATTGAC TTTCTTTGAA AGCTATGCAG CAGAAGACTT TGCTCGCTAC 9840 

AAGGCCGAAC AAGGGATGGC CATCGAAGTG GATGATTTGC TCTTTATCCA AGACTACTTT 9900 

AAGTCAATCG GGCGCGTGCC AACTGAGACT GAACTCAAGG TTTTGGACAC TTACTGGTCT 9960 

GACCACTGCC GTCATACGAC TTTTGAGACA GAGTTGAAAC ACATCGACTT TTCAGCTTCT 10020 

AAATTTCAAA AGCAATTGCA GTCAACCTAT GACAAGTATA TTGCCATGCG CGAGGAATTA 10080 

GGTCGGTCTG AAAAACCACA AACCTTGATG GATATGGCGA CTATTTTCGG TCGTTATGAG 10140 

CGTCCTAATG GACGATTGGA TGATATGGAA GTCTCTGACG AAATCAATGC CTGCTCAGTT 10200 

GAAATTGAAG TGGACGTTGA TGGTGTCAAG GAACCTTGGC TCCTCATGTT TAAAAACGAA 10260 

ACCCACAACC ATCCAACAGA AATTGAGCCA TTTGGTGGAG CGGCTACCTG TATTGGTGGA 10320 

GCTATTCGTG ATCCGTTGTC AGGCCGTTCC TATGTTTACC AAGCCATGCG TATTTCAGGT 10380 

GCTGGTGATA TTACAGCACC GATTTCGGAA ACTCGCGCTG GGAAATTGCC ACAACAAGTC 10440 

ATTTCTAAAA CAGCAGCTCA TGGTTATTCT TCATATGGTA ACCAGATTGG GCTTGCAACA 10500 

ACCTACGTTC GTGAATACTT CCACCCAGGC TTTGTAGCTA AACCTATGGA ACTTGGTGCC 10560 

GTTGTTGGTG CGACTCCCAA GGGCAATGTT GTCCGTGAAA AACCTGAAGC AGGTGATGTG 10620 

ATCATCCTTC TCGGAGGCAA AACAGGTCGT GATGGTGTCG GTGGTGCGAC GGGCTCTTCT 10680 
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AAGGTTCAAA CAGTTGAGTC TGTAGAGACT GCTGGTGCTG AGGTTCAAAA AGGAAATGCC 10740 

ATCGAAGAAC GCAAGATTCA GCGCCTCTTC CGTAATGGCA ATGTCACTCG TCTGATCAAG 10800 

AAGTCCAATG ACTTTGGGGC AGGCCGTCTC TGTGTGGCTA TCGGTGAATT GGCAGACGGT 10860 

CTTGAAATCG ACCTCAACAA GGTGCCTCTT AAATACCAGG GCTTGAATGG TACAGAAATT 10920 

GCCATCTCTG AATCACAAGA ACGGATGGCG GTCCTGCTTC GTCCTGAAGA TGTGGATGCG 10980 

TTCGTTGCCG AATGTAACAA AGAAAATATT GATGCTGTTG TGGTGGCGAC AGTAACTGAA 11040 

AAACCAAATC TTGTCATGCA CTGGAATGGT GAGACAATCG TTGACTTGGA GCGTCGTTTC 11100 

CTTGACACCA ATGGTGTGCG CGTGGTTGTC GATGCCAAAG TTGTGGACAA GGATGTCAAA 11160 

CTCCCAGAAG AGCGTCAAAC ATCTGCTGAA ACACTGGAAT CAGATACCCT TACGGTTCTA 11220 

TCTGACCTCA ACCATGCAAG TCAAAAAGGA TTACAGACTA TCTTTGACTG CTCTGTTGGA 11280 

CGCTCAACGG TTAATCACCC ACTTGGTGGT CCTTACCAAC TCACACCAAC TGAGGCATCT 11340 

GTGCAGAAAT TGCCAGTTCA ACACGGTGTG ACTCATACTG CGTCCGTCAT TGCTCAAGGT 11400 

TTCAACCCAT ATGTAGCTGA ATGGTCTCCA TACCACGGTG CTGCTTATGC GGTTATCGAA 11460 

GCAACTGCTC GTTTGGTGGC TGCTGGTGCC AACTGGTTCA AGGCTCGTTT CTCTTACCAA 11520 

GAGTATTTCG AGCGTATGGA TAAACAAGCA GAGCGTTTCG GTCAGCCAGT AGCTGCTCTT 11580 

CTAGGTTCTA TTGAAGCACA AATTCAGCTT GGCTTGCCAT CTATCGGTGG TAAGGACTCC 11640 

ATGTCTCGTA CCTTTGAAGA ATTGACCGTT CCGCCAACCT TGGCTGCCTT TGGGGTGACG 11700 

ACGGCAGATA GCCGTAAGGT GCTCTCTCCA GAATTTAAAG CTGTTGGGGA AAATATCTAC 117 60 

TACATCCCAG GTCAAGCCCT CTCTGCAGAG ATTGATTTTG ACTTGATTAA GAAAAATTTT 11820 

GCTCAGTTTG AAGCCATCCA AGCTGACCAT AAAGTGACAT CTGCATCAGC TGTCAAATAC 11880 

GCTCGTGTAG TTGAAAGTTT GGCTCTTGCT ACCTTTGGAA ACTATATTGG TGCAGAGGTG 11940 

ACCTTGCCTG AACTTGAAAC AGCTTTGACA GCTCAATTAG GCGGCTTTCT CTTCACATCT 12000 

CCTGAAGAAA TTGCTGGAGT AGAGAAGGTT GGACAAACGA AAGCAGACTT TACACTGACT 12060 

GTCAACGGTG TGAAGCTAGA TGGACACAAG CTTGACAGTG CATTTCAAGG GACATTGGAA 12120 

GAACTTTACC CAACAGAATT TACCCAAGCG AAAGAACTAG AAGAAGTACC AGCTGTGGCA 12180 

TCAGATGTTG TGATTAAAGC CAAAGAAAAG GTTGAAAAAC CTGTGGTTTA CATCCCAGTC 12240 

TTTCCAGGAA CCAACTCAGA ATATGATTCA GCTAAGGCCT TCGAAAAAGA AGGTGCAGAG 12300 

GTCAATTTGG TGCCATTCGT GACCTTGAAT GAAGAAGCTA TTGTCAAGTC AGTTGAAACT 12360 

ATGGTTGACA ATATCGACAA GACTAATATT CTCTTCTTTG CTGGTGGATT CTCGGCTGCG 12420 
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GATGAACCAG 


ATGGTTCAGC 


TAAGTTTATC GTCAATATCC 


TGCTTAATGA 


AAAAGTGCGT 


12480 


GTGGCTATTG 


ATAGCTTTAT 


CGCCCGTGGT GGTTTGATTA 


TCGGTATTTG 


TAATGGATTC 


12540 


CAAGCCTPAG 


TCAAATCGGG 


TCTCCTACCC TACGGAAACT 


TTGAAGCTGC 


TAACAGTACT 


12600 


AGCCCAACCC 


TCTTCTACAA 


TGATGCCAAC CAACACGTGG 


CCAAGATGGT 


GGAAACTCGC 


12660 


ATTGCCAATA 


CCAACTCACC 


ATGGTTGGTT GGTGTGCAAG 


TGGGCGATAT 


CCACGCTATT 


12720 


CCTGTTTCGC 


ACGGTGAAGG 


GAAGTTTGTC GTGACGGCTG 


AGGAATTTGC 


AGAGCTCCGT 


12780 


GACAATGGAC 


AAATTTTCAG 


CCAATACGTT GACTTTAACG 


GTAAACCAAG 


TATGGATTCT 


12840 


AAGTACAATC 


CGAATGGTTC 


TCTCCATGCC ATCGAAGGAA 


TTACCAGCAA 


GAATGGTCAA 


12900 


ATCATCGGTA 


AGATGGGCCA 


CTCAGAACGT TATGAGGATG 


GTCTTTTCCA 


AAATATCCCA 


12960 


GGCAATAAAG 


ACCAACACCT 


GTTCGCATCA GCGGTTAAAC 


ATTTCACTGG 


AAAATAAGAC 


13020 


TTACAGATTT 


TCTAATAGAT 


AGTATCAGTA ATGTAAAAGT 


CATGTAAATC 


TAGCTCTTGA 


13080 


TGATTACAAA 


TGAAAATTAG 


GTATAAAAAA TGACATACGA 


AGTAAAATCT 


CTTAATGAAG 


13140 


AATGTGGTGT 


TTTCGGTATT 


TGGGGACATC CAGATGCTGC 


TAAGTTGACC 


TATTTTGGAC 


13200 


TCCACAGTCT 


TCAACACCGT 


GGTCAGGAGG GGGCAGGAAT 


CCTCTCCAAT 


GATCAAGGAC 


13260 


AACTGAAGCG 


CCATCGTGAC 


ATGGGGCTTT TATCAGAAGT 


TTTCAGAAAT 


CCAGCTAATT 


13320 


TGGATAAATT 


GACAGGAGCT 


GGTGCGATTG GGCATGTGCG 


TTATGCGACT 


GCTGGCGAAG 


13380 


CTTCTGTAGA 


TAACATCCAG 


CCCTTCCTCT TCCGTTTTCA 


CGATATGCAG 


TTTGGTTTGG 


13440 


CTCATAATGG 


AAATCTGACC 


AATGCAGCCT CTCTCAAGAA 


AGAACTGGAA 


CAAAGAGGAG 


13500 


CAATTTTCAG 


CGCGACTTCG 


GACTCTGAAA TCTTGGCTCA 


CCTCATTCGT 


CGCAGTCATA 


13560 


ATCCTAGCCT 


GATGGGCAAA 


ATCAAGGAAG CGCTCAGCCT 


TGTCAAAGGT 


GGTTTTGCCT 


13620 


ATATCTTGCT 


GTTTGAGGAC 


AAGTTGATTG CGGCTCTTGA 


CCCAAATGGA 


TTCCGACCGC 


13680 


TTTCGATTGG 


TAAAATGGCT 


AATGGAGCAG TTGTTGTATC 


TTCTGAAACC 


TGTGCTTTTG 


13740 


AGGTCATTGG 


TGCCGAGTGG 


ATTCGTGATT TGAAGCCAGG 


TGAGATTGTG 


ATCATTGATG 


13800 


ACGAGGGCAT 


TCAGTATGAC 


AGCTATACAG ATGATACCCA 


GTTGGCGGTT 


TGTTCTATGG 


13860 


AGTATATCTA 


CTTTGCTCGC 


CCTGATTCTA ATATCCACGG 


TGTCAATGTC 


CATACGGCAC 


13920 


GTAAGAGAAT 


GGGAGCGCAA TTGGCGCGAG AATTTAAGCA TGACGCAGAT 


ATTGTAGTTG 


13980 


GTGTGCCCAA 


TTCTTCCCTA 


AGCGCGGCTA TGGGATTTGC 


GGAAGAATCA 


GGCTTACCAA 


14040 


ATGAAATGGG TCTGATCAAA AACCAATACA CCCAGCGAAC 


TTTTATCCAA 


CCGACTCAAG 


14100 


AATTGCGGGA 


GCAAGGAGTG 


CGGATGAAAC TGTCTGCTGT 


TTCGGGTGTT 


GTCAAAGGCA 


14160 


AACGTGTGGT 


CATGGTGGAT 


GATTCCATTG TACGTGGAAC 


AACCTCTCGT 


CGTATCGTTC 


14220 
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AGCTCTTGAA AGAAGCGGGT GCGACTGAGG TTCACGTTGC CATTGGAAGT CCTGCACTAG 14280 

CGTATCCATG TTTCTACGGG ATTGATATCC AGACCCGTCA GGAGCTGATT GCAGCCAATC 14340 

ATACGGTCGA AGAAACTCGC CAAATCATTG GTGCGGACAG TCTGACTTAT CTTTCAATTG 14400 

ATGGCTTGAT TGAGTCGATT GGTATCGAAA CAGATGCGCC GAACGGTGGT CTCTGTGTCG 14460 

CTTACTTTGA CGGTGACTAC CCAACGCCTC TTTATGACTA CGAAGAAGAC TATCGTAGAA 14520 

GTTTGGAAGA AAAGACCAGT TTTTACAAGT AGGCGACAGA TTCTCCATTA AAGAAAAGGA 14580 

AAAAATAAAT GACAAATAAA AATGCATATG CCTCACGTCT CACTACTGAC TAAAGGCTTA 14640 

AGCATTTAGT CAGTAGACGC TTTGTCCTAT AGGATCAAAG CTAGAGCCCT GACTAGTATT 14700 

TTTAGATAAA AAGATGGTTT ATCTAAAAAT ACGTCGCAGT CTTTCTCAAA AAAAGAAAAG 14760 

GAAAAATAAA ATGGCAAATA AAAATGCGTA CGCTCAATCT GGTGTGGATG TTGAAGCGCG 14820 

TTATGAAGTT GTTGAACGGA TTAAAAAGCA CGTGGCCCGT ACGGAGCGTG CAGGTGTCAT 14880 

GGGAGCTCTT GGTGGCTTTG GTGGTATGTT TGACCTTTCC AAGACTGGGG TTAAAGAACC 14940 

CGTCTTGATT TCAGGGACTG ACGGTGTCGG AACCAAGCTC ATGTTGGCTA TCAAGTACGA 15000 

CAAGCACGAT ACCATCGGGC AGGACTGTGT GGCCATGTGT GTCAACGACA TCATTGCTGC 15060 

AGGTGCGGAA CCCCTCTATT TTCTCGACTA CGTAGCGACA GGGAAGAATG AACCAGCTAA 15120 

GCTAGAACAA GTGGTTGCTG GTGTGGCAGA AGGTTGTGTG CAGGCTGGTG CTGCCCTCAT 15180 

CGGTGGGGAA ACGGCTGAAA TGCCGGGCAT GTACGGCGAA GACGACTATG ACTTGGCTGG 15240 

TTTTGCGGTC GGTGTGGCTG AAAAATCTCA AATCATTGAC GGTTCAAAGG TGGTAGAGGG 15300 

AGATGTTCTT CTCGGACTTG CTTCAAGTGG GATTCACTCA AATGGTTACT CTTTGGTTCG 15360 

TCGTGTCTTT GCGGATTACA CAGGTGAGGA AGTCCTACCA GAATTGGAAG GCAAGAAACT 15420 

TAAGGAAGTT CTACTTGAGC CGACTCGTAT CTATGTCAAG GCTGTCTTGC CGCTCATCAA 15480 

AGAAGAGTTG GTCAACGGCA TTGCCCACAT CACAGGTGGT GGCTTTATCG AAAATGTCCC 15540 

TCGTATGTTT GCAGATGACC TAGCTGCTGA AATTGATGAA AGTAAAGTTC CAGTGCTTCC 15600 

AATTTTCAAA ACCCTTGAAA AATACGGTCA GATTAAACAC GAAGAAATGT TTGAAATCTT 15660 

CAATATGGGT GTGGGACTTA TGTTGGCGGT CAGCCCTGAA AATGTAGAGC GTGTAAAACA 15720 

ATTGTTGGAT GAAGCAGTCT ATGAAATTGG TCGCATCGTC AAGAAAGAAA ACGAAAGTGT 15780 

C ATT AT C AAA TGAAAAAAAT AGCG G TTTTT GCCTCTGGTA ATGGCTCAAA TTTTCAGGTG 15840 

ATTGCCGAAG AATTTCCAGT GGAGTTTGTC TTTTCAGACC ATCGTGATGC CTATGTGCTT 15900 

GAGCGTGCAA AGCAGCTCGG CGTTCTGTCC TATGCTTTTG AACTCAAGGA GTTTGAGAGC 15960 
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AAGGCAGACT ACGAAGCAGC CCTTGTCGAA CTCTTGGAAC AACACCAGAT TGACTTGGTT 16020 

TGCCTAGCAG CCTACATGAA AATCGTTGGA CCAACCTTAT TGTCGCCTTA TGAAGGTCGG 16080 

ATTGTCAACA TTCATCCAGC CTACTTGCCA GAATTTCCAG GAGCTCATGG GATTGAGGAT 16140 

GCTTGGAATG CTGGCGTGGG TCAGTCTGGT GTGACCATTC ACTGGGTGGA TTCGGGTGTG 16200 

GATACAGGCC AGGTCATCAA ACAGGTTCGT CTGCCACGAC TAGCTGATGA TACCATTGAC 16260 

AGATTTGAAG CTCGCATCCA TGAAGCAGAG TACAGGCTGT ATCCGGAAGT AGTGAAGGCT 16320 

CTATTTACAG ATTGACTTTT TGATGATTCA TATGATATCT TTGATTTTAA ATTGGAGTCA 16380 

GTGTTTGTTG AAGACGGCTT CAAACGGAGG TATTTGTAAT GTTAGAATCT AAAAAAACAA 16440 

CTCGATATGT ATTTTATGTC TATCTGATGT TATTAACTTG GGGAATCTTA TTTAAGTTTG 16500 

AAACAAATCC TGAATTTATA GCATTTTTCT TAGCTCCAAG CTATATCAAT TGGATTCCAT 16560 

TTTCAGAACC ACTAATAGTC GATGGAAAAA TTGTTTTTGC TGAAATGTTA TTTAATCTGA 16620 

TTTTCTTTAT TCCATTAGGT GTTTGTTTCC CTTTGATAAA AACTAATTTA TCTAGTTTAA 16680 

GAATAGTCGG GACAGGTTTC TTGATTAGTT TATTGTTTGA GTGCTTACAG TATATTTTAG 16740 

CAATAGGTAT AACAGATATA ACGGATTTGA CTTTAAATAC GCTAGGTGTC TGTCTAGCCT 16800 

TACTGATTTA TCAAATTTTT ATAAGAGTGT TCAAATCACA GACTAGAAAA TGGATCAATA 16860 

TCTTAGGTAT GCTTAGCCTT GGTTTTGCTT ATCTTGTTTT ACTGTTACTG CATTTACTTA 16920 

GTGTTTAACT AATGATTAAA AAGGAGAATA TAATGACTAA ACGCGTCTTA ATCAGCCTCT 16980 

CAGACAAAGC GGGCATTGTT GAATTTGCCC AAGAACTCAA AAAACTTGGT TGGGAGATTA 17040 

TCTCAACAGG TGGAACTAAG GTTGCCCTTG ATAATGCTGG GGTGGATACC ATTGCTATCG 17100 

ATGATGTGAC TGGTTTCCCA GAAATGATGG ACGGTCGTGT GAAGACCCTC CACCCAAATA 17160 

TCCACGGAGG GCTTCTCGCT CGTCGTGACT TGGATAGCCA CTTGGAAGCG GCTAAGGACA 17220 

ACAAGATTGA GCTCATTGAC CTTGTGGTGG TCAACCTTTA CCCATTTAAG GAAACTATCC 17280 

TTAAACCAGA TGTGACTTAT GCTGATGCAG TTGAAAATAT CGATATTGGT GGGCCATCTA 17340 

TGCTTCGTTC AGCAGCGAAA AATCATGCCA GTGTTACAGT TGTGGTAGAT CCTGCTGACT .7400 

ACGCTGTGGT TTTGGATGAA TTGGCAGCAA ACGGCGAAAC CTCTTATGAA ACTCGCCAAC 17460 

GTTTAGCAGC CAAAGTATTT CGTCACACAG CGGCTTATGA CGCCTTGATT GCAGAATACT 17520 

TCACAGCTCA AGTGGGTGAA AGCAAGCCTG AAAAACTCAC TTTGACTTAT GACCTCAAGC 17580 

AACCAATGCG TTACGGTGAG AATCCTCAAC AAGACGCGGA CTTTTACCAG AAAGCTTTGC 17640 

CT AC AG ACT A CTCCATTGCT TCAGCCAAAC AGCTCAACGG GAAAGAATTG TCATTTAATA 17700 

ATATCCGTGA TGCAGATGCT GCTATCCGTA TCATCCGTGA CTTCAAAGAT AGTCCAACCG 17760 
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TTGTGGCTCT CAAACACATG AATCCATGTG GAATTGGTCA AGCTGATGAC ATCGAGACTG 
CTTGGGACTA CGCTTATGAG TCTGACCCAG TGTCTATCTT TGGTGGGATT GTCGTCCTCA 
ACCGTGAGGT GGATGCTGCG ACAGCTGAGA ACATGCACGG CGTTTTCCTC GAAATCATCA 
TTGCACCAAG CTATACGGAT GAAGCGCTAG CCATTTTGAT CAATAAAAAG AAAAACTTGC 
GTATCCTTGC CTTGCCATTT AATGCTCAAC AGGCTAGCGA AGTGGAAGCA GAATACACAG 
GTGTAGTCGG TGGACTTCTC GTGCAAAATC AAGACGTGGT CAAGGAAAGC CCAGCTGACT 
GGCAAGTGGT GACTAAACGT CAGCCAACTG AGACAGAAGC GACTGCTCTT GAGTTCGCTT 
GGAAGGCTAT CAAGTACGTC AAATCAAATG GTATTATCGT GACCAACGAC CACATGACAC 
TTGGTGTTGG TCCAGGTCAA ACCAACCGTG TGGCTTCTGT TCGCCTTGCC ATTGACCAAG 
CCAAAGATCG TCTGGACGGG GCGGTCCTTG CTTCAGATGC CTTCTTCCCA TTTGCGGATA 
ACGTGGAAGA AATCGCCAAA GCAGGAATTA AGGCCATCAT CCAGCCCGGT GGCTCTGTCC 



GTGACCAAGA 
TGAGACATTT 
CTTAAAATAC 
ATTCGCAAAA 
CCGATTGCTA 
AATGATGGGA 
AAATTGATTG 
GCCCTTGCTG 
ACTAGGGCTG 
TACGGCGTTC 
ATCGAAAAGC 
GTCGTCGTTG 
AATAAATTTG 
TTTTCACTCT 
CACAAACGTG 
CCAGTCCCAC 
GTTCTAGAAG 
ATCCTGACAG 



ATCCATCGAA GCAGCGGATA AATACGGCTT CACTATGGTC 



TAGACATTAA GAAGATAAAA 
TAACTGAAAC AAGATTAAAA 
GAGGTTGAGG AATGAAACTG 
AAAAGTTACT TGAATCAAAA 
TGACTCTGGA TGGTTTGGAA 



GGGAAGAAAA CAGTTTCTTT 
CGAACTTTTT TGATATAATG 
CTTGTTGTCG GTTCTGGTGG 
GACGTGGAAA AAGTCTTTGT 
TTGGTAAATA TCTCTATTTC 



ACTTCGCAAA 
CTGGTATCGT 
CAGCGGAGCT 
CGACAGCAAC 
ATGGTGCGCC 
CTGAGACGGT 
GTGACTCAGG 
TTGCCTTTGT 



GACCAATGAT GTTGCTTGGA 
GGATGATTTT AACCAAGCTG 
GGAGTGGTCC AAGGATTTCG 
ATATGGCACA TTTTCAGATT 
TATCGTAGTC AAGGCGGATG 
TGAGCAAGCG GTCGAAGCCG 
TGCGCGCGTG GTTATTGAGG 
CAATGGTGAT AAGTTCTACA 



CCTATGATGG CGACAAAGGG CCTAACACGG 
ACTTACCACA GAGTGTAGTT GATACAGCGG 



CCTTTATCGG 

GACTTAAGGC 

CCAAGGAAAT 

TCGAGGAAGC 

GCTTGGCACT 

CTCATGAGAT 

AATTCCTTGA 

TCATGCCAAC 

GTCCTATGGC 

TTGACACCAT 



GGGTGATTAA AGAAGGTCGC CCTTATCTGG GAGTTCTTTA 
CTGATGGACC GAAAGTCATT GAGTTCAACG CTCGGTTCGG 



TTTACAGCTG 
CCTTTTTTCG 
TTGGTAAATA 
TCGTGAGCAT 
AGCTCCTGGG 
CGAACATTAT 
TCCAGATGAT 
CTTTGGTCCG 
CATGGTCAAA 
CAAAGCCTAT 
TGGGAAGGGT 
GCTTTTGGAC 
AGGAGAGGAA 
GGCTCAGGAC 
TGCCTATGCG 
TGTCAAGCCA 
CGCAGGGCTT 
AGATCCAGAA 
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ACTCAGATTA 


TCTTGCCTCG 


CTTGACCTCT 


GACTTTGCTC 


AAAATATCAC 


AGATATCCTG 


19560 


GATAGCAAGG 


AGCCAAATAT 


CATGTGGACG 


GACAAGGGTG 


TGACTCTGGG 


TGTGGTTGTC 


19620 


GCATCCAAGG 


GCTACCCGCT 


AGACTATGAA AGGGGCGTTG 


AGTTGCCAGC 


CAAGACAGAA 


19680 


GGCGATGTCA 


TCACCTACTA 


TGCAGGGGCT 


AAGTTTGCGG 


AAAATAGCAG 


AGCACTGCTC 


19740 


TCAAACGGCG 


GACGAGTTTA 


TATGCTCGTT ACCACAGCAG 


ATACCGTCAA 


AGAAGCCCAA 


19800 


GCCAGCATAT 


ACCAAGAACT 


ATACCAACAA 


AAAATAGAAG 


GACTCTTCTA 


CCGAACAGAT 


19860 


ATCGGAAGCA 


AGGCAATTAA 


GTAAAGATAT 


AAGAATAACG 


CGCCGTAGTC 


GCCAAACACG 


19920 


ATAATGGTCG 


TCGTGGTGAA 


AAGACCAGAA 


CAGTGAATGT 


TCTGGTCAGG 


GGGAAACTTG 


19980 


GAGACCTTAG 


GCTCAAAGTT 


TAGGAATGAA 


ACCGAAGGTT 


TGCTTCCGCC 


TCCATCACCT 


20040 


AAGACCATTA 


TCAAAAAGAA 


AAATAAAAAT 


TCACAAAATA 


CGTTAATGAT 


CGTATGGTTT 


20100 


GCGAGCGTTA 


GCGAGCTAAT 


ATAGAACAAT 


CACCGCCGTT 


GTGAAAGAAC 


GATTGGATGA 


20160 


TAATCCAATC 


GTTCAGGGAA 


ATTGGAAGAC 


CTTGGGTTTC 


CAATTTAGGC 


ATGAGACACC 


20220 


TTTGGTGGCT 


GCTGCCGTCC 


CTCACAAGCT 


AAGGTGATTG 


TTGAAAAAGA 


GGAAAAAGGA 


20280 


GAAGAAATGA 


AACCAGTAAT 


TTCCATCATC 


ATGGGCTCAA 


AATCCGACTG 


GGCAACCATG 


20340 


CAAAAAACAG 


CAGAAGTCCT 


AGACCGCTTC 


GGTGTAGCCT 


ACGAAAAGAA 


AGTTGTTTCC 


20400 


GCACACCGTA 


CACCAGACCT 


CATGTTCAAA 


CATGCAGAAG 


AAGCCCGTAG 


TCGTGGCATC 


20460 


AAGATCATCA 


TCGCAGGTGC 


TGGTGGCGCA 


GCGCATTTGC 


CAGGCATGGT 


AGCTGCCAAA 


20520 


ACAACCCTTC 


CAGTCATTGG 


TGTGCCAGTC 


AAGTCTCGTG 


CTCTTAGTGG 


AGTGGATTCA 


20580 


CTCTATTCTA 


TCGTTCAGAT 


GCCGGGTGGG 


GTGCCTGTTG 


CGACCATGGC 


TATCGGTGAA 


20640 


GCTGGAGCGA 


CTAACGCAGC 


TCTCTTTGCC 


CTCCGTCTCC 


TCTCTGTAGA 


AGATAAGTCC 


20700 


ATTGCGGATG 


CACTTGCCAA 


CTTTGCTGAA 


GAACAAGGAA 


AAATCGCAGA 


GGAGTCGTCA 


20760 


AATGAGCTCA 


TCTAAAACAA 


TCGGAATTAT 


CGGTGGCGGT 


CAACTGGGTC 


AGATGATGGC 


20820 


A I I W A VV A 


ATCTACATGG 


GCCACAAGGT 


TATCGCGCTG 


GATCCTGCGG 


CGGATTGCCC 


20880 


GGTCTCTCGT 


GTGGCGGAAA 


TCATTGTGGC 


ACCTTATAAC 


GATGTAGACG 


CCCTCCGTCA 


20940 


GTTGGCAGAC 


CGTTGCGATG 


TCCTCACTTA 


TGAGTTTGAA 


AATGTCGACG 


CTGACGGTTT 


21000 


GGATGCCGTT 


ATCAAGGATG 


GACAACTCCC 


TCAAGGAACA 


GATCTGCTCC 


GCATTTCGCA 


21060 


AAATCGTATT 


TTTGAAAAGG 


ACTmTGTC 


AAACAAGGCT 


CAAGTCACTG 


TGGCACCCTA 


21120 


CAAGGTCGTG 


ACTTCTAGCC 


TAGACTTGGC 


AGATATCGAC 


TTGTCGAAAA 


ACTATGTCCT 


21180 


CAAGACTGCG 


ACTGGTGGCT 


ACGATGGTCA 


TGGACAAAAG 


GTTATTCGTT 


CAGAAGCAGA 


21240 


CTTGGAAGCA 


GCCTATGCGC 


TAGCAGACTC 


AGCAGACTGC 


GTCTTGGAAG 


AATTTGTCAA 


21300 
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CTTTGACCTT GAGATTTCTG 7CATCGTGTC AGGAAATGGC AAGGAGGTGA CGTTTTTCCC 21360 

AGTTCAGGAA AATATCCACC GCAACAATAT CCTGTCTAAG ACCATCCTAC CAGCCCGCAT 21420 

TTCTGAAAGT CTAGTAGACA AGGCTAAAGC TATGGCAGTG CGAATCGCAG AACAACTCAA 214 80 

CTTGTCTGGA ACTCTCTGTG TGGAAATGTT TGCGACAGCT GATGACATCA TTGTCAATGA 21540 

AATCGCCCCA CGACCACATA ACTCTGGGCA CTATTCTATT GAAGCCTGTG ATTTCTCTCA 21600 

GTTTGACACC CATATTCTGG GTGTTCTCGG AGCACCATTA CCAGTCATCA AACTCCATGC 21660 

GCCAGCCGTT ATGCTTAATG TCCTCGGTCA GCATGTCGAG GCTGCTGAAA AATATGTCAC 21720 

AGAAAATCCA AGCGCCCACC TCCACATGTA TGGTAAAATA GAAGCAAAGC ATAATCGTAA 21780 

GATGGGACAT GTGACTTTGT TTAGTGATCT GCCGGATAGT GTGGAAGAGT TTCGGCAAGC 21840 

GATTGATTTT TAGGACAAGT CTATGATACA AATTATCGTT AATACATTTA TTGAAAAGTA 21900 

TAAGACTGGA GCAGTTGTTG AAGTGTTGTA TGCCAGTGCT GACCAAGATA AGGTACAAGC 219 60 

TAAATATGAA GAACTAGCTG CACAATACCC CGAAAATTAT TTAGCTATCT ATAATGTACC 22020 

GCTGGATACG GATTTGAATA CACTAGATCA TTACCCGTCT GTGTTTATTG GAAAAGAGGA 22080 

GTTTGAGTAG AAATCTTGGT TTACCTAGAT AGCTTATTCC CAACAGCTTA AGAAGAAAGG 22140 

AAAAATTAAC ACATGATCAA CCGTTACTCT CGCCCTGAGA TGGCGAATAT TTGGAGTGAA 22200 

GAAAATAAAT ACCGTGCTTG GCTTGAGGTG GAAATCCTCT CTGACGAGGC ATGGGCTGAG 222 60 

TTGGGGGAAA TCCCTAAGGA AGATGTGGCT TTGATTCGCA AGAAGGCGGA CTTTGACATC 22320 

GACCGTATTT TGGAAATTGA GCAGGAGACG CCCCACGATG TGGTGGCTTT CACGCGTGCG 22380 

CTTTCTGAGA CTCTTGGTGA AGAGCGCAAG TGGGTTCACT ATGGGTTAAC TTCTACTGAC 22440 

CTGCTCGATA CTGCTTATGG TTACCTCTAC AAGCACGCCA ACGACATCAT CCGTCGTGAC 22500 

CTTGAAAACT TCACTAATAT CATCGCTGAC AACGCCAAGG AGCACAAGTT CACCATCATG 22560 

ATGGGGCGTA CTCATGGTGT GCACGCTGAG CCGACAACCT TTGGTCTTAA ATTAGCAACT 22620 

TGGTACAGCG AAATGAAACG CAATATCGAG CGCTTCGAGC ATGCGGCTGC TGCTCTAGAA 22680 

GCTGGTAAGA TTTCTGGTGC GGTTGGGAAC TTTGCCAATA TCCCACCATT TGTAGAGGAG 22740 

TATGTCTGCG ATAAACTTGG CATCCGTGCC CAAGAAATCT CTACACAAGT CCTTCCTCGT 2 2800 

GACCTTCACG CTGAGTACTT TGCGGTTCTT GCCAGCATTG CGACTTCAAT CGAACGTATG 22860 

GCGACTGAGA TTCGTGGTCT ACAAAAATCT GAGCAACGCG AAGTAGAAGA GTTCTTTGCT 22920 

AAAGGGCAAA AAGGGTCTTC AGCAATGCCT CACAAACGCA ACCCAATCGG TTCTGAAAAT 22980 

ATGACTGGTC TGGCGCGTGT CATTCGTGGT CACATGATTA CGGCTTATGA AAACGTCGCT 23040 
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CTCTGGCATG AACGCGATAT TTCTCACTCA TCAGCTGAGC GTATCATCAC ACCAGATACG 23100 

ACCATTTTGA TTGACTACAT GCTCAACCGT TTTGGAAATA TCGTCAAGAA CTTGACAGTC 23160 

TTCCCAGAAA ATATGATCCG AAACATGAAC TCGACTTTTG GTCTTATCTT TAGCCAACGG 23220 

GCTATGTTGA CATTGATTGA AAAAGGCATG ACCCGTGAGC AAGCCTATGA CTTGGTGCAA 23280 

CAAAAACAGC CTACTCTTGG GACAACCAAG TAGACTTTAA ACCACTTCTT GAGGCAGATT 23340 

CAGAAGTAAC ATCACGTCTC ACACAAGAAG AAATCGATGA AATCTTCAAC CCAGTTTATT 23400 

ACACCAAACG AGTGGATGAT ATCTTTGAAC GTCTTGCACT AGGTGATTAA TTAAAAAATA 23460 

AACAGCGAGC TTCAATCTCG CTGTTTATTT TTTATCGAAA AGACTTAGTC TTCTTTTCTT 23520 

TTAGTGAGTC CATAGGCTGC TAGTGTGGAC ATGAGTCCTG CGACTACTAG TCCTGCAGAA 23530 

TCGTGAGTTC CTGTTTCAGG AA G T TT TTTC TCTGTTACCA CAGGAGCTGG ATCTTGAGGA 23640 

AGAACTTTCC TTTCCTCAGC AGGAGCAGTT GATGGAGCTG GTTGGCTTGG GATTTCTAGT 23700 

TTTGGTTTTT CTTCAGCAAT AGCGGCTTGT CCGTTTTCAT CGCCTACATG TGTTACCATA 23760 

GTTCCGACTT CGACTATTTG AGTAACGGCT TCCTGTGCTA CG AC ACT ATT TACAAGTGTT 23820 

TTCACTTCCT TACCATCGGC AGAAGTGCTC ACAGAGTAGA AGTTGCTACC ATGTCCATTG 23880 

ACGCCCTTAG TAATGACTTG TGTTTTTCCT TTGAGTAAGA GTGGATTTTC ACAAGTCACT 23940 

GTGGTAAATG GAATTTCTTC TTCTTGGATA TCCAGTCTAG GTTTTACCTC AGTAGTTGGT 24000 

GCAAGACCAC TTTCATCACC CTTGTGAGTT ACAGGAGCGC CAACTTCAAC CACTTGGTTT 24060 

ATAACTTCTT TGGTTACCTG GCTATCAAGG ACTGTTTCTG TTGTTTTTCC ATTTTCAGTG 24120 

AGTACAGAGA TGTAATGAGT TCGTTCACCT TTGACTCCTG CTGTGATAAT ATTTTCCTGA 24180 

CCGGCTGGGA GGTTAGGATT TTCTTTCTTG ATAACTTCAA ATGGAATTTC TTCAGTTCTT 24 240 

GTGATGAGTT CTGGTCTGGT TTCAACATTG GCAGCCACTT CATTTTCATC TAGGCTTCCT 24 300 

GAATGAGTTA CAGCTGGTTT GAGGCCTTGA AGAGCGGCTT TTAGGTTGGC TACAAGCGTG 24360 

TCAAGCTCAG CTTGTTTATT ACGGTTGAGG TTGTAATTTA GAGCTGTTTT AGCTGCGTCA 24420 

AGGGCCTCAA GACTTTCTTT ACTATATCCT TCTAAGTTTG TAGGAATTTT AGCTAATTCT 2 4480 

TCGCGGAGAG CATTATAATT AGCACGAAAG TAGTCTTTGT TGTGGTCTGC AAAGGCAGTC 24540 

ATGAGTTCAA AGATTTCCTC TTCCTTGTAT TCAGCGCTTG GTCTATCTCC CCAGATTGAA 24600 

AGCATACTTC CGACTGTTGG AAGATCTACT TCAGGATATT TGGTAGAAGC TAGTTGATTG 24660 

AATGGTGTTT TTCCAGTATT CTCAATAGCT TTCTTGAGGA AACCACCACC ATCTTCTGGT 24720 

TTTTGACCAA GAATGTAGTA CCAGTCACCG TTGGTATTCA AGAATTTATA GCCTTTGCTT 24780 

GCTAGGTATT GAGGTGATGC GAGGTTATAT CCCCACCAGC CTTTAGACCA GTAAGAAATC 24840 
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AAGACATCTT TGTCAAACTG AACATCCTCC TTGTCTTCAT AGTAGAAGCC ATCGTTGAAG 24 900 

GCCATTGGTT GAAGCCCTCT TTCTTTGGCC ATAGCTGCGA GGGTGTTGGC ATATTCGGCA 24 960 

AATTTGCCAT AGAGTTGATA CCACTTGAGG TAGTACCAGC CTTGGGCACT AGTCGCATCG 2 5020 

TTGGCGTATT CGTCAGTACC AAAGTTGAAA ATCTTTGTTT TACCTGCAAA GAAGTCCATG 25080 

TATTTACCGA TGAGGGCTTT TACAAAGTTC ATCGCTTCTT CGTTTTTCAA GTCCATAGTT 25140 

GTTTTTGAAA CTTTATCAAA GTGGGCTTGA GGATTTTTAA TACCTAATTT TTCCATGGCA 25200 

ACCAGCATAG CATCCATGTG ACCTGGACTG TTAATAGCTG CGATGAGACC GATGTCCTTA 2 5260 

GATTTAGCGT ATTCAATTAG CTCTGTTACT TCTGCCTGTG TTAGTGCAGT ACCGTTTGGA 25320 

TCGTCGTAGT AAGCTTTAGT TCCTTCGATA ATAGCTTTTT TAACGTCATC ACTAGCATAG 25380 

GTTTTTCCGT TGGCAGTAAT GGTCATATCA TCGAGTAGAA AGCGAAGTCC GTCATTTCCT 25440 

AGAAGGAGAT GCACATCAGA ATATCCGAGC TCACTGGCCT TGTCTACGAT CCGTTTGAGC 2550O 

TGGTTCAGAG TAAAGTATTT GCGTCCAGCA TCGATTGAGA TTACCTTGTT TTTGGCAAGT 25560 

TTTTCAACCT CACGTTTAGC TTCTTCTTCT TTTTGAGCTT CAGGCGTGAG GGTCAAGTTG 25620 

TTGACAGTTT CTTGAAGTTT AGCAATGGCT TGATCAATCG TATCTTGTTG GGCACGGCTA 25680 

AGGTTGCTAT CGAGAGAGCG AATAGCTTTT TCAGCTTCTT TTACGGCCGT GACGCTTTCT 25740 

GCAGTATAAC GGTTCAGGTC TTTTGGTACC TCGTTAAGTG CTTGCTCTGC AGATTCATAA 25800 

TCAGCTGCGA AGTATTCAGC GTTGGCATTT GCAAAATGAC GCATGAGTTT GAAGAGGCGT 25860 

GATGGTGAAT AACGTGCAGA TGGAGTGTCA GCCCAAGCAG CTACCATACC ACCGATGATT 25920 

GGGATATCAG CTCCTTCTGT TTTTGGTACA GAAGTGATTG GTGTGTTTTT AATACCATTG 25980 

AGCCCCTGAT CGAGATTGTA CCAGCCTTGG CCATCAGCGT TTCGTCCAAG AACGTAGTAC 26040 

CAAGCATCAT TGGTATTAAG GATTTGGTGA CCTTTTTCAG CTAGTAGTTT AGAAGAAGCG 26100 

ACATCGTAGC CTCCCCAACC ACCAGTCCAC ATAGAAACGA TGATGTCTTT GTCAAAACTA 26160 

CCAAAGCTTG TGTCGCTATT GTAGTAGATA CCGTCGTTAA AAGCCATTGG TTTGAGACCG 26220 

TGCGATTTTA CAATACGAGC GAGGTCATTG GCGTAGGCAA TAAATTTTTC ATAGCCTTTT 26280 

ACAGGGTAGC CTTCGTTTGG ATAGTATTTA TCAGCTTGAA GCACACTCCA ACCTTTAGCA 26340 

TCTGTCGCAT CATTGGCATA TTCATCAAGT CCGATGTTGA AGATTTCAGT CTTTTTCGCG 26400 

AAATAAGCAG CATACTTGTC GATAAGGGCT TTTGTAAAAG CGACAGCTTG TTCGTTGTCA 26460 

AGATCGACAG TACGGGCTGA TTTCTTCCCA AAATAGCTAA AGTTAGGGTT TTGGATTCCC 26S20 

AATTCTTTCA TGGCATTGAG AATCGCATCC ATGTGTCCAG GACTATTTAC TGTCGGAATG 26580 
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AGACCCATAC 


CTTTATCTTT 


GGCATAGTTA ATCAGATCTG 


TCATTTGACT 


TTCTGTTAAG 


26640 


TGATTGCCGT 


TTGGATCGTT 


GTAATAATCA TTTGTACCTT 


TTTCAATGGC 


GCGTTTGACA 


26700 


TCGTCACTGG 


CATAGGTCTT 


GCCGTTAGCT GTGATGCTCA 


TATCGTCCAA 


CATGAAACGG 


26760 


AGTCCATCAT 


TTCCGACTAA TAGGTGTAAA TCAGTGTAGC 


CATAATGTTT 


CGCTTTATCG 


26820 


ATGATTTCCT 


TGAGCTGTTC 


TGGTGAGAAA TATTTACGTC 


CAGCATCAAT 


AGAAACAATT 


26880 


TTCTTTTTCG 


CTAGTTTTTC 


ATTTACAGTT GCAGCACGTT 


CCTTTCCTGC 


CTCTGTTGCC 


26940 


GGTTTGTCAG 


CCTCTGCTTT 


CCCTTCATCT TTTTTAGCTG 


GTTTATCCTT 


GTCAGTCTTG 


27000 


TCTGTATTTG 


ACTCTTTAGA 


ATCAACCTCT TTCGCTTCTT 


CCTTTTTAGC 


GCTAGCTTCT 


27060 


TCTGCCTTTT 


TATTAGCAGT 


TTCTTTTTCA GCAGAAGTTC 


GAGTTACCAC 


TTCTGCTTTA 


27120 


TCACTAGGAG 


TTGAACTAAC 


TTCCTCTTGT GGTTTTTCTT 


CTGTTTTTGG 


AAGACTAGCT 


271B0 


ACCTTATCAG 


TAGCTGGAGT 


TTCTGTTTCT ACAGTTTTTG 


GAGCTTCTGG 


TTGAAGCACT 


27240 


GCTTTAGGTG 


TTTCCTCAGT 


CCGATTTTCG GATGATTGAG 


GGGAATCAGA 


AACCGTATGG 


27300 


ATGGTCGGTT 


GGTTTTCTGT 


AGTAGTAGGA GTAACTCCAT 


CGGCTGCAAC 


AGTCTGTGCT 


27360 


TGGAAGGCAA 


ATCCAATTAG 


AACAGAAGCT GCTCCTACAG 


CGTATTTACG 


AATAGAAAAA 


27420 


CGCTGTTGTT 


TTTCATGTTT 


CATTGCAAAA CCTCCTGATT 


GCATTGTTAT 


ATTGATAGCG 


27480 


ATTATATAAA 


TCAACGCCTT 


TATTTTATTT CTTATATTAA 


TTTCTTATAT 


TAACGAGAGT 


27540 


CAAGAGGAGA 


TGACAAAAAA 


CTATAATAAG TATAAAAAAA 


TATAAAATTT 


AAACTTAAGA 


27600 


TTTCAGATTG 


GTCGGAAAAA 


ATACGTATAT ATATCTAGTA 


TAATTTTTGG 


TTCTATTTCT 


27660 


ATAAAATATT 


CCACAAATTA 


TAGAATTTTC CAAAAATAGG 


TAAGCGCTAC 


CTTTTTGGTG 


27720 


TAGTATAATA 


AGCATAGAAA 


AAGCCCAAGC GATTAGCTCA 


GGTTTTCTTC 


TTAGTGATCA 


27780 


CGGTCACATG 


AGATAAATTT 


AATCTTGTAC TAATCAGATC 


GTTTGTAACT 


TTCACTGTAT 


27840 


TCTAAAACTT 


GGCCAGTTGA 


TTCGAGTTTG GTGATTTTAG 


TTTGTAGGAC 


AGTAGGGAAT 


27900 


TGTTCATCOA 


CTCCGAGGAC 


TGAAGCTGCA TGTTCTGGAG 


TTGGAAAGAC 


TATTTCGTTG 


27960 


ATTTCTTCAA 


AGTGTTCATC 


ATTCATGTGA ATGTGGTAGT 


CTAACTTGAA 


ACGATTATAG 


28020 


ATAGAACTAT 


AGTATTCAAG 


GTTTGGATAA TTTGCGTTGA 


TATATTGTTC 


TGGGATGTAG 


28080 


GATGTATGGT 


AGATATAAAC 


GACACCGTTT GATTCGCGGA 


TACGTTCAAT 


CTTGTAGTAG 


28140 


AATTGATCGC 


CGCGTAGACC 


CAATTTTTCC AAGTAAACAA 


GCTTGTTTCC 


GCGTTCAATT 


28200 


GAAAGAACAG 


TTACCTTATC 


ATCTTTAGCA TTGAAGAGTT 


CAATATCTGA 


AAACTCTACA 


28260 


AGCTTGTGTT 


TGCGTGCACG 


TGAAACGAAG GTTCCTTTTC 


CTTGTTGGCG 


GACAATATAG 


28320 


CCATCTTTGG 


CAAGGTCGTT 


TAAGGCGCGA ACAACTGTGA 


TAGAGCTGAC 


ATCGTACATT 


283B0 



WO 98/18931 



PCT/US97rt9588 



677 

GAAATGACTT CTGCTTCAGT GTAAAATTTA TCTCCACTGC TAAACTGCCC AGAGATGATT 28440 

TTATTTTTTA ATTCGTCTTT TATGTATTGA TGG 28473 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6749 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

CCTGATGCGT GGTATGCGAG GATACAGTTC TGAAAATCCC CGTTACTTAA TTAATGGACG 60 

CGAAGTCACA CCTGAGGAAT TTGCTCACTA TCGTGCGACT GGTCAATTAC CAGGAAATGC 120 

AGAAACTGAT GTGCAAATGC CACAACAGGC ATCAGGTATG AAACAAGGCG GTGTCCTTGC 180 

AAAACTAGGT CGAAACTTAA CAGCAGAAGC GCGTGAGGGC AAGTTGGATC CTGTTATCGG 240 

ACGAAACAAG GAAATTCAAG AAACATCTGA AATCCTCTCA CGCCGCACCA AGAACAATCC 300 

TGTTTTGGTC GGAGATGCAG GTGTTGGTAA GACAGCAGTT GTCGAAGGTC TAGCGCAAGC 360 

CATTGTGAAC GGAGATGTTC CTGCTGCTAT CAAGAACAAG GAAATTATTT CTATTGATAT 420 

CTCAGGTCTT GAGGCTGGTA CTCAATACCG TGGTAGCTTT GAAGAAAATG TCCAAAACTT 480 

AGTCAATGAA GTGAAAGAAG CAGGGAATAT TATCCTCTTC TTTGATGAAA TTCACCAAAT 540 

TCTTGGTGCT GGTAGCACTG GTGGAGACAG TGGTTCTAAA GGACTTCCGG ATATTCTCAA 600 

GCCAGCTCTC TCTCGTGGAG AATTGACAGT GATTGGGGCA ACAACTCAAG ACGAATACCG 660 

TAACACCATC TTGAAGAATG CTGCTCTTGC TCGTCGTTTC AACGAAGTGA AGGTCAATGC 720 

TCCTTCGGCA GAGAATACTT TTAAAATTCT TCAAGGAATT CGTGACCTCT ATCAACAACA 780 

CCACAATGTC ATCTTGCCAG ACGAAGTCTT GAAAGCAGCG GTCCATTATT CTGTTCAATA 840 

CATTCCTCAA CGTAGCTTGC CAGATAAGGC TATTGACCTT GTCGATGTAA CGGCTGCTCA 900 

CTTGGCGGCT CAACATCCAG TAACAGATGT GCATGCTGTT GAACGAGAAA TCGAAACGGA 960 

AAAAGACAAG CAAGAAAAAG CAGTTGAAGC AGAAGATTTT GAAGCAGCTC TAAACTATAA 1020 

AACACGCATT GCAGAATTGG AAAGGAAAAT CGAAAACCAC ACAGAAGATA TGAAAGTGAC 1080 

TGCAAGTGTC AACGATGTGG CTGAATCTGT GGAACGAATG ACAGGTATCC CAGTATCGCA 1140 

AATGGAAGCT TCAGATATCG AACGTTTGAA AGATATGGCT CATCGCTTGC AAGACAAGGT 1200 

GATTGGTCAA GATAAGGCCG TAGAAGTTGT AGCTCGTGCT ATCCGTCGTA ACCGTGCTGG 1260 
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TTTTGATGAA 


GGAAATCGCC 


CAATCGGCAA 


CTTCCTCTTT 


GTAGGGTCTA 


CTGGGGTTGG 


1320 


TAAGACGGAG 


CTTGCTAAGC 


AATTGGCACT 


CGATATGTTT 


GGAACCCAGG 


ATGCGATTAT 


1380 


CCGTTTAGAT 


ATGTCTGAAT 


ACAGTGACCG 


CACAGCTGTT 


TCTAAGCTAA 


TTGGTACAAC 


1440 


AGCAGGCTAT 


GTGGGTTATG 


ATGACAATAG 


CAATACCTTA 


ACAGAACGTG 


TTCGTCGCAA 


1500 


TCCATACTCT 


ATCATTCTCT 


TGGATGAAAT 


TGAAAAGGCT 


GACCCTCAAG 


TTATTACCCT 


1560 


TCTCCTCCAA 


GTTCTAGATG 


ATGGTCGTTT 


GACAGATGGT 


CAAGGAAATA 


CAGTAAACTT 


1620 


CAAGAACACT 


GTCATTATTG 


CGACCTCAAA 


TGCTGGATTT 


GGCTATGAAG 


CCAACTTGAC 


1680 


AGAAGATGCG 


GATAAACCAG 


AATTGATGGA 


CCGTTTGAAA 


CCCTTCTTCC 


GTCCAGAATT 


1740 


CCTCAACCGC 


TTTAATGCAG 


TCATCGAGTT 


CTCACACTTG 


ACTAAGGAAG 


ACCTTTCTAA 


1600 


GATTGTAGAT 


TTGATGTTGG 


CTGAAGTTAA 


CCAAACCTTG 


GCTAAGAAAG 


ACATTGACTT 


1860 


GGTAGTCAGT 


CAAGCGGCTA 


AAGATTATAT 


CACAGAAGAA 


GGTTACGACG 


AAGTCATGGG 


1920 


GGTTCGTCCT 


CTCCGTCGCG 


TGGTTGAACA 


AGAAATTCGT 


GATAAGGTGA 


CAGACTTCCA 


1980 


CTTGGATCAT 


TTAGATCCTA 


AACATCTGGA 


AGCAGATATG 


GAAGATGGCG 


TTTTGGTTAT 


2040 


TCGTGAGAAA 


GTCTAAGACA 


GAATTTTGAG 


GATAAAAAAG 


AAGGACCCAG 


CTGAAAAAAA 


2100 


CTGGTTCCTT 


TTTAGGTACG 


ACAGGCATGT 


CGTATAGTAG 


AAGTGTATTA 


TTCTAGTTTC 


2160 


AATATACTAT 


AGTAGCTCAG 


AAGTCGGTAC 


TTAAACGTGC 


TATATCAAAA 


CCAGTCCTCG 


2220 


AAAAACGTGG 


ACTGGTTTCG TGTTTGGATT 


ATTACCTTGA 


ACGACATGCG 


TTAAAAGTTA 


2280 


GTTGAACCGC 


CGTATGCCGA ATGGTACGTA CGGTGGTGTG 


AGAGGGGCTA 


GAGATTATCC 


2340 


CCTACTCGAT 


TTTAAATCAC 


ATGACGTTCA 


AAGGCATCAT 


CTGAAATCCC 


TTGTTCCAAG 


2400 


ATGAGTTTTG 


CCCATTCTTT 


AGCAGAGAAG 


AGGCTGTGGT 


CCTTGTAGTT 


TCCGCAAGAT 


2460 


TCGATGGTTG 


TCCCTGGGAC 


ATCTTCCCAA 


GTAGTAGTTT 


CAGCGATTTC 


CTTGAGCGAA 


2520 


TCCTTGATAA 


CAGCTGCGAT 


TTTAGCACTG 


GTGTGACGTC 


CCCACATAAT 


CATGTGGAAG 


2580 


CCTGTGCGGC 


AACCAAATGG 


TGAACAGTCA 


ATCATGCCGT 


CAATGCGGGT 


ACGGATGAGT 


2640 


TTGGCTAAGA 


GGTGCTCGAT 


AGTGTGAAGG 


CCGGCAGTAG 


GGATAGAGTC 


TTCGTTTGGT 


.1700 


TGCACCAAGC 


GAATATCATA 


ATTGGAGATG 


ATGTCTCCTT 


TTGGTCCTGT 


TTCTTCCCCA 


2760 


ATCAAGCGAA 


CATAGGGTGC 


TTTGACAATG 


GTGTGGTCAA 


GTTCAAAACT 


TTCGACAATA 


2820 


ACTTCTTTTG 


ACATGGTAAA 


TCCTTTCAGT 


TTTCTTCTCT 


CATTATATCA 


TAAAGGTTGC 


2880 


TCCTGAGACA 


GAGAGAAAAC 


CTCTCCGAGG 


CTGGAGAGGT 


TGAAATCTTT 


ACTTACGATA 


2940 


TAAGCGGTCG 


TATTGGTAGT 


ATGGGTCAAA 


GGTTACGTTG 


ATACCCAGTT 


TACGAAGGAC 


3000 


ATTCTTGTCT 


TCATCAGTCA 


AGATGATGGT 


TGAGTGGGCT 


TCGCTTCCTT 


TGAGGTTGCC 


3060 
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GAGTTCTTCC ATAGCGCGGG CAGCATCAGG ATTTTCTGTA GCTGTGATAG CAAGTGCAAT 3120 

CAGGATTTCA TTTGAATGAA GGCGTGGATT GCGGCTACCG AGATGATCGA TTTTAAGACC 31 BO 

TTGGATTGGC TTAACAACTT CAGGCTCGAT TAGTTTTACT TCTTTAGCGA TGTCAGCTGA 3240 

TTTTTTGATG GCGTTGATCA AGGCAGCGGC TGTAGGACCA AAGAGTTCTG AGTTCTTACC 3300 

AGTGATGATT TCCCCATTTG GCAATTCAAA GGCTAGGGCT GGTCCACCAG TTTCTTCTGC 3 3 SO 

TTTTTGGCGC GCAACGACAG CAACCTTACG GTCTGCAGGT GTGATACCGA GGTCGTTCAT 3420 

GAGCAACTCA ATTTTCTTGA CGGCAGCTTC GCCAACTTTT TCAGCTTTGA AGTCAAGAAC 3480 

TGTTTGATAG TAACGGCGGA TGATTTCTTG TTTAGAAGCT TCGACAGCGG CCTCGTCATC 3 540 

TGTAATAGCG AAACCAACCA TGTTGACACC CATATCTGTC GGTGAAGCGT ATGGTGATTT 3600 

TCCGAGAATA CGTTCCAACA TGCGTTTGAG CACTGGGAAG ATTTCGATAT CACGGTTGTA 3660 

GTTGACAGTG GTTTCTCCAT AGGTTTGAAG ATGGAAGGGG TCAATCATGT TGACATCATC 3720 

AAGGTCAGCT GTGGCAGCTT CATAAGCCAA GTTAACTGGA TGATGAAGCG GAAGATTCCA 3780 

AACAGGGAAG GTTTCAAATT TAGCGTAGCC AGATTTGATG CCATTGATTT GGTCGTGGTA 3840 

CATATTGGAC ATACACGTTG CCAATTTTCC AGAACCAGGT CCAGGAGCGG TTACGACAAT 3900 

CAAGTTGCGA CTGGTTTTGA TGTAGTCGTT TTTGCCCATG CCTTCTGGGG AAATGATGTG 3960 

ATCCATATCC GTCGCATATC CTTTGATTGG ATAATGAAGA TAAGAATCAA TTCCGTTTTT 4020 

CTCAAGTTGA TTGCGGAAGG CATCTGCAGC GGGTTGGCCA GCGTATTCTG TAATGACAAC 4080 

GGAACCAACA AAAATCCCTA ATTCATTGAA TTTATCAATC AAACGAAGAA CTTCTTGGTC 4140 

ATAAGAAATG CCTAAGTCGC CACGTGCTTT GGAATGTTCA ATGTTGCTAG CATTAATGGC 4200 

AATCACAACC TCAACCTGCT CTTTCAATTC TTGCAAGAGC TTGATTTTGT TGTCAGGTTC 4260 

ATAACCAGGA AGGACACGAG CAGCCTGGAA ATCTTCTAAC ATTTTACCGC CAAACTCTAA 4320 

GTAGAGCTTG CCGTCAAATT GGTTAATGCG CTCCAAAATA TGGTCGCGTT GTAAATTCAA 4380 

ATATTGTTCA GAACTAAAAG CTTGTTTTTT CATTTTTTTA CCTCTGGACT CTATTATAAT 4440 

AAAAAATTGG AAGTTAGGAA ACTACGGAGC TAAAAAAGAA ATTAAAAAGA TTAAGCAAAC 4500 

GCTTGCACAA AATTTTAAAA AGTGCTATCA TAGACTATAG ATTATGAAAA TAATGAGGTA 4560 

AACAGATGCA AGAAAAATGG TGGCACAATG CCGTAGTCTA TCAAGTCTAT CCAAAGAGTT 4620 

TTATGGATAG TAATGGAGAT GGAGTTGGTG ATTTGCCAGG TATTACCAGT AAGTTGGACT 4680 

ATCTAGCTAA GCTAGGAATC ACAGCAATTT GGCTTTCTCC CGTTTATGAC AGCCCTATGG 4740 

ATGATAATGG CTATGATATT GCTGATTATC AAGCGATTGC GGCTATTTTT GGAACCATGG 4800 
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AGGACATGGA 


TCAGCTGATT 


GCAGAAGCTA 


AGAAGCGTGA 


CATTCGTATC 


ATCATGGACT 


4860 


TGGTGGTCAA 


TCATACCTCA 


GATGAACATG 


CTTGGTTTGT 


CGAAGCCTGT 


GAAAATACTG 


4920 


ACAGCCCTGA 


GCGAGACTAC 


TATATCTGGC 


GCGATGAACC 


CAATGACCTA 


GATTCTATCT 


4980 


TTAGTGGGTC 


TGCTTGGGAA 


TACGATGAAA 


AGTCAGGTCA 


ATACTATCTC 


CACTTTTTCA 


5040 


G C AAGAAAC A 


GCCGGATCTC 


AACTGGGAAA 


ATGAAAAACT 


TCGCCAGAAA 


ATTTATGAGA 


5100 


TGATGAACTT 


CTGGATTGAT 


AAAGGTATTG 


GTGGTTTCCG 


TATGGATGTT 


ATTGACATGA 


5160 


TTGGCAAAAT 


TCCTGACGAG 


AAGGTAGTCA 


ATAATGGTCC 


TATGCTCCAT 


CCCTATCTCA 


5220 


AGGAAATGAA 


TCAGGCGACC 


TTTGGAGATA 


AGGATCTCTT 


GACAGTAGGG 


GAGACTTGGG 


5280 


GAGCAACTCC 


AGAGATTGCC 


AAGTTCTACT 


CTGATCCAAA 


GGGGCAAGAA 


TTGTCTATGG 


5340 


TCTTCCAGTT 


TGAACATATC 


GGTCTTCAGT 


ATCAGGAAGG 


TCAGCCTAAA 


TGGCACTATC 


5400 


AAAAAGAGCT 


GAATATCGCT 


AAGTTAAAAG 


AAATCTTCAA 


CAAATGGCAG 


ACAGAGTTAG 


5460 


GAGTTGAGGA 


CGGCTGGAAT 


TCCCTCTTCT 


GGAACAACCA 


TGACCTCCCT 


CGTATTGTCT 


5520 


CAATCTGGGG 


AAATGACCAA 


GAATACCGCG 


AAAAATCTGC 


CAAAGCCTTT 


GCAATCTTAC 


5580 


TTCATCTCAT 


GACAGGAACT 


CCTTATATCT 


ACCAAGGTGA 


GGAGATTGGG 


ATGACCAACT 


5640 


ATCCGTTTGA 


AACACTGGAT 


CAAGTAGAAG 


ATATTGAATC 


TCTCAACTAT 


GCGCGTGAGG 


5700 


CTCTTGAAAA 


AGGTGTTCCG 


ATTGAAGAAA 


TCATGGACAG 


TATCCGTGTT 


ATTGGACGTG 


5760 


ACAATGCCCG 


TACCCCTATG 


CAATGGGACG 


AGAGCAAAAA 


CGCTGGTTTC 


TCAACAGGTC 


5820 


AACCTTGGTT 


GGCGGTTAAT 


CCAAATTACG 


AGATGATCAA 


TGTCCAAGAA 


GCGCTGGCAA 


5880 


ATCCAGATTC 


TATTTTCTAT 


ACCTATCAGA 


AACTGGTCCA 


AATTCGCAAG 


GAGAATAGCT 


5940 


GGCTAGTTCG 


AGCTGACTTT 


GAATTGCTTG 


ATACGGCTGA 


TAAGGTCTTT 


GCTTATATAC 


6000 


GTAAGGATGG 


CGACCGTCGC 


TTCCTAGTTG 


TGGCTAACTT 


GTCCAATGAA 


GAGCAAGACT 


6060 


TGACAGTAGA 


AGGAAAAGTC 


AAATCTGTCT 


TGATTGAAAA 


CACTGCGCCT 


AAAGAAGTAC 


6120 






CCATGGGATG 


CTTTCTGTGT 


GGAATTACTA 


TAAATATTTT 


6180 


TTGCAGAAAA 


ATTTAAAATT 


GAAATCGTAT 


AAAAACAAGG 


GAGGACTGTA 


TAAAAGACAG 


6240 


AAATCCTTTG 


TTTTTTATAA 


CCAAAGTTTA 


TAAACTTTCA 


TTCTTGAAAT 


TCAATTAACT 


6300 


TTACAAATTC 


CCACTATTAA 


GGAGAAAGAA 


GATGAACATA 


AAGAAGCGTG 


TCCTTAGTGC 


6360 


AGGCCTGACT 


TTTGCATCTG 


CTTTGCTTTT 


ACCCAAATCA 


TTCATACCTC 


TCTCAACTAG 


6420 


ATGTAACTTA 


CAAAACCCCT 


GACCTCATGA 


GCCACTTTCT 


TCCTCCTCAT 


GAGGTCAGTT 


6480 


TTACTTTCTG 


CTGTTCCAGT 


ATCGTTTTTC 


CTCGCTAGAT 


TTCCTCAAAA 


GGGCAGACTC 


6540 


CTCCCTTGGT 


GCGTCACACG 


ATTTTTTCAT 


CTCGACTGTT 


CTTTAATGCA 


TCATTAACGA 


6600 
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CGCTTTTCTT CTAGGTGGTT CATAAGGAAC AGGAAGATTC AGGTTGACTT TTCTAATCCT 6660 

AGAATAAAGT GCTGAAAACA ATTCGGAATA GGCATAGAGA CTAGACAATT TGAGGAGCTG 6720 

CTTGCGTCCT GTTCGAACAC ATTTTCCGG 6749 



(2) INFORMATION FOR SEQ ID NO: 85: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1842 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 85: 



TCTACCCATG 


GACTTTGAGG 


CATTCATTGT 


TCCATCTTCT 


AGTGGCGAAT 


CTTTTGATAC 


60 


AAACGATTCA 


ATTCACTTGG 


ATAGTGAAAC 


TCTCCCGCAA 


ACATTTTTCT 


GGTTAACTCA 


120 


ATCCAGCTGA 


TATTTCTTTC 


AGCCAAAATA 


ATGGACAAGT 


TCTCCCAAAA 


TCGTTCAGCC 


180 


ATATTGCTTC 


TCCTTTAGTT 


AGATAAATAA 


TGTGTTTGCG 


CCATGTAAAT 


CAATTGTTTC 


240 


GTATCTCTTG 


GCAATAGAGC 


TCTAGCCTCT 


TCCAAATTCA 


GACTTGGATA 


AACTCGCTTA 


300 


TTTGAAACCG 


CAAGAGGAAG 


TCTGATGCTT 


AGTTCAGGAT 


TTTTTAAAAT 


TATCTCAACG 


360 


AAATCCGTTA 


ATCTTAGATT 


GTCACGGTTC 


TTAAATCGTA 


ATAAATTGGG 


AGATAAAAAC 


420 


TCAAAACAAT 


CTGAAGAATA 


GCTCATCATC 


TCAATTAATT 


TGTCCTTTGT 


CATTTCAGAA 


460 


ACTGAATGAC 


AAGATACCTC 


TATGCCATAG 


TTTTGGAAGA 


AATCTAAAAG 


AAGTTGATTT 


540 


CTTTGTCTAT 


TTTTACTTAG 


ATAGACATCA 


ATCATGGGAG 


ACCTCCCAAA 


GATTCGGTTC 


600 


CATTTGATAT 


TCTGACACGA 


TTAAGGAATC 


TAATAAATTA 


AGGAATCTAA 


TAAATTTGCG 


660 


AAGTTAATCG 


GTTTCTTGTC 


TTCATCATAA 


GCTTTTACAG 


TTACTTGGGT 


TGTAAGTATT 


720 


CCCTCTTTTC 


CCTCGGCTCG 


ATAGCCTTGT 


CCATATAAAA 


CAAAAACGAG 


ATTTTGATGA 


780 


TCATCTACAA 


AGGCATCAAC 


CCCATTCTTT 


ATGTCTTGAC 


TTTCAAGGAA 


TTCCATAACG 


840 


TTTTGAAGAT 


AGGATTCGTA 


AAATAGTGGG 


TAGTTATGTT 


TTTTATGGTA 


ATCATCTAAA 


900 


AATGTCACTT 


CAAACTCACA 


TGGAGAGTAA 


TTTTGACTTT 


GAACACCCTA 


AAAGTGCCAT 


960 


CAAATTTGAA 


TTGGAATAAA 


TCAAATAAAT 


AGCCCCATCC 


TCATCAATCC 


AACCTTTGCT 


1020 


CAAAGACAAC 


TCCAACCGAT 


CTTTTAAAAC 


TGAGTAAACC 


ACCTTAACCT 


CCAGTTTCAT 


1080 


ATTCTTATAC 


CGTTCACTCT 


CAAATAAAAG 


TTTGGGGAGC 


TTATAATAAC 


GCTCTGATGT 


1140 


CTGATATTGA 


TTAGCGGTAA 


TACGCTTCAT 


TATTGTCCCT 


CCAAGACTAA 


AATTCCAACA 


1200 
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TTTCCAAATT CATCAAATCG GATTAAACCT ACTTGTTCCA TTTCATCAAC 

GCTTTTACCC AAATCATTCA TACCTCTCTC AACTAGATGT AACTTACAAA 

TCATGAGCCA CTTTCTTCCT CCTCATGAGG TCAGTTTTAC TTTCTGCTGT 

TTTTTCCTCG CTAGATTTCC TCAAAAGGGC AGACTCCTCC CTTGGTGCGT 

TTTCATCTCG ACTGTTCTTT AATGCATCAT TAACGACGCT TTTCTTCTAG 

AGGAACAGGA AGATTCAGGT TGACTTTTCT AATCCTAGAA TAAAGTGCTG 

GGAATAGGCA TAGAGACTAG ACAATTTGAG GAGCTGCTTG CGTCCTGTTC 

TCCCACCACG TGAAGAAAAA GATGGCGGAA GCGTTTGATT GTTAAAGTTT 

TCCAGCTAGA TGTTTGAGAA AAAGATAGAG ATTGTAGGCG ATACAGCTCA 

AACTTCGTTT TTGATTAAGG TTGAACTATC CGTTTTATCG CCAAAAAATC 

CTCCTTGATG AAATTCTCGG CTTGACCACG TCCACGATAA AG 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19390 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



TAACTGAGTT 1260 

ACCCCTGACC 1320 

TCCAGTATCG 1380 

CACACGATTT 1440 

GTGGTTCATA 1S00 

AAAACAATTC 1560 

GAACACATTT 1620 

GGAAGTCACC 16 BO 

TCATCATACG 1740 

CCTCCTTCAT 1800 

1842 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
TCATCTTTAT CTCCTCGAAA TTTTCTAATA TAGCCATTAT AACAGAATTT 
CCTATTATAG TAAATCACTA TTTCAGTATA AAAAGAAAAA ACGAATCAGA 
TTCTTAAAAT CTGAAAATAG CTTTCCAGAA AGGATTAGCC GATTTTTTGC 
CTGCATCGTG ACTCATCAAG ACTTGACCAT ACTCTTGTAA GACTGAGCGA 
TATCGTCTGC AAACTCGCGC ATACGGGCCA ACAGCCAAGC TGGATATGGG 
TTTCAATATC CACTAAAATG GTCAAATAAT AGCGCTCGTT CATTTTGTAG 
TTTCCATTTC AAAAGTCACT GTCTTGGCAA AAGCTACCAA GTCAGCCAAC 
AAAGGATGTA GTAGATGTAA GGTTCTTTCT TACTCTCAGC TTCTTGTTCA 
GCTCTTCTTC CTTGACTTCA ACTTGCTCAA GAGATTGAAT GGCTTCGATA 
TTTTGTCTGC GATGCTTTTT TCCAGGGTTT TGATAAATTC ATCTGGAGAC 
ATTCTTCCAT ATCTGGCAAA TCCGATAAGT CTTCAAAATC TAGATTTTGG 
ACTTGGTCAC AAAGACATCT ACCTTATCAG GTTTTGGAGT CACACGGAAG 
CTGTATCCAG AAAGCTATCA GGCATCTCTA GCTCATCCAA GATAGCATAA 



TGTGAAAATT 

CGATTCGCTC 

AGATTGAGCA 

CTGATATCAC 

CTTGGATCAT 

AGTTCAGAAG 

TTAGCAAAAG 

GCCTGCTCTT 

TCATCCTTGG 

ATTTGAGCCA 

TCAATCTTTG 

CTCAACATGC 

AAGAACTCTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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CTCTTTTTTC TTGAGGAACG AGAAAGTCAG CAATCTCCAT TCCACGATCC ATCAAATCCT 840 

CTAAAGATAT CGTGATTTTT AAAGTTGTAT CACTAATTTG TTTCATTTTC ATTGCTAGTA 900 

ACCTCATACT TTCAGTTCTA TCTATTATAC TAGATTTTTA CGATTTTATC AAAAGAAGGC 960 

TCCTCTATAC GGATAGATTT TCCCTAGGGT CTTTCTATAG GAGACTCCAA AAGAAAATTT 1020 

CTGCAGACAG ATAGAAAAAG CCTTCAAAAT CGGCTAAGAG CCGACTTTGA AGACCTTATA 1080 

CATCAGAATA CTTATAATTT AAAGGTTGCT ACACCGAGGA TAGAACGATT TAAGTTTCTG 1140 

AGAATTTGAA GACTTTGCTC AAATTTCTTA TAACGAGTCA CTCCGTACTC TTCAACAAGA 1200 

AGCACTGTAT CTCTTTCCAA AAGAGATGAT ACATCCTGTA AATCTACAAA ATGCATTCCT 1260 

TTTAAAGCTT CTTGACTCTG TTTCAATTTA TCTAAGATAG CTTTATTTGA GCTAACGATG 1320 

GTCAATTCCT GTCCAGTATT TTTGTATGAC AAAACATCTG CTAGGTTAGC AATTGTTGTA 1380 

ATCTCTGTTA CAAAATCAAT TTGATACTGA GAAAAATCAC CTACTCTATT GATTGTTGGA 144 0 

TTAAAGAGAT AAACTAACAC ATTTCCCATC ACAACCAAAA TCACACAAAC CACTCCAATA 1500 

ACAACTAAAC GAAGAATCAG ATTTTTCACA TTTAAGCCAA GCGCTGTTTC ACCATTTGCG 1560 

TTCAATTCTT TAGAGTTGAT GGTTTCCAGT TTTTCAATTT TCACATTTGC ATAGGCATGT 1620 

TTAAATTTCT CAATCAACCC ATCAATTTTT TTCTCTAACA AGTTATTGGC ATCTTTACTT 1680 

GATGTCAAAA TTTTCACACC AACCCCTGCA TCGTCAATCA TATAGTAGAC GGTCAATTTT 1740 

TTCCACCAAT AGTCATTCCT TGAATTTTTC AACGTTGTTT CTGTCGTGTC TAATTCACTG 1800 

GCAATTTTTT TCAACTCACT GGGTTCTACA TCATTCAAAA GATAAGCTCC ATTCAAATTA 1860 

CCATCAATCA ATTTCCCATA AAAATCACTA TAACCACCAA TTTGATGATT CAAAATCGTT 1920 

TTGTCCGACT CTTTTGGAGG AGTGATTTTA TAGATAAGAT AACTTGAATA ACTTGTTGTA 1980 

TCTTTGACAG TGTTTTTATT CCTAACTGCT TTAATTGTAA ATGGTACAGC AATGAGAGCA 2040 

AATAAAGCGA TGAGAGCTAA AATATTTGCT TTTCGCTTTT TATAAAGATT TGCAAACAAA 2100 

TCAGCTACTG AATAATGTTC AAACATGATT TTTTTCTCCT TTGTTTAGTA GATACTAGTT 2160 

TTCCTTTGTA AGCATTTTTG CTACAAATAT AATCACAAGA ACAATTCCCC AGAATTGCAT 2220 

TGTAAATAAA TTGAAGAAAC TTTCTGAAAA GCTGCTTCTT GGCATAAAGA ATAGATTATT 2280 

CAAGATGAGT AGGGATAAAG CAAATAGGAT TGTCCTTGAG CGATAGGCTA CTTGCAGCAT 2340 

GGCTATAAAT AATACGCCGA GTAAGAAACT AAGCAGAAAG ACTCCAATCA TACCATAGTC 2400 

CGTATACAAC TCCATGATAT AACTACTTCC GATACCATGC CCTTTCAAGT ATTCCTTGTT 2460 

CAAGACAAGA TAGGATAGAT TGTGGGCATA ACTATTACTA TCAATAGCTA GTTCCACACT 252 0 
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ATTGGTTGTA TCTTCAAAGG CTTTTCCTCC GAAAATGGCT CCCAAACTCC CCCTTGCAAA 2580 

ATAATCAAGA ACAGGACCAA AAGTAAAATT ACGGAAATCT CGGTAAGGGA GGCTACTGTT 2640 

AAATAGAAAA CCTCGAGCCA GAACACCAAA ACTAGTCCCT TGTTTATAGA TAAACTCAAG 2700 

TAAGATATCC CAGAAACCTG TATGGGAAAC TTGCACATTA TCCCGTACAT AATTGAGTAC 2760 

TCCCATCGCT AACATGAGAA TAGGAGAACC TACAAAAATC GCTAACTTTT CTTTAAACCC 2820 

AATCCATTTT CCTTTTTCAG TTTGCTCCCG CATAAAGTAA TAAACAAAAG CAAATmAAAT 2880 

ACTTAAAATA AAGGGATTTC GTGTCCCAAT TGCCAAATGA AT ACT ATT AG CTGCAATAAA 2940 

GGAGACAAGC ACTGCTGTGG CCTGCAATTT CTTTGGCTTG GTTGCCAGAT ACATACACAT 3000 

TGCATAGACC GTAAAGGTAG ACAAAATGTA GGTAAAATAA GGCAGTTTAC TTTCAAAATT 3060 

TGCATAGTAG GCATAGTAGG AAGTCTGCAA ACGATACAAG AGCCGTTCAA ATAACCGAAT 3120 

GAAATAGAAA GGATAAGTTA GAAGAAAAAC TCCTAGTGAT ACAAAGCGTA ACCGCTTGAT 3180 

ATAAACCTCT TTTAGAGAAT TTCCTATATT TGCTACTTTT ATTTTCTTCC TAGCTATGAA 3240 

GTAACGAGCC AGAATGCCTC CTGTGGTCAA GCCCAGAATC GAAATCATGA CAACTATAAA 3300 

GGCAAAACGA TAGGCTATTG GATGATAGGT ATCCAAAGCA CCATCCCTAA AATAATCAAT 3360 

GGTCGGTCTT GATACCAGAA ATACAAAAAT GCTTAAATAG AAAATAAAAT GGATTAAGTA 3420 

ATACTTGATA TCATTCCAAC AAGCAATTAA GCTACTAACC AACAAGAACA ATAAAGTAGA 3480 

AAGTAAGCTA ACATTATTAT TATTAAACAG ATACACAATT CCACTTACTA GCGTCAAGGC 3540 

ATAACTGACT ATGGTCAAAC TAAATAATAA TCGTTTCCCA TCAATCACTT GGTCACCCCC 3600 

GTTCTAATGT AATTTTTTAG ATTTTTCAAT ATTTTTCAGT AATAAGAATC GATATAAGGA 3660 

AATATTTATG AATAGGGCCA AAGCACTAAT TCTTCTCCCC TTACGGAAAA TTGGATTCCT 3720 

AGAAATAGCA AAGGCATGGC CTTTTAAAAA ACGATGAATC TGAGAATAGG CTTCAAACTG 3780 

TTTATACTGA TCATCTAGCA ACATCTTATC CAGAATAAAG AAGTGGGCAT AGGCCAATCT 3840 

GAAAAAAGCG ACCTCTTTCA AGTCAGGATA GTTTTTCACA ACTTCATTAT AAAACTTTTG 3900 

GTAGATATCA ATATAGGCTA AATCCTTCTC TGCATAGGGT TTGGTCGTAA TACTATCCCC 3960 

TCTATGGAAA TAGTAATAAT AGGGTTTAGT ATTAACCACA TACTTCTTGG CCAACTTGAT 4020 

TAAATCAAAA TGGTAATAGG CATCTTCGTA AATCAACCCC TTAGGAAAGG ATAGGGCACT 408 0 

TGCAATCTGT CTCTTGATTA GCTTATTGCA AATCGTCCCA GGTATTTTTT CACCTATGAG 4140 

GTATTCCTTT AGAAATGTTT GAGAATCACA GACAAAATAG TCATCCTGAT TGGCTGACTG 4200 

TGGGCTTTCA TCATTAGCAT AGACATTCAT GACACCACAG CTCGAAACAT CCGCATCTTC 4260 

TTGAACTAAT TGCTCATATA AGCTCTGAAT CATTTCTGGA TGGATATAAT CATCTGAGTC 4320 
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AATAAAAATC AGATAATCCC CCTCAGCCTG CTTCATCCCA TCATTTCGTG CTTGCGACAA 4380 
TCCTTCGTTC TTTTTATGAA GCACTGACAC CCTGTCATCT TGTTCAGCGA TTGAATCACA 4440 
CAAGCGACCA CTTTCATCTG TTGCACCATC ATCAACAAGA ATAATTTCCA GATTTTGATA 4500 
GGTCTGCTTC TGAATGGAAG CTATCGATTT TTCTAGGTAC TGCGCCACAT TATAGACTGC 4560 
CACAATCACA CTAATTAATG CAGTTTCCAT GCTACTCCTC TAATAGTTTT TCTACTTGTT 4620 

CGATTTGTTT TGTAATTGTA AATTGTTGAA TGAATTGGCT AGCCTCATCG ACATCAAAGT 4 680 

TTGAGGCAGA AGTCATGTAA TTAGTAATCG CCTGAGCTGC CTCTTGATTG CTCTCAATGA 4740 

TTTGTCCAAA TCGTCCTTCT TGGGATAATT CCTCAGCCCC TCCAACGTCC GTAGAGATAA 4 800 

AAGGGAGTCC CAGACTCAAG GCCTCCACAT ACACTCCAGG AAAACCTTCT TGTTTAGACA 4860 

TAGACAAAAG AACTTTCGTC TGAGATAGAT ACTGATAAGG ATTTTTTTGA TAACCAAGGA 4920 

AATGTACATA GTCCTCAATC CCATACTCTT TGACTCGTTT TTTCAGTTCC TCTTCCATAT 4980 

CACCAGCCCC GATAAAATAG AGATGATAGT TTTTTCCCTC TTGGTGTAAT AATCGTATCA 5040 

CTTCCACTAC ACGGTCAGAA CCCTTATTTT CCTCAATCCG TCCGATAGTA CAGATACTTT 5100 

GAGGAGCAAT CTCGATATCG ATCTTCTCTT GACATTTTTC TAGAATAGTC TGAAAATCAT 5160 

ATCCATTGTA GATTGTCTGT AATTTAGAAG TATAATCTCC ATAAACTTCC TTGATAGAAT 5220 

TGCTGGTCrT TTTTGAAATC CCTACAATTG TATTCGCAGC ATCCAACTGC CTTCTATGTG 5280 

ATTCTCTTTT AGAGCTATCC TTAAGAAGTT CTTCAATACT TCCATGAATC CAAGATATCT 5340 

TCTTGACTTC TCTTCTTTTA GAGAACAACA GTGGTGGATT CATAATGGTA AAAGAAACTT 5400 

CAACATCATA ATCATCTTTT ACAAGCAAAC GACGAGTCAC TCTTGGAAAA TAAATTCTCA 5460 

TTCTCCACAA AAAAGCTCGT AACCATCTGG TTTGGCGATA ATCTTGAAGG GATTTTAAAA 5520 

TGCGTACATG CTTTGGAACA GATTCATATC CCTTGTCAAA GTGCTCCATT TCAAGAATAT 5580 

CAATATCATA CTTTTCTGGA TCCAGATTTG AAACAATGGT TGATAGAATC TTCTCTGCAC 5640 

CACCTCCAAG AGAAAAAGAC CACATAAAAA ATAAGATTTT TTTCTTAGCC ACCATATTCT 5700 

CCCTTGTATT CTGTATAAGA CTTATCCATA TCAGCGATGA CAGCATCATG ATGCCGTACC 5760 

TGCTTGTCTG CTGGTGGAGG CGTCATATAA TCCCCAAAAG CAGTTCTGAG ATAGACATCA 5820 

TAGCCGATTG GAATAGCCAT CTCTGTTCCT TCAAATGGCA AGAAAAGATT GTCTTCAAAA 5880 

GATGTGATTG GGTACTTGTT TCTCATGTAG CCACGACCTG AGCATAATTC TGTAATGCCA 5940 

TCACAATCAG CCAAATCATA CTTAGTCATT TCTTTCTCAG CTTTTTTCCA GATGCGATAA 6000 

CGGAGAGATT TTGGAGTCAA ACCCAGTAAA ATGCGACTTC CCCATTTCAT GAGATCACCA 6060 
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TGCTTTTCTG GAATAGTTTG CGCACAAAAG AGTGAATAAA TCAAGGCCCA ACGAACCTGT 6120 

TTTTTCCGCT CAGCTGGATT TTTCGGATAA TAATCCAAAG GCAAAACATC CAAGGCCAGA 6180 

CCATGTGGCA AATCCAAATC CTGCTGATAA GGCTTGATAC AGGTGGTTTT CTTGTCACGA 6240 

ATGGTAATAA AAAGATTACG ATCAACAAAA TCCTTGTGAC TCTTTGACAA GAAATAACGT 6300 

TCATCTGCAT AACGAGGCCA TAATTCTGCT AATTTCTCAT AATCTTTACG AGGCATAAAA 6360 

AAGTCTAGGT CGTCGTCCCA AGGAATAAAT CCCTTGTTTC GAAGGGCACC AATAGCGCCT 6420 

CCGCCACAGA GATAACAGAG CAAATCATGT TCTTTACAAA AGGCCACAAA ATATTCAGCC 64 BO 

ATCTCCAGAC TACGAGCCTG AATTGCTTTT AAATCAGTCA TATTGTTCAT TATTCTTTCT 6540 

ATCGTATCGT TTCATTATAC CACAAACAAG GGGTGAAAAT CTATTGCAGA CTGTAAAAAA 6600 

TCAAAGCCTG ACTGCTATCC AAATAGCTAT CAAACTTTGA TTTTTCTGTC TTATACTCTT 6660 

CGAAAATCTC TTCAAACCAC GTCAGCTTCA CCTTGCCGTA GGTATAGCTA ACTGACTTCG 6720 

TCAGTCTTAT CTACAACCTC AAAACTGTGT TTTTAGCAGC CTGCGGCTAG CTTCCTAGTT 6780 

TGCACTTTGA TTTTCATTGA GTATTATCTT ATCTTAAGCC CATTTGAGCG AGCTTGGTTT 6840 

GATATTTGTT TTGATCAACC AGCAGGCCCA AGCCCCCATA AACATCATAG GCATCTACCC 6900 

AGTCACCCAG TTCTGGAATC GTCAATTTTT CAATACCATT TTTTGCTCCA TCCAAAACAG 6960 

ATAAACCGTT TGTTAGGAGG AAAGTATAGG GTACGTTGGT TGAGGTCATA GCAAAAACCT 7020 

TTCCAAGAGC TTCAGAACCA GTGAAAAGTT TAGTGGGATC TTTAATTTGC TCTAAAATTG 7080 

CTGTTAAAAC TTGTTGCTGT CTTTTTGTAC GGCCGTAATC TGCCTCATCA TCATCACGGA 7140 

AACGAGCATA ATTGAGCAGG GTCGAGCCAT TCATCTGCTG TTTTCCGACT TTAATGGTTT 7200 

GCGTTGGAGA CTCAGTCTCG GTAGCGTATA AATCATCTCC GACTGTAGCT TCTGTTAGGG 7260 

GACGCCCATT CAATGTTGAA AATTGAGCAT CAATCGTCAC CCCATCAGGG AAAAGCGTGT 7320 

CAATCGCTGT GGCAAAGGCC TGGAAATCAA CCAAGGCGTA GTACTTAATG TCCAAGTCAA 7 380 

AATTATCTTT CAAGACTTGG CGAACCATTT CTGCCCCTTT TTGCCCCTCT TGTTCTCCTA 7440 

ACTCGTAGGC TACGTTTAAC TTGTTATCTG TCTGTTTTCT ACCATTAATC ACTTGACTAT 7500 

AACCATCTAT ATAGACCAAA TTATCACGCA TGAAACTGAC TAGCTTCATT TTCTTATCTG 7560 

AGCCCCCGAC ATTTAATACC ATAATAGAGT CAGTTCGTGT CTCAACACTG TTCTGGCCGA 7620 

TTCGACCATC AGTACCCATG ATTAAAATAT TAACTCCATC TCTAGTGTCC TGACCATTAA 7680 

AGACTTCTAC TTGAGCTGCC CGGGCATCAG CAGTTTTCTT TGCGCTAGCA TCTTGGTAAC 7740 

CACGCAAAAA CATGAATACC ATGGCCAAAG CCACACAGAC CAAAAGTGAA AAAATCACCA 7800 

TAAAAATTCG TTTAAGACGG AGCTTCCGTC riTI IT rT T T TGGAGGGAAA GAGAGTGCTT 7860 
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GTGATTTGGA 


TTGTGAGCGA 


CTCCGGTTCG 


CATAGCTTCG 


TAAGTCAACC 


TGCTCTTCTC 


7920 


TTTCTTGTTC 


CAAGCTAGAG 


CTACTATTTC 


CCCTAGCAAG 


AGTTAGCTTT 


TCTTGCAAAT 


7980 


AGGCAAACTC 


ATTTTTTTCT 


CTCTCATTGA 


GATAGTGAAT 


ATTTTTTAGC 


AAATAATCAT 


8040 


AACGCAACTG 


CTCATGATGA 


CTTAAGGGAT 


TTTCTTTACT 


CATCTTCTCT 


CCTTTCCATG 


8100 


GTCTGATATT 


GGATAAATAG 


GATAGGCACC 


CAGAATTTTA 


TACTGGATTC 


CAATCGCTTC 


8160 


TAATTCTTTT 


TGGGCAAAGT 


GGACCAAGTC 


CTTATCGGTA 


TAATCCACAT 


CGATAATGAA 


8220 


AAAGTATTCA 


CCCAGTGCTG 


TCTTGAGTGG 


ACAACTTTCA 


ATTTTTGTCA 


AGTCAATTCC 


8280 


TCCCCAAGCA 


AAGGTCGACA 


GGGCCTTATA 


AAGTGCACCT 


GGAAGGTTGT 


CAGGTAATGT 


8340 


CAAGGCCAAA 


CTCATCTTTT 


CAGTTTGTGC 


TTGCAAGGGA 


ATACTAGGCT 


TTTCAGCTCC 


8400 


TAGAACCCAG 


AAACGTGTGA 


AATTGGCTTC 


CATTTCCTGA 


ATATCCTCGG 


CAATCAGTTC 


8460 


CAATCCATAT 


TCTTCAGCAG 


AACTTCTAGG 


TGCAACTGCT 


GCAAAGGGCT 


GGTCTGGATG 


8520 


TTCGGAAATA 


AAACGGGCCG 


CATAAGCTGT 


ACTAGCTGTT 


ACCTCGATTT 


GAGCCTCTGG 


8580 


ATATTGTTCA 


TCGATGAATT 


TCTTTCCTTG 


AGCCAAGGCC 


TGTGGATGTG 


AAAAAATCTT 


8640 


TTCAATCTTA 


GTATGGCCTG 


GAACCACCAT 


CAACTGCTGA 


TGAATAGCCT 


GAACGATTTC 


8700 


TGCTACTGCT 


TGGATGTGAG 


CCTGATGAAA 


AAGATACTCC 


AAGGTTTCAT 


GAACACTACC 


8760 


CTCAATAGAA 


TTTTCAACTG 


CCACCACAGA 


ATAGTTCACT 


AATCCTTGCT 


CATAAGCCTT 


8620 


GATGACATCT 


GTAATGTTGG 


CAAAAGCCTG 


CAATTCCTCA 


TGAGGAAAAG 


CTGTCTGCAC 


8880 


AACGTGGTGT 


GAAAATGATC 


CCTTGGGACC 


TAGATAAGCA 


ATTTTCATCT 


TAGTTCCTCT 


8940 


ATAATTTCCT 


CTGGGCTTAG 


CTTGGTCACA 


TCCAAAACCC 


GACTAGCCAC 


TTCCTCATAC 


9000 


CAAGCCTGTC 


TTTCTTGGAA 


AATAGCTACT 


AGTTCTTCCT 


TGCTATTATT 


TAGAAAAAGC 


9060 


GGTCGCTGAT 


TGTCCTTATC 


AGCTGCGATA 


CGTTGGTAGA 


GGGTTTCAAA 


ATCTGCTCTC 


9120 


AGGTAGATGT 


TATCTGTATT 


AGTCTTGAGT 


AAGTCACGAT 


TTCTCTGAGA 


AATAACCACT 


9180 


CCTCCTCCAG 


TTGACACGAC 


TTGGTCTGTT 


TCTAGTAAAT 


CAGCTAGGAC 


TTCTGATTCT 


9240 


ACCTGACGAA 


AGGCTGTTTC 


TCCCTTTTCA 


GCGAAAAAAT 


TCGCAATGGA 


CATACCTAGG 


9300 


CGATTCTCAA 


TCAGAGCATC 


CATATCAAGG 


TAATTAGGGT 


CCAAGCCTCT 


TGCAATAGTC 


9360 


GATTTTCCAG 


CCCCCATAAA 


CCCTAATAAC 


ACCTTAGCCA 


TGAATCAAGC 


TCTCCAAATC 


9420 


ATCAAAGAAA 


CTAGGATAGC 


TGGTATTGAT 


GGCTTCTGCA 


CGGTCAAGCT 


CCACCTCTCC 


9480 


ATCTGCAACC 


AAGAGGGCTG 


CGATAGCTGT 


CATCATGCCG 


ATACGGTGGT 


CACCAAACGT 


9540 


ATTGACTCTA 


GCACCGTGAA 


GAGCTGATTT 


TCCTTTGATA 


ATCATCCCAT 


CTGCCGTAGG 


9600 
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AGTAATATCT 


GCTCCCATAC 


TATTTAAGGC 


GTCTGCCACA 


ACCTGAATAC 


GGTCTGTTTC 


9660 


CTTGACCTTG 


AGCTCCTCAG 


CATCCTTGAT 


AACTGTTACA 


CCTTGGGCTT 


GGGTCGCAAG 


9720 


CAGGGCAATA 


ATGGGCAATT 


CATCAATCAA 


TCGTGGAATC 


AAAGCGCCAC 


CAATCTCTGT 


9780 


TCCTTTCAAG 


TCAGAAGACT 


CAACAATCAA 


GGTAGCAGAT 


TTAGCGACTG 


GATCGATTTC 


9840 


AGTTATTTCC 


AATTTTCCAC 


CCATGGCACG 


AATGACATCA 


ATAATACCGG 


TGCGAGTTTC 


9900 


GTTGATCCCC 


ACATTCTGCA 


GCACTAGACG 


AGAATTTGGA 


GCAATCAAAC 


CTGCGACTAA 


9960 


CCAAAAGGCT 


GCACTGGAAA 


TATCTCCTGG 


TACGACCACC 


TTCTGTCCTG 


TCAATTTTTG 


10020 


TGGCCCCTGG 


ACTGTGATTT 


TCTTACCATC 


CACACTTAAA 


TGACCACCAA 


ATTGTTTCAA 


10080 


CATATCTTCA 


GTATGATTAC 


GGGTGTACTC 


TTTTTCGATA 


ATAACTGACT 


CCCCCTTAGC 


10140 


TTGTAAGGCT 


GCAAACATCA 


AGGCTGACTT 


GACTTGGGCA 


GAGGCAATTG 


GCAACTCATA 


10200 


ATGAATAGGT 


CTTAGGTTTT 


TCGTCCCTTT 


TAAGCGAAGG 


GGAGGCAAGT 


CTCGTTCAGT 


10260 


TTGCCCTGAA 


ATGCTGACGC 


CCATTTTTTT 


CAGTGGAAGG 


GTCACACGGT 


CCATAGGACG 


10320 


TTTGGAAAGA 


CTATCATCTC 


CAAACATCTC 


TACTTCGAAA 


TCTGCACCAG 


CAAGGACACC 


10380 


TGAAATCAGG 


CGAATCGAGG 


TGCCAGAATT 


TCCCATATTA 


AGGGCATTTT 


GTGGCGCTTT 


10440 


TAAGCCAGCC 


ATGCCTACAC 


CTTGAATGGT 


AATAACCCCA 


TCTTTATCCT 


CAATTTCAAC 


10500 


ACCAAGGTCA 


CGAAAAACCT 


GCATGGTCGA 


AAGAACGTCT 


TCACCTCGCA 


GAATATCATA 


10560 


AACCTTGGTC 


TCACCCTCAG 


CCAAACTTCC 


AAAGATAATG 


GAACGGTGGC 


PGATAGACTT 


10620 


GTCACCTGGG 


ACGCGGATAC 


TACCATGTAA 


ATGGCGAATG 


TTTGTTTTTA 


GTTTCATACT 


10680 


GGACCTCATA 


CTTGCAATAC 


TTTTACCTAT 


TTTATCATAA 


AAAGCCAGAA 


ATTCCTTAAA 


10740 


AATTCCTGAC 


TTTAGGATCG 


TTCTTTTCTT 


ATTTCAGCAA 


TTCTGAAACT 


GGTTCAAAAA 


10800 


CAATTTTPTC 


AATATCAGAA 


AGGTAAATGG 


CCAATTGTTG 


TTGCTTGGTA 


AAGAATTCTG 


10860 


ACAAGAGGCT 


ATTTCCTTGA 


ATCTGTTTAC 


CAAAGCCTTC 


CATCTTAGCT 


TGGAAGGACG 


10920 


CATCTGGCAT 


TTGACCTGTC 


TGTGCTAGTT 


TTTGAATTTC 


CTCTTGAAAG 


GCAAGATAAT 


10980 


CTGTAAAGAT 


TTTGCTTGCC 


TCAGCATCTG 


CTGCAATCGC 


ATCTTTAGCT 


GCTTTAACAG 


11040 


CCTTGTATTC 


TGGTAATCCG 


CGTAGACCGC 


GACTGAGTTC 


GTTTGCACTA 


TCGTAAATAT 


11100 


TTGACATGTT 


CTTCTCCTTA 


TTTGATGACG 


ACTGTATAGT 


CAGTATTTTC 


TGTTATGAGA 


11160 


TGCTCAGCTC 


TTTCCAAGTC 


TTGAGCATTT 


TTAAATGAAA 


TTTGTAGGAT 


TCCGTGAATA 


11220 


TCCTCACGAT 


TTTCCTCGTT 


GATGTGGATA 


TTAACCAAGG 


AAGTTCCACG 


TAGCAGTTCC 


11280 


AAAATCCGCA 


GGATGACATC 


TTCTTCATCA 


GGAACGTCAA 


CATAGAGGTC 


GTAAGAGCTA 


11340 


TCCACACCAC 


CACGCTTATG 


GATTTCCATG 


GTCTGGCGTT 


GTTCACGCGC 


TTGGTTAAAA 


11400 
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AACTTCCAAA TTTGCTCTTC ATCTCCCTTA CTAATGGCCT GACCAATCGC TTCCAAACGT 11460 

TCCTTGAAAT CCTCAATTCT ATCCAGAATG ATCTCGCTAT TGGACAAGAG AATGGAGGTC 11520 

CACATTCCTG GCTCGCTTTC CGCAATTCGG GTCATATCTC GAAAACCACC TGCCGCAAAG 115 BO 

CGCCTTGCCA TCTCATGCTC TTGAGCATAG ACCGCAGTCT GCTCCATGAG ACTAGAAGCC 11640 

AAAATATGAG GAAAATGGCT AATCTGAGAA GTGACACGAT CATGCTCCTT GGCATCAATC 11700 

TCGATAAAAC GAGCATGAAG ACCTGAAAGC AGATCCTTCA TTTCCTTAAG CGTGTCCTGA 11760 

CTTGTCAGGC TTGAAGGTGT AAAGATATAA TAGGCATTTT CAAAAAGATT GACATCTGCC 11820 

GAAGCAGCCC CTGTCTTGTG ACTACCAGCC ATGGGATGGG CCCCGACAAA GCGAACAGAC 11880 

TTGCCAGCCA AATACTGCTC CGCCGCATCC ACAATGGTTG ACTTGGTCGA ACCAGCATCT 11940 

GAAATAATAA CGCCTTCTCG CAAATCCAAA TTGGCCAACT CCTTAATGAA AGCAATAGTT 12O00 

TGTTTGATTG CCAAGCTGAG GATAATGACA TCTGCCAAAG GAGCAAAACT AGCAAAATCA 12060 

TCCGTTGCAC GGTCAATCAT ACCTTCTTTC AAGGCGATAT CTCTCGAAGC TTGACTACGA 12120 

TTATAACCTA AAATTTCATA ATCTGGATGA TCGCGTTTGA TACCAAGTGC CATAGAGGCT 12180 

CCAATCAACC CAAGACCTGC GATATAGATT GTTTTTGCCA TAGGAACTCC TTAATAGTTC 12240 

TTTGTATAGT CTCGGTGTTT GGCTACCGCT TCTTTTAGTT CCTCAAGATT ATCTGATGAG 12300 

AATTTTTCGA GGATTTCTTG CGCCAGAACC GTTGCTACAA CTGCTTCCAT CACCATTCCT 12360 

GCAGCTGGAA GAGCAGTCGG ATCACTTCTC TCCACGGTTG CCTTGTAAGG TTCCTGGGTT 12420 

TCGATATCCA CACTCATAAG AGGTTTATAA AGAGTAGGAA TGGGTTTCAT GACCCCACGA 12480 

ACAACGATGG GTTGCCCATT AGTCATACCA CCTTCAAAAC CACCTAGATT ATTGGTACGG 12540 

CGAGTATAAC CGTCTTCTTT AGACCAGAGA ATTTCATCCA TAACTTGGCT GCCTTTACGA 12600 

TAACCAGCCT CAAAGCCAAG ACCAAATTCC ACCCCTTTAA AGGCATTCAT AGAGACAACA 12660 

GCTTGAGCCA ATCTTGCATC CAATTTTCTA TCCCATTGGA CATAGGAACC AAGACCAACT 12720 

GGAACGCCTC CGACGACTGT CTCCACAACC CCACCGATCG TATCACCATC ACGTTTGATT 12780 

TGGTCAATAT AGTCCTTGAT TTCCTGTTCT CGTTCTTGGT TGACAATAGA AACTTCAGAC 12840 

TGGGCAGCTC TTTGCTTAAT TTCAGCGACT GTCAGATTTT CAGGAACATC GATTTCCTTG 12900 

CCACCAAAGA CCACGACATG GTTGGCAATC TCCATATCCA GCTCAGCCAA GAGGCGTTTG 12960 

GCTACTGCAC CAACTGCCAC CCGCATGGTG GTTTCACGAG CTGATGAACG CTCCAAAGAA 13020 

TTTCGCAAAT CATCAAAACG GTACTTAATC CCCCCAACCA AATCGGCATG ACCTGGGCGA 13080 

GGATGAGTAA TTTTCCGCTT GCTTTTAAGG CGGTCTTCAA TGTCCTCCGC AGACATGATG 13140 
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TCCAGCCATT TCTGGTGGTC CTTATTGATG ACATCCATAG TAATAGGCGC CCCTGTCGTC 13200 

TTCCCGTGGC GAACGCCCGA AGTAAAGACA ACCTGGTCAT TCTCAATCTT CATACGACCA 13260 

CCACGACCGT AGCCACCCTG ACGGCGTCTA AGGTCCTCAT TGATATCCTC AGCTGTCAAT 13320 

GGAAGTCCAG CTGGAATTCC CTCAATAATA GCTGTTAGAC GGGGGCCGTG TCATTCTCCT 13380 

GCAGTTAAAT ATCTCATACA CTCTCCTTAT TTTACCAAGT AGTCTTTCAT CTCTTCCAGA 13440 

GAAACTGGGT GAATGGTCGC TGAACCAAGC TCTGGCACCA AGACCAATTT CAAGGTGTTA 13500 

CCACGCGCTT TCTTGTCATG AGTAAGAGCC TGATAAAGCT TGCCAACTTC CCAATTTTCA 13560 

TAGTCAACAG GCAAACCGAA TTTCTGACAC ATCTCTGTGA TAGATTGGGT AATGCCAGCT 13620 

GGCATGAGGC CTTTTTCCTC AGCAACCTTG GAAATCTGTA CCATTCCCAT GGCAACAGCC 13680 

TCTCCATGCA TGACCTTGCC ATAACCGGCA GTCGCTTCGA TGGCATGGCC AATAGTGTGG 13740 

CCAAAATTGA GGTAAAGACG AATACCATTG TCCAACTCAT CTTCAACCAC CATCTTGCGC 13800 

TTCACCTGAC AAGAATGTTC AATCAAGGTC TCTGCATGTT CCAAAATACT CTCAACAGAA 13860 

CCATTCAGTC CCGTCAAGAG AGCCCACAGT TCTGGATCCT CAATCAAGCC ATACTTGATA 13920 

ACTTCACCCA TCCCTTCAAT CAACTCTCTT TTTCCGAGGG TTTCAAGAAC AAGTGGATCA 13980 

ATCAGAACCC CATCTGGTTG GGCAAAGGTC CCCACCATAT TTTTAGCAAA TGGTGTATTA 14040 

ACGCCTGTCT TTCCACCGAT AGAAGAATCA ACCTGAGCTG TCAAACTAGT CGGAATCTGA 14100 

ACAAAGTGAA TACCCCGCAT ATAGGTAGAG GCTACAAATC CAGCCAGGTC CCCAACAACG 14160 

CCACCACCAA GAGCAACGAT TCCATCGCTA CGAGTCAGAC CTTGCTTGAC TAGAAATTCA 14220 

TAGACTTTCT GAACAGTAGT TAAATTCTTT CTTTCTTCAC CTTCTAAGAA ATCAAAAACA 14280 

GCTACCTGAA AACCAGCATC TTCTAGGCTG AGCTTGACCT TCTCTGCATA GAGAGAGGCT 14 340 

ACATGGTTAT CTGTCACAAT GACTACCTTT TGCGGTTGCC AGAGTTCTCG CAACCACTGA 14400 

CCAGCCTGGG CCATACAACC TTTTTCAATC TGAATATCAT AAGGATGGTG AGGAATATCG 14460 

ATTCTGATTT TCATAGGAGA GTCTCCCTTT CTTTATTGGT ATTTTTCTGT TAAAGACTGC 14520 

CAAATCTCTT CTGTCGGCAT TTCCTTGCCT GTCCACAGTT GAAAAGCTTC TGCAGCTTGA 14 580 

TAGAGTAACA TTCCCAGACC ATTGACTGCT GGATTGCCCT GACTTCTAGC CCATTTCAAA 14640 

AACGGTGTTT CAAAGGGTTG GTATATGATA TCTGCAACTA AAAGAGTTTC TGGTAAGACT 14700 

ATGTTTTCAG GAACAGGAGA GGATTGGCCA TCCATGCCCA CACTGGTGGC ATTAACTAGC 14760 

AAATCCGACT CGGCAATCCT TGCTTGCAGT TCAGAAACAT ATTCTAAAGC ACACAAATCC 14820 

ACTTTAAAAC CTGTCTGCTC CTGTAACTTG TCTAGGTAAG GTCTTGTTTT TTCCATAGAA 14880 

ACGGAACGAA CAAAGACCGA AATCTGACTG ACGCCATCCA AAATAGCCTG TGCCAAGATT 14940 
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GATTTAGCCG CACCACCTGC ACCCAGCAGG GTCATCTTTT TACCTGAAAT TGTAAAAGAA 15000 

GGCAAGCACT TAAAAAATCC CTTGCCATCT GTATTATATC CAATTAAATT GCCATTCTCA 15060 

TTGACAACCG TATTAACCGC ACCAATCAAG CGCCCTTCAT CGCTCAGCTT ATCCAAATAA 15120 

GGAATCACCT GCTCCTTATA GGGCATGGAC AGATTGATCC CAAACATCTG GTAGCGACCA 15180 

ATATTGGCCA CTGTTTCTAC CAAGTCACTC GCTTCAATCT CCCAAGCCAC ATAAGCACCG 15240 

TTGGTAGCTG TCGCCTCAAA GGCTCTATTG TGGATGAAGG GAGAAATAGA ATGCTTAATA 15300 

GGATTGGCAA CAACTGCAGC TAAACGTGTA TAGCCATCAA GCTTCATCCA AAATCTCCCT 15360 

GATTTTTTTC ATCCTACCTA GAGAAATCTG CCCAGGGGCA CTAACCTCAT CCAGACTGGC 15420 

AAAAGACCAA CTCGAACCAG TCACATCCGC AGTGATACGA GAGACCTTGC CCACCTTACC 154 SO 

CATAGAAATG GTCACATATT CCTGTTCAGG ATTGAGGGTT TTAAAGCCTC GTGTATAGTT 15540 

CATCAAGTCT AAGACATCCT GCTCCGTGTG AGCCATCACC GCAACCTTAA CAAGTTTTGG 15600 

ATTTAGGATC GTCAACTCTG ACAAGATTTC CATCATGTTC TCAGGTGTTT CTTGGAAATT 15660 

ATGGTAACTC AAAACAAGAT TTGGGAAGTC CACCATTTCC TCAAAAACAT CCTTGTAGCT 15720 

ATAGTACTCA AAATCAATAT AGTCTGGTTG ATAGAGTTCC GCAACTTCCT TGATTAGATG 15780 

GATATACTCT TCTGGAGAAA GGTCGATTTC TCCACCTTCG GAGCCAGTTC GTAGCGTGAA 15840 

AACCAACTCA CGGCCTGCGA ATTTTTCAAA AATGGCTGGA GCTACCTGCA AAATCGCTTC 15900 

TTTAGGCAGA TAGTCGGCAC GCCATTCAAT GATGTCGGCA TCCAGCTACC TCGTGGCATC 15960 

CAGAGCCTGA GCCTCCTCTA AACTTCTTGG CATTACTGAA ACGATTAATT TCATTTACTA 16020 

ACCTTCATAC TAATCACCTT GAGGTAATTA CTACTTTCAT CTTTTTTATT ATAGGCAAAA 16080 

TCTGCTGGAA GACCATATTT GTTTAAAATC TGGTAACTTC TTCCTGCAAA ACCTTTATCA 16140 

ATTTGTTCTG TAAATTTCTG ACGGGAAACA TTGGCAGCAT TGGTACTGGC AATGATAATC 16200 

CCTCCCGGAT TTAAAATCTC AAGACTCTGG GAAATCAACT TGTGATAATC CTTGGCCACA 16260 

GAGAAAGTTT GTTTTTTATT CCGAGCAAAG CTACGCGGAT CTAGGACAAT CACATCGTAG 16320 

GTCAAGTCTT TCCGTTTGGC ATATTTGAAA TACTCAAAGA CATCCATGAC TATAAAACGA 16380 

TGCTCGTCTG TGCTGAGCCC ATTTGCCTGA AAATGCGCTT GAGACAATTC TCGTGAACGT 16440 

TTGGCTAGAT CAACAGAAGT TGTATGGCTA GCTCCTCCCA TGGCCGCAGC TACTGAAAAA 16500 

GCCGCTGTGT AGGAAAACAT ATTGAGTAAG GATTTACCCA TAGCCAAGCC GTCAACTAAA 16560 

CTACCGCGAA CCTCATGCTG GTCTAGGAAA ATTCCTGTCA TCAAGCCATC ATTCATAAAG 16620 

ACTTGATACA GGACACCATT TTCTAAAACA TTGAAAAAGT CAGGTGCTTC TTGACCATAA 16680 
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ACATGGGCAG ATTCATAGTC CAAACCCTTA AAGCGGATTT TCTCATAAGC TCCTAAAACC 16740 

TCAGGGAAAA CCTGTCTAAA GGCTTCTGAT ATAGTCTGAC GAATCTGATA AACATAAGAG 16800 

TTATACCAAG AAAAGACGGC GTAGTCGCCA TAAAGGTCCA CTGTCAGACC CCCAAAGCCA 16860 

TCTCCCTCTT GATTAAAGAG ACGAAAGGCA GTTGTCAAAT CATCTTGATA GTAGGCGTTT 16920 

CTCTTTTCTT TGGCTTTTCT AAACAACGTT TCAAAGAAAG CTTGATTGAA GGCCACCTTG 16980 

TCTTTGCTGA TAAACCAGCC CAAGCCCTTG TTTTGCTGAG AAAGGTAGGC AGTCCCAAGA 17040 

AAGTTTCCTT CCTGACCCTG CACCTCTACT TCCTGATCCT TAAGATTGAC ATTCTCAAGA 17100 

TCACTGGCTT CTAGTAAAAC TAGCCCCTTA GCAAGCTTCT TTTCAA CCCT TTTGCTGACT 17160 

CTTATTCTAT TCATAACTAC CATTATATCA AACTTTTAGA CAATTCTCAA AAAAGAAACT 17220 

ACCCTTGCTT TTTTACTCTT CTTTTAAAAA ATGGTATACT AGACTTCCTG CAAAACTAGG 17280 

AAGTAAATGT GTAAGAATCA CAGTAAAAAA TGCTCTTCCC TCTTCGAGGA GCATTTCTTT 17340 

TTATCAACGA AAATCAAATA GCAAACTATG AAACTAGCCT CAGGTTAACT GTGAGATTAT 17400 

AGGTACAGAG GTTGTATCAG CAATATGTGT CTGTCAAATT TAGTGACAAA GGTAGTAGAA 17460 

GAAAGATAAA GAAATAAATC ACCTTCACTA CGTATCTCGA AAATTTGATT TTATAGAGAA 17520 

GCCTTTTGTT ACAAACTCAA TATACTATCA ATAAATAATA TTATAGAAGC AACAATAATT 17580 

ATAATTTCAC CTATCTGCAT CATTCTATTT CGAACTCTAA ATATATCTTC TATCAAAAAT 17640 

ACTTGGAACA CACACATTAT AGGAATTAAC GTTTTTGAAA TTGAAAAATA TCCAAATAAA 17700 

TAAACTATAA ACAACAAAAA TAGAACTATG TTATATTTCT TATTCAAAAC ATTCCTCCCT 17760 

ATATATTTTT GATTACCAAT CTTAATCATT TACAACTACA TTCTAACAAA CTATAAAAGC 17820 

GTTTGTCGAA TTGAATTTAT CAAGCAAGCG ACCAACCAGT TCATCTTTTT TCTATTTCTG 17880 

CCAATATGCG TGACAGGTAA TAATGATAGC CAAAAATAGC AAGAGCAAGC AAGACGATAA 17940 

GAGCTCCTAC TCCCAAGCTG ATGGCAAGGA TAGGGGAGAG AGACTGAACC AAGAATATGC 18000 

TCCCAATTAC AAGGGCCATC AGGATTGCAC TATAAATAAA CAATAAAACT ATGGCGAC TA 18060 

TGCCATTTGA ACGATTCACC AGGTCCGTAA TGCTACTCCA ATTGGTTGAC AGATTTTTAA iB120 

CGTCCTTAAA GTAATGGTGG CAAGAAAGGA TGACACTGGC AATGATCCAG ACTACAAGAA 18180 

GGTAAATCAT CGAAATGATG GGCAAGCCTA GATATAGAGA AAGAGCAAGC AAAGTCAGAA 18240 

CTGGTAAAAA GGACTGGACA GCATATATAA TCCAAAATTT CACTTTCACA TAACGAGCAA 18300 

AGTCAAAGGG TAAACTCTTA AGAAAATCAA CATTTTCCCT CTCCAAGGAC AAGGCAATTG 18360 

AATGCAGGCT GGTGATATTG TTATTGACAA CTGCTATAAA GAGAGCTATA AAAAACAAGG 18420 

GTAACCAGTA TGGAGGATGA ATGTCTGGAA CTATCTGAGA ATCTCGGATT TTGGAAATCA 18480 
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GACCGATCAT CATGAGATAA GGAAGGAAAG CACTTGTAAA AAGCACTGTA ATCACGCCAG 18540 

TCCCCTGTCC CAAGAGGGTG AGGTGGTAGC GTAAAACCAT GCGAAAAAAT CCCTTTTTAG 18600 

TGGTTGAAAT TCTCTCCTTG CTGCGACGTT CTTTTTTGAC CTTCTCCTCA CTATTAAGCA 18660 

GGATCACGTC ATAAAAACGA GGAAGGACCT TCTTTTTGGT CAGATAAAGC AGGAAGAGAG 18720 

TTAGTCCTAT CCAAGCGAGC AGACCCACTA AGGCTTCTGT CGAAAAAGGC TCCACTGCTA 18780 

TTTTGTAAAA GATATGAAGA GGATAAAGGA GAAATGGAAT GTCTCTAACT TTGTCAACAA 18840 

TACTTCCAAA AGTCGACTGA AGAAAGAAGA TAAATATTAA AGGTATGAGA ACTCCTATCC 18900 

CAATCATCAC ATTCGAAAAA ATAGACTGAT ACTTTCTGAA GACCCTAGTT TGAGCCAAGA 18960 

AATGCACTGC CACTACCATC ACTAGAGCCA CAGAGACAAA TAATAAGGTC AAGGACAGTA 19020 

GCATCAAAGG CAAACCCAGC CATAGAGAAG GAGCTAGCCT AATGTAGAGG ACCAGAAAAT 19080 

AAGCTAGGAT TGGTACAATT CCAGTTAGAG CTGGCAAAAG GACAGACAGT CCTTTAGCAA 19140 

TTATAATCTC TGATTCTTTA AAGGCATAGG GCCTATACGA TACCAAATCC TTACTCTCAT 19200 

AAAAGACATT GTAAAAGCCC GTTAAAGAAG TTGAAAAGGC AATCACTAGT AAAATAGCAA 19260 

TCATCGAGCT AAAATAAATA GGTATTTCCT CAAAAGGAAA ATGAATGGCT ATATTACTAA 19320 

AACAGATGAT CATCAAGAGA CTGGAAAAAA TGTAAGAACT TAAGACTCTA GCGGAAACAT 19380 

TTACTTTTTT 19390 
(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
ID) TOPOLOGY : linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

CCGAGCGTCG TTACAGACTT TATCAAGATT GGACGCAAGA AGAAATTCAA CATATAAAGG 60 

AAAATATGGC ACAATCTCCA TGGCATACTC ATTACCATGT TGAGCCAAAA ACAGGACTTC 120 

TCAACGACCC AAATGGCTTT TCTTACTTTG ATGGCAAGTG GATCCTCTTT TACCAGAATT 180 

TTCCTTTTGG TGCAGCCCAC GGTTTAAAAT CTTGGGCACA GCTAGAAAGT GATGATTTGA 240 

TTCACTTTAA AGAAACTGGA ATCAAAGTTT TACCAGATAC TCCATTAGAT AGCCACGGTG 300 

CCTACTCTGG TTCTGCCATG CAATTTGGCG ATAACTTATT CCTATTTTAT ACAGGAAATG 360 

TTCGCGATAA AAACTGGATC CGTCACCCAT ACCAGATCGG TCCTTTGATG GACAAGGAGG 420 
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GTAAGATTAC AAAGATTGAC AAGATCTTGA TTGACCAGCC ACCAGACTCT ACTGACCACT 480 

TCCGCGATCC ACAAATTTTT AACTTTCAGG GTCAATATTA TGCCATTCTC CGCGGACAAG 540 

ACTTGGAGAA AAAAGGTTTC GTTCGTCTCT ACAAGGCTGT CAATAACGAC TACACAAACT 600 

GGCAAGCAGT TGGCGACCTT GACTTTGCTA ACGACCGTAC TGCCTACATG ATGGAATGTC 660 

CTAATTTGGT CTTTGTAGAG GAACAACCTG TCCTTCTCTA CTGTCCACAA GGATTGGATA 720 

AGAAAGTTCT AGACTACGAT AATATCTTTC CAAATATGTA TAAGATCGGG GCTTCCTTTG 780 

ACCCTAAAAA TGCCAAAATG GTAGATGTGT CTCAACTTCA AAACATGGAT TACGGTTTCG 840 

AAGCCTATGC AACTCAAGCC TTCAACGCTC CTGATGGGCG TCCTCTAGCA GTTAGCTGGC 900 

TTGGTTTGCC AGATGTTTCT TACCCATCTG ACCGTTTTGA CCACCAAGGA ACCTTCTCTT 960 

TGGTCAAGGA ACTCACTATC AAAGACGACA AGCTCTACCA GTATCCAGTC GCTGCTATTA 1020 

AGGACCTTCG TGCTTCTGAA GAAGCCTTCT CAAACCGTTC CCAAACCAAG AACACTTACG 1080 

AACTTGAACT CAACTTGGAA GCTAATAGCC AGAGCGAGAT TGTCTTACTT GCTGATAAAG 1140 

AAGGTAAGGC ACTTTCAATC AACTTTGACC TTGTAAACGG TCAAGTAACA GTGGATCGTA 1200 

GCCAGGCTGG AGAACAGTAT GCCCAAGAAT TTGGGACAAC TCGTTCTTGC CCTATCGAGA 1260 

ATCAGGCTAC TACTGCTACA ATCTTCATCG ATAACTCTGT CTTTGAAATT TTCATCAATA 1320 

AAGGAGAAAA AGTATTTTCT GGTCGTGTCT TCCCACATGC GGACCAAAAT GGTATCCTGA 1380 

TTAAATCTGG AAACCCAACT GGAACTTACT ATGAATTAGA TTATGGTCGC AAAACTAACT 1440 

GATGTCGCCA AACTTGCAGG CGTCAGTCCT ACTACCGTTT CTCGGGTTAT CAATAAAAAA 1500 

GGGTATCTAT CTGAGAAAAC CATCCAAAAA GTCAATGAAG CCATGCGAGA ATTGGGCTAT 1560 

AAACCCAACA ACCTGGCTCG TAGTCTGCAA GGAAAATCAG CTAAGTTAAT CGGCTTGATT 1620 

TTCCCCAATA TTTCCAATGT TTTCTATGCA GAATTGATTG ATAAATTGGA ACACCAACTC 1680 

TTCAAAAATG GTTACAAGAC CATCATCTGC AACAGTGAAC ATGATTCTGA GAAGGAACGC 1740 

GAATACATCG AAATGTTGGA AGCCAATCAG GTGGACGGCA TCATTTCTGG TAGTCACAAC 1800 

CTAGGAATCG AAGACTACAA TCCTGTGACA GCGCCGATTA TTTCCTTTGA CCGAAACCTA 1860 

TCGCCAGACA TCCCTGTCGT CTCCTCTGAC AACTATGCTG GTGGGGTTCT TGCTGCCCAA 1920 

ACCTTGGTCA AGACAGGTGC CCAGTCTATC ATCATGATTA CAGGGAATGA CAATTCTAAT 1980 

TCGCCAACCG GACTGCGCCA CGCTGGTTTT GCATCCGTAC TCCCAAAAGC TCCTATTATC 2040 

AATGTTTCCA GTGACTTTTC TCCCGTCAGA AAAGAAATGG AAATCAAGAA TATCTTGACC 2100 

CGGGAAAAAC CAGATGCCAT TTTTGCTTCG GATGATTTGA CAGCTATTCT GGTCATTAAA 2160 

ATCGCTCAAG AATTGGGCAT TTCTGTCCCA AAAGAGCTCA AGGTCATCGG CTATGATGGG 2220 
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ACCTACTTTA 


TCGAAAATTA 


CTACCCTCAA 


TTGGCTACTA 


TCAAGCAACC 


TTTGGAAGAG 


2280 


ATTGCTTGTC 


TCACTATTGA 


TCTTCTCTTG 


CAAAAGATTG 


AAGGCAAGGA 


AGTCGCCACA 


2340 


ACTGGTTACT 


TCTTACCAGT 


TACGCTATTA 


CCAGGAAAAA 


GTATTTAAAC 


ACAAGAAAAC 


2400 


TCAGACCGAT 


TCGTCTGAGT 


TTTTATGATC 


TTAAATTTTC 


GAGATAGCGC 


TGGGCTGTCT 


2460 


CTAGGTTAAA 


GGTTTTATCT 


GAGATGAGGC 


GCTCTACTAG 


GGGAGCAACT 


TCAGATTCAC 


2520 


TAGCCCCAGC 


TAGGAGAGCT 


AGGGATTTGG 


CCTGTAGTTT 


CATGTGGCCT 


TGCTGGATGC 


2580 


CCGTACTTAC 


CAAGGCTTTG 


AGGGCTGCAA 


AATTTTGAGC 


AAGACCGATG 


GACACGATAA 


2640 


TCTGGGCTAA 


TTCTCTGGCA 


GAAGGATTTC 


CTAGTAGATC 


ATGACTGAGA 


ACTACACGTG 


2700 


GGTTGAGGCC 


GATAGAGCCA 


CCCTTAGTCG 


CTACAGGCAT 


GGGCAGGCTC 


ATCTCACCCA 


2760 


CCAATTCTTC 


TCTTTCAAGG 


TCCAGCGTCC 


AGCAGCTAAG 


ACCTTGATAG 


CGTCCATCTC 


2820 


GACTGGCAAA 


GGCATGGGCC 


CCAGCTTCGA 


TGCCACGCCA 


GTCATTACCA 


GTGGCAATCA 


2880 


AAATCGCATC 


AATACCATTA 


AAAATTCCTT 


TATTATGAGT 


AGCAGCTCGG 


TAAGGATCAG 


2940 


CCTGCGCAAA 


CTGACTAGCC 


AACGCAATTT 


TCTCCGCAAT 


CTCTCGTCCT 


TGATCCTTTT 


3000 


GGCGGCTCAA 


GTAGCGAAAG 


GCGATGCGAC 


AGCTTGCAGT 


CACCAGAGAA 


TCGGTCGCGT 


3060 


AGTTGGACAG 


GATTCCCATG 


AGACTCTGTC 


CCTGACTGAG 


TTCTTCTAAG 


ACTGGTTTCA 


3120 


AGGCTTCCAG 


CATGGTGTTG 


AGCATATTGG 


CACCCATGGC 


TTCCTGGGTA 


TCGACATGAA 


3180 


TATAAACAAC 


GAGAAAGTCT 


GGTTCGCCTT 


TTATCTGCTC 


GACATGCAGA 


TCACGCGCCC 


3240 


CACCTCCACG 


TTTAACCATA 


GAAGGATAGG 


CTTGATTGGC 


AAGCTCCAAG 


AGCTCCGCTT 


3300 


TCTTGCTGGC 


AATCTTCTCT 


TGCGCTAGTT 


TAGGATTAGC 


AACTTGATAA 


AGGGCTACCT 


3360 


GCCCAATCAT 


CTGTCGCTGA 


TGGACTTGTG 


CAGTAAAACC 


ACCTGCACGC 


TTCATGATTT 


3420 


TGCTGGCATA 


GCTGGCCGCC 


GCAACCACAG 


AGGGTTCTTC 


TGTCACATAG 


GGAACGGTGT 


3480 


ATTCCTCACC 


GTTGACAAGT 


ACCTCCGGAA 


CCAGTGAATA 


AGGCAGAGAA 


AAAGTTCCCA 


3540 


CTACATTCTC 


ACTCAGCTGC 


TCTGCCACAG 


TCACCCTCAT 


CTGTTCATCC 


TTCTCCAGAC 


3600 


TAGCTTGTCT 


CTCAGGACTA 


AGGAGCGCCT 


GAGCTTTTAA 


CAGCTCGAGG 


CGCTCTTGGT 


3660 


ATGATTTTTT 


AGAAAATCCA 


TTCCAACTTA 


TCTTCATTAT 


TTTTCAACCT 


TGCTATAACG 


3720 


GCGTTGGTGG 


TCGAGAATTT 


CAACCAAGGC 


AAAATCTTGA 


TTTTCATAGC 


CAGCAAACTG 


3780 


GGCAGAGTTA 


GTTTCATCCA 


AGTTTACTTC 


CTCAAAAAAG 


ACCTTTTCAT 


AGTCTGCAAC 


3840 


GGATAGGGCA 


GTTCGTTGGT 


TGAGCTTGTT 


CAAACGGTCT 


TTATCCAAAT 


AAGCTTCATA 


3900 


TCCTTCAACC 


AATTCACCAC 


TGAAGAACTC 


AGCCACAGCT 


CCACTTCCGT 


AACTATAAAG 


3960 
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GGCGATTTTA TCCCCAGCTT TCAAGCTATC TCTATTTTCC AAGAGAGACA AAAGTCCAAG 4020 

GAAAAGTCAA CCTGTGTAGA TATTCCCCAC CTTTTGACTG TAGAGAATAG ACTGGTCAAA 4080 

ATGCTTTTGT AAGAGGTCTT TTTTCTCTTG AGGCAGGCTC TTATCCATGA TTTTTTTCAA 4140 

GCCTTTTAGC CCTAATTTAG GATAAGGCAA GTGGAAACAA ACAGCCGCAA AATCATCCAA 4200 

AGTAAGCTGG TAGCGTTTTT GATATTCAAG CCAAGTCGTT TTCAAACTAT CCAAGTATTG 4260 

TTGGGTAGAA TAGACACCAT TTACATAAGG AGTTGTCGAG TAATTTGGTC GCCAGAAATC 4320 

CATGATGTCA CGGGTCTGAG CTACATTGTC ATTATTAAAG GCCATCATGC GTGGATTTTG 4380 

TGTAATCAAC ATAGCTACAC TTCCAGCACC TTGAGTTGGT TCTCCTGGAG TTTCAATACC 4440 

GTATTTGGCA ATATCACTGG CAATGACCAA GACCTTGGAC TCCGGAGAAT TTTCCACATG 4500 

CAATTTGGCA TAATGGAGGG CAGCAGTCGC TCCGTAGCAG GCTTCTTTAA TCTCGAAACT 4560 

ACGAGCAAAG GGCTGGATGC CCAGCAAGCC ATGCACAAAG ACGGCCGCAG CCTTACTCTG 4620 

GTCAATTCCT GACTCGCTCG CCACAATGAC CATGTCAACT TCTTGTCTTT CTTGCTCAGT 4680 

TAAAATAGAG TCACTACCAC TGGCCGCCAA GGTCACGATA TCCTCAGTTA GGGGCGCAAT 4740 

ACTCAATTCC TTGAGTAAGA GTCCTTTACT TAATTTTTCA GGGTCAATTC CCCTCGCTTC 4800 

TGCTAAGTCT TGTAATTTCA AGACATATTG ACTGGTCGCA AAACCAATCT TATCAATACC 4860 

GATTGTCATA TTTACCTCTG TTTTATCATT CATGTAAAAA ATCGTTCTAT ACTATTTTAT 4920 

CACAAATGGC AGTAAAAGAG AGAAAAAAGA CTTGATTCAC CAAATCAAGC CTCTTATTGG 4980 

TCATCATTTT AAAGAATGAT TAGTTGCTAG AGAGTTCACC GATATAAGTA GCTTTATAAG 5040 

CTCCATTCAC ACTTATCAGC TCCTGGAGGA TCAAATTTCC TGAGTAAGTC CTTCCCATCT 5100 

CATCTACAAA TTTTTGATAA AACTGACTGG TCGGAATTTC TCTGACATCC TTATCAAATG 5160 

TCTTATCAAG TGTTTTACTA ACCTTCTCAG CAATCAATTG ATGCTCTTGC CATCCACTTT 5220 

GAAACTCTGA GCCCGAACTA GAAACCATGA CTGGGATAAA CAACAAGGTC ACTAGATTTA 5280 

CAGACAATAA GGAAAGTAGT AGACTTCCTG CAAAACTAGA ATCCTAGTTC ATGATTGATA 5340 

ATACCAGCAA TCAAATTCAT TCGTAATCCG AAGCGTTTAC GATGATTTCG ATAGGTTGTT 5400 

GAAAACATTT TAAACGTTTT TACTTTGGCA AAGATGTTCT CAACCTTGCT TCTCTCCTTA 5460 

GATAGCGCAT GCTTACAGGC TTTATCTTCA GCTGTTAGCG GCTTGAGTTT GCTGGATTTA 5520 

CGTGGAGTTT GTGCTTGAGG ATATATCTTC ATGAGCCCTT GATAATCACT GTCAGCCAAG 5580 

ATTTTACCAG CTTGTCCGAT ATTTCTGCAA CTCATTTTGA ACAACTTCAT ATCATGACTA 5640 

TAGTTCACAG CGATATCCAA AGAAACAATT CTCCCTTGAC TTGTGACAAT CGCTTGAGCC 5700 

TTCATAGCGT GAAATTTCTT TTTACCAGAA TCATTCGCTA ATTGTTTTTT AGGGCGATTG 5760 
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ATTTTTACTT CCGTCACATC AATCATTATC GTGTCCTCAA AGCTGAGAGG AGTTCTTGAA 5820 

ATCCTAACAC CACTTTGAAC AACAGTTACT TCAACCCATT GGCTCCGACG GATTAAGTTG 5880 

- CTTTCGTGGA TACCAAAATC AGCCGCAATT TCTTCATAAG TGCGGTATTC TCGCACATAT 5940 

AGAAAGCGTT ATCAATTTAT TTATCTCATT TTTCAGAAAA TTCTTTTATT TCTGTAAAGT 6000 

CT ACGAT ACT CGATGTCTTT TTATATAATG ATAGAGTCTG AGAATCACTG TTCCGCTAGC 6060 

CATTCCAATA GACATTACCA AAGCCAACAT GACAACCAAG GTCGCACTTG CCAGTGCTTT 6120 

ATTATAGTCC CCTGTCACAA AAAAGGCAGT TGTTCGGTAG GAGAGATAAC CTGGAACCAG 6180 

CCGTGCCAAA ATGGCCAAGA TAAAGACCAC AGCAGGTGTC TTATAAAGAA TACTTAAAAT 6240 

CTGGCTGACA CAAGAACCAA TAATGGCTGC AATGAAGGTA GCTACAATGA CATTGGTCGG 6300 

TTCCTTGAGC AAGAGATAGA TTAGCCAGAC AGTCATGCCC AAAATCCCTC CAGGTAAGAG 6360 

CATACACCGT TGCACATTGA GTACGATTAA AAAAGTGATA ATGGCAAGAA AACTTGCTAC 6420 

TGCTTGTAAT AAAAAGGTTG TTAGTGTCAT ATTAGTTCAT CAATACCAAG GCGACAGAAG 6480 

TTCCTGCCCC TAAAGCGAGG GTAATGAGCA GGGATTCAAA CATCTTACTC ATACCAGAGT 6540 

TTATGTGGTT GGTCATAATA TCACGGACCG CATTGGTCAA GGCAATACCT GGTACAAACG 6600 

GCATGACCCC ACCAGCTATA ATCAAATCTG CCGTTGAAGG AAAACCTGTG TAGCGAGCCC 6660 

AAAACTGGGC AATTATCCCA AAGACAAAGG CTCCAGCAAA GGCTGTCACA AAGGGAATTC 6720 

GGATAAATTT TTCCACATAG AGGGAAAAGG CAAAACCAAA TAAGGTCGCC ACTCCTGCCC 6780 

CAAGTGCGTC GTAGATATTT CCGCTAAACA TAACTGAAAA GAAAGGAGCA CTAAAGGTCG 6840 

CAGCCAGAGT TACCTGCAAC TTAGTATAGG GAACCGGTTT GGCTTGCAAG GCCGTCAATT 6900 

GCTTAAAGGC TGTTTCTAAG TCAATCTGCC CCCCAACTAG CTCACGAGAA ATCTGGTTCA 6960 

CATCGCAGAC TTTTTCGATG TTATAAGAAG AGGAGGTCAC GCGCTTCATG CGCAAATATT 7020 

GGTATTTTCA ATAGAGAAAA AGATAGCGCC AGGCATGGCA ACGACATTGC AATCCACAAT 7080 

CCCCTGCGAA TGCGCGATTC GAATCATGGT ATCTTCTACA CGATGGATTT CTGAGCCACT 7140 

TTTAAGGAGA ATAGTCCCCG CTAGCATAAT CACATCAATG ACGGCATTTA ATTCTTTTGA 7200 

TTCTTCCATG CTTTCCTCCT TTTATCAACT CCCTCTATTC TATCACAAAT CCGGACTCAA 7260 

AAAAAATCTT TGCCATGAAA TCATGACAAA GATTGATTAC TCATTTTGAT TATCCATCTG 7320 

CTTTTAAGGA GTAGCTGAAG TTGTTTTAGG TTTGTAGATT GAAATCTTGA CTCTAGTCTT 7380 

ATTGAGGTCT ACCTTTTCAC CPGCTCTAGG ACTTTGTTCA ACAACCATGC CTTCTGCACT 7440 

ACCTGCAGGC GCTGTCGTCA CTTCTACAAC TTCTATATTA GCTTCCTTAA TCCCAACAAT 7500 



WO 98/18931 PCT/US97/19588 



698 

TTGAATCAAA TTGTTCTTAG TAAACTCCAA GCTAGAACCA ATGTAACTCG GCATGGCAAC 7560 

ACTTGTAACT TTTTTAGCTA CTGTCAAGAC AATTTGAGTA GGTTTACTCA CATCATAAGT 7 620 

CGTTCCGGCA CCTGGACTTT GTTTCATAAT CGTTCCTGGT TCGCTTTCGC TGGACTCTTC 7680 

TTCCTCTATC TTAATCAAAT TCTCAGGAAC CTTCTTCTGC TTGAGTTCTG AGATTACTTC 7740 

TGTAGAGTTC CGTCCAATAT AGTTCCCTAA TTGAATCGTC GTTCCTTTTT TAGCTACTGT 7800 

CAAAACAATT TGAGTTGCCT TGCTCAAGTC ATAGGTCGTA CCTTCTGGTA GACTTTGCTT 7860 

CAGGACCGTT CCAGCCTCAC TCTCATTCGA CTCTTCTTCC TCAATTTTAA TCAAATTATC 7920 

TGGAACTTTT TTCTCTTTTA ATTCCGCAAT GACATCAGAG GATTTCCGAC CGACATAATT 7980 

ACTAATTTGG AAAGATTGCT TGCCTGATGA GACAACCAAA TTGATTTTCG TTCCTTCTTT 8040 

TCGACCAGTT CCAGCGCCAG GATCTGTACG GATAATCCGC CCTTCTTCCA CCTTTTCAGT 8100 

AGCCTCTGTC TTCTCCTCAC CAATCTCAAA ATTGGCTTTT TTGAGCGTTG CCTTGGCCTC 8160 

TGCAACTGTC TGACCTGCCA CATCTGGAAT GGCAATGGTT GCAGGAGTTC TGGATAGTAT 8220 

CCAAATAAGA GAAGCTGCCA CCAATACAAG GCTGGCCAAC AAAATCAGGT AACGCATCTT 8280 

AAATCTATGT TTTTTCGGTG CTTGTGGTTG GTAAGTTTCC TCTGTCACAG CCTGGCTTGG 8340 

GTTTTTGATT GATTTGTGTT CTGTTTGCGC TTGAACCTTA GGAATAGATG TCAAGGTACT 8400 

CTGAGAAACC TTCGGCAAGG TCTTGGTATC TGCCTTGCTC GTTTCATCAA AGATTAACTT 8460 

ACTTTCATTT CTACGATTGT AGGACAACCT ACTAGACAAG TCCACATACA TCTCTGAAAC 8520 

CGAGCGGTAG CGATTGGTCA ACTTTTTAGC AGTTGCCTTG ATAATAACAT TTTCTAAAGC 8580 

CTGAGGTACA GATGGATTTT CTGCAATAAC GGACGGCAGG GGTTTCTGGA AATGCTGGAG 8640 

GGCAATGGTC ACCGCGCTAT CCCCGTCATA AGGGATATGG CCTGTCAGCA TCTCATAGAA 8700 

AATAATCCCC ATGGCATAGA TATCACTCTG CACAGTCGCC TTCGAACCAC GCGCCTGCTC 8760 

TGGTGACAAG TAATGAACTG AGCCCAACAT CGAGTTAGTC TGGCTCAGAC TTGTCTCTGC 8820 

AAAGGCTACA GCAATCCCAA AGTCTGTGAC CTTGGCAGTC CCATCTGGTG TCAAGAGGAT 8880 

ATTTTGAGGT TTCAAGTCCC TGTGAACAAT TCCTCGAGTA TGGGCCAAGC GCATAGCCAA 8940 

GAGAATTTGT CCCATGATAC GGACTGCTTC TTCATTAGAA AGAGGATAAT GTTCCTTGAT 9000 

ATAGCGTTTG AGGTCCAGTC CAGCCACATA CTCCATAGCT AGGTACTGTT GACCGTCTTC 9060 

CTCGCCAATA TCTGTTATCC GAACGATATG AGGATGGTCT AGATCTGCCA TAGCTCTCGC 9120 

TTCACGCTGA AAACGAGCTA CAGCTATCGG GTCCGTCTGG TAGTTGGTCC TCAGAACCTT 9180 

CACTGCCACT TCTTCCCCAT CTAAGATTAA GTCTTTGGCT AGGTAGACAT CCGCCATACC 9240 

TCCTCGACCA ATCTGTTTGA CAATCCGATA GCGTCCGGCA AAAATCTTGC CGATTTGGAT 9300 
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CATTCTGCAT 


CCTCCTCGTT 


CATAGAAACA 


AGGGCAACCG 


TAATGTTGTC 


TAAACCTCCT 


9360 


GCATTGTTAG 


CAAAACGAAC 


AAGTGTCTCC 


GTTTTATCTG 


CTAAAGGAAT 


ATCACTGGTT 


9420 


ACAATATCAC 


GAATCTCACT 


GCCTGAAATC 


ATGTTGGTCA 


AGCCGTCACT 


ATTGAGCAAG 


9480 


AGATAGTCAC 


CTGACTCAAG 


GATAACTGTC 


CCAAAATCAG 


GCTGAATTTC 


ATCTTTTTGC 


9540 


CCAATAGACT 


GGGTGATAAT 


ATTTTTTTGC 


GGATGAGCTT 


CTCCCTCTTC 


TGGTGTCAAT 


9600 


TGACCAGCCT 


TGAGCAATTC 


ATTAACCAAG 


GAATGATCGC 


TCGTCAACTG 


ATGGTATTCT 


9660 


TCTCCACGAA 


TCAAGCCGAT 


ACGCGAATCA 


CCAATATGAG 


CATAGATAGC 


CTGATTATCA 


9720 


ATAATAGCAA 


GGACTTCCAA 


AGTAGTTCCC 


ATGCCTCTGT 


AAGCTTCATC 


CTGACCAAGC 


9780 


TGGTGAATCT 


TTTGATTTTC 


AATTTCTAGG TAATGGGCGA 


ACCATTCACG 


CACTTCATTG 


9840 


ACTGTATCGA 


TCTGGGTATC 


AACCCAAGCT 


ACACCCAGGT 


CTGTGACCGC 


CATTTCACTA 


9900 


GCGATATTCC 


CTGCCCGATG 


ACCTCCCATC 


CCATCAGCTA 


AAATAATCAT 


GGTACGTCCA 


9960 


GCTCTATTGA 


CATAGTGGTT 


GACATAGTCT 


TGGTTATTTG 


TTCGTTTCTG 


ACCAACATCT 


10020 


GTTAATAATG 


AAATTTCCAT 


GTGTCAGTTC 


CTTCCTAATC 


CGATATCTTG 


CGAAATTGAC 


10080 


TGATGAAGAA 


TCCATCACTT 


CCATACAATT 


CAGGTGTAAT 


GAGGATACAG 


CCGTCTTTCA 


10140 


TGATATCCTT 


ACATTCATGT 


TCTAGTTTTA 


CCTGCTCGAA 


CTCGGGATGA 


CTCTCTAAAA 


10200 


AGGCCTTAAC 


GACTTGAAAA 


TTCTCCTCTG 


AGACGATAGT 


GCAGGTGCTA 


TAAGTTATTA 


10260 


TACCACCTTT 


GCCTAGTATT 


TGACAAACAC 


TACCTAATAT 


TTCTAACTGA 


ATTTCCTGCA 


10320 


AGGACGCGAA 


ATCTGCCGTT 


TCTTTATTGT 


ATTTGATATC 


TGGTTTTCGG 


CGCAAAAGAC 


10380 


CGATTCCTGA 


ACAAGGAGCA 


TCCACCAAAA TCTTATCAAA 


GGAATCCTGG 


TCAAAAAACT 


10440 


CATGCACCTT 


TCTGGCATCC 


AATTTTTGAG 


TTTGAACCCG 


ATCTTCAACT 


CCCAGACGTT 


10500 


GGGCATTTTC 


TTGAATTAAA 


TCCAACTTGT 


GGTCGTACAA 


GTCCAGAGCA 


GTAACCTGAC 


10560 


CTGTCCTAAG 


ATAAGAGGCT 


ATATGGGCTG 


TTTTCCCACC 


TGGAGCCGCA 


CAGGCATCAA 


10620 


GCACTCGCTC 


ATCACCTTGT 


AAATCAAGCG 


TCGGAGCAAC 


CAGCTGACTG 


GACTCATCTT 


10680 


GGATGGTAAT 


GGCTCCATCC 


GCAAACAAAT TATGCCCTGC 


AAAATGCCCC 


TGCTCCTTAA 


10740 


CCAGACCAGT 


GGTTGCTAAA 


AGGGAATTAT 


TCGCCTCCAA 


CAAGGCTTGG 


ATTTCCTCTT 


10800 


TTCGACTTAG 


GTCTGTTACA 


CGAATACTGG 


CTTTGTTTCG 


CACTAACAGG 


CTTTCAAAGA 


10860 


TGGCTTTTGC 


TCTCTCCTCT 


CCGTATTCTT 


CCTTGAGTTT 


GGCAACTAGC 


CAAACTGGGA 


10920 


GAGAATAGGC 


AATGGAGTCA 


CGCTTGTTTT 


TTCGCTTGAT 


GCTAGCAATA 


TCTGGCCAGC 


10980 


CTTCACGCAA 


GATACGGCGA 


AGGACAGCGT 


TGACCAATTT 


TTCACTGCCT 


TTTTTACGGA 


11040 
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GTTTGCCCAA TTCCACTGCT TCATTAACCA CAGCATGATC TGGAATCTTG TCCAAATAGC 
GGAGTTGGTA GGCACTCATG AGAAGAAGGA CATAGAGCCA GCTGTCTAAC TGGTCTCTGT 
CTTCGATAAA GTGGGATAGG TACCATTCCA GAGTCAGTTT ACGGGCTACC GTTCCATAGA 
CCAGCTCGGT CACTAAGCCC TTGTCTGCTG CCAAAAGTTG ACTTCCCTTT AGATGCTTAT 
TTAAGGCGAT ATTTGAATAT CCTTGGTTCA CAAAAACATC CTCTAGCACT GCTAGAGCTA 
AACTTCTAGC CGTTTCTACT TTAGTCACCA AATCGTTCTC CTACAGTCAA TGTACGTCCA 
ACTCCGTTGA GGAAGGAAGC AATGTCCATC TTACGCTTAC CAGCTGGCTG CACTTGTTTG 
AGGGATAGAG CCCCTTCAGC CGTTGCGACA ATCAATTCTT TCTTGCCGAT AGAGAGAATC 
TCACCTGGAT TTCCCTGACC TTCTACTGGT AGGGCTTCAT AAATCTTAAA GCGGTCGCCC 
TTAAGGAAAG TATGGGCAAC AGGCCAGGGG TTCATTCCAC GAATTTGGTT AAAGAGTTGA 
CGATTGCrTT TGTTCCACTC CAGTTTTTCT TCCTCTGGCT TTATATTTGG AGAGAAGGTA 
ACCTGACTCG TATCCTGCGG TTCAGGTTTG ATATCACCAG CAATATAGGC AGGCAGAGTG 
TCCAAAAGCA AATCACGACC AACTAGCGCC AATTTTTCAA ACAAGCTGCC AACATTGTCC 



TCATCTGTGA 

ATTTCCATGA 

GCACCACCAC 

TCAAGGAGTT 

AGCTTCATAA 

AGTCCTGCTT 

ACAGCACGGT 

CCTTTTAAGA 

TCTCCTTCTT 

TCCCGTTCTT 

CGGTATTTAA 

GTTGGCCCCA 

TAGGCACGTT 

TAGTAAGGTG 

TAATCTTGAT 

ACTTGACCTG 

GTTCTCTCAG 



TCGGAATGCT 
TGGTCACACC 
GGTGTCTAGG 
TGCTTGGGAG 
GATCTTCCAT 
CCTTGGCAGC 
CTGGCTGGGT 
CTGTTGCTGA 
ATAAAAATTG 
GAGTCAAGGC 
TTAAAATCTG 
GAATGGGACT 
TGAAAACCTC 
GATAGCCGAG 
CCTTGGCAAA 
CCTTTTCAGC 
AAGAACGGAA 



GCGACGAGAA ATCATATCTC CTGCATCCAT TTCCTTAACC 
AGCTTCCTCA TCCCCTTGAA TCAAGGCATA ATGGATAGGC 
AAGGAGGGAG GCATGAACCT TGACAGCAAA CTCCATGCTA 
AAACTGCCCA AAAGCAGCAG TCACAATTCC ATCTGCTCCT 
CTCTGGACTT CCAGATAATT TTTCAGGTTG GTAGATAGAT 
CTGCTTGACT GGGGTTTCTT GGATAACTTT TTTACGACCA 
CACAACGGCT AGAATTTCGT AACGGTCATC TGTCAAAAGT 
AAAGTCGGGG GTCCCCATAA AGATTACTTT TGTCATATCT 
CTGCGGCTCA TGGTCAATGC TGAGACGGAG CTCACTATTT 
TAAAACCTGG TTGAGGGTCG ACCCCAGCTC ATCTTCTAAA 
GTAATGATAG AGGTTGTGGG TACGGGCAAT CGGTTTTGGC 
GGTCTCTGAC AAGCCTGACC GCAAAATGTT CATGACTTCA 
TTCTTCTTTC TTGTGAGAAA GGGTAATACC AATCGTGAAA 
TTGTCGTCTG ATTCCCATTT CATAGGCATA AAAGCCTTCG 
TCGAATAGCA TAGTGCTGCG GATTGTAGGA CTGTATCAAG 
CCGACCTGAT CGACCTGCCA CCTGAGTCAA GAGCTGGAAG 
ATCAGGCAGA TTCAAGGCCG TATCCGCATT TAGAACTCCG 



11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 
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11880 

11940 

12000 

12060 

12120 

12180 
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12300 

12360 

12420 
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ACTAGGGTAA 


CATTGGGAAA 


ATCCAAACCC 


TTTGCAATCA 


TCTCAGTACC 


AAGTAAAATA 


12900 


TCCGCTTCCC 


CTCGCCCAAA 


CTGGTCAAGC 


AAGGCTTGGT 


GACTGCCTTT 


CTTTCGAGTC 


12960 


GTATCCACAT 


CCATCCTCAG 


AATGCGAGCT 


TGGGGAAAGA 


GTTCTGCTAG 


CTCATCATAA 


13020 


GCCTTCTGAG 


TTCCCGTCCC 


ATAGTAACGA 


ATACTGCGGC 


TCTTACAGTT 


AGGACAGACC 


13080 


TGAGGAATAT 


CCTTCGAGAA 


ACCACAATAA 


TGGCAGTTCA 


TAGTCTTGGT 


ATCCATATGC 


13140 


AAGGTCAGAG 


AAATATCGCA 


CTTGGGACAA 


GTATCCACCG 


TCCCACACTC 


CCGACACATG 


13200 


ACAAAGCTAG 


AATAACCACG 


GCCATTGAGC 


ATGAGAACCA 


CCTCCTCTTT 


TTTAACCACA 


13260 


CGGTCTTGGA TAGCCTCTAG 


CAAAGGAGGC 


GTAAAGTTTG 


ACGTCTCATT 


TTGTCCGATA 


13320 


TAGTCTCGAA 


AGTCAATCAC 


TTGAACCTCA 


GGGATTGTAG 


CCAAAGGATT 


GGCACCTTGG 


13380 


CTTAGACGTA 


AGTGTTGATA 


GACGCCTTTG 


CCACCACCTG 


CCCGGCTCTC 


TAAGCTCGGC 


13440 


GTTGCAGATC 


CAAGTACCAG 


AGTTGCTTGA 


TTATACTGAG 


CCCCTAAAAT 


AGCTACCTCT 


13500 


CTGGCATGGT 


AACGGGGATT 


GCTGTCCTCC 


TTATAAGCCG 


CTTCATCCTC 


TTCATCAATA 


13560 


ATCATGACAC 


CCAGATTTTT 


CACAGGAGCA 


AAGATAGCAG 


ATCTGGCACC 


AACAACAACT 


13620 


TGGGCATCGC 


CACGCTCCAC 


CTTCCGCCAT 


TCATCATACT 


TTTCACCATT 


GGATAATCCT 


13680 


GAGTGAAGAA 


TGGCTACCTT 


CTCCCCAAAA 


CGTGCTATAA 


AACGCTCGGT 


CATCTGAGGA 


13740 


GTCAAGGAAA 


TCTCAGGTAC 


CAGCAAAATA 


GCTGTCTTGC 


CCTTATCCAG 


GGCACCTTGG 


13800 


ATAATCTGCA 


AGTAAACCTC 


GGTCTTCCCA 


CTTCCTGTAA 


TCCCTTGAAG 


TAGAAAGGGA 


13860 


GGTTGAGAAC 


TGCCAATACA 


ACTCACAACC 


GCATCACCCC 


CCTGTCTTTG 


TTCTGGATTT 


13920 


AACTCCAAAG 


GTCTACTTGC 


TTCAATTCCT 


TCAAAATAAG 


CAGCCGAGCG 


TTGAACTTCC 


13980 


TTTTGGACTA 


TGGTAACAGC 


ACCTTGATCC 


ACAAAGAAGT 


TGACTTGCTC 


TCGCGAGTAG 


14040 


GACTCTAACA 


AGCTAGCCAA 


GGAAGCGCTC 


TCTCCATCAC 


ACACCAGATA 


ATCTCTCAGT 


14100 


TCCAACTTTT 


TCTTGGCACC 


TGTAGAAATC 


TCAACACCTT 


CTAATTGAGC 


ATGGTCAACC 


14160 


TCATACCAAG 


ACTGGGTCTT 


GACCTTCTTT 


TGATCGACTC 


CCTGATATTC 


CACACCAACC 


14220 


AGGCCTTTTC 


TAGTCAAACC 


CATCATTTCA 


GCTTGCTTGG 


CAAGGTCTAG 


TGAAGAAAAG 


14280 


GCTAGCCAAT 


CTTCTGAACC 


AAACAGGCGC 


ACTCGTTCTT 


CCTGACTCAA 


GCCTTCCAGA 


14340 


GGATAGACAA 


TCTTGTCATA 


GCTAGAATTC 


AGAAACCCTG 


GAAGCATGGC 


CTTGAGGATA 


14400 


GACATTTTGT 


AGGAGAAGAC 


AGATTTGCGT 


AACTCCTCAG 


CCAGCCAGAG 


TTGTTCTGGC 


14460 


GTGAGAACAG 


GAGAAAAATC 


CAGCACCTCT 


GCAATATCTT 


TTAAATCTTG 


CTCCATCTCT 


14520 


TCTCCATCTG 


ATTGGGACTT 


CAAACCAAGA 


ACAATCCCTT 


GAATCAGGCG 


ATTACCCTTA 


14580 



WO 98/18931 



PCIYUS97/19588 



702 



CCAAAAGGCA CATGAACCCG CATCCCAACT TCCAGCATTC CCTCAAATTC CTCCGGAATC 
CTGTAACTAT AGGGCTGCTC CGTCTGCATC AAGGGCACAT CTACGATAAT CTTAGCTAGG 
GCCATCTTCT CACCTCCTCC TTGTCAGTAC ATTCTTGCAA TAGAAAAAAT AACATTGACT 
CCCCCCAACC TTAAATTTTT TCACCATCTT Cm T T C HT AGCAATTTGC TCTTTGATTT 
TCTTTTCTTC TTCTTCTTTG CGGCGTTTTT CTTCTTCGAT ACGGCGACGC ACTGCTTCAC 
GTTTTCCTTC TGGATCTGGG TGAATTGTAA CGTTTCCTGA TTCGATTTCT TCTAAAGCGC 
GAAGAGTTGA TTTTTCAGAC TTGAAACCTT GAGTTGCTGG GGCACCTGCT TCCAATTCGT 
GGGCACGTTT TGCTTCCAAG ATTACGAGTG AATATTTTGA AGGAACCTTG TCGAGCAAGG 
TATCAATAGA CGGTTTTAAC ATCATTTGCT TGTACCTATT TTCTAAATTT TATCGGGTAG 
TTGGAGATTT TGGTAACATC TCCTGATAGT GACCAATGAC ACGATCCACA CAGAAGTGTT 
CTGCTTCAAT CACACATTTG ACACGTTCAG CAGCTAGGGG TACCTGATCG TTGACAATCG 
CATAATCATA CTCACGCATG AGGGCAATTT CTTCCTTCGC CTTTTCGATT CGTTGGGCAA 
TCACTTCTGC ACTATCTGTT CCACGACCTA CCAAGCGATC TTGCAATTCA TCCAAATCTG 
GTGGTGTCAG GAAGATAAAG ACAGCATCTG GAACCTTTTT CTTGACCTGA AGAGCACCCT 
GAACTTCAAT TTCAAGGAAA ACATCGATTC CCTTGTCCAA GGTTTCATTG ACATAGCTCA 
GAGGAGTTCC ATAGTAGTTA CCGACATATT CTGCGTATTC CAACATCTGT CCTTGACGAA 
TCAGCTCTTC AAATTCTTCA CGAGTACGGA AGAAATAGTC AACACCGTCC ACTTCTCCAG 
GACGTTGTGC GCGTGTCGTC ATCGATACAG AATATTGAAA TTGGTTTTCA GAACTCTCAA 
AAATCTCTCT TCTAACCGTT CCTTTTCCAA CCCCTGAAGG ACCAGAAAAA ACGATTAGTA 
AGCCTCGGTC TGCCATTGTG TCTCCTTTTA GTCAATCTGT GAAATAACAT TTCTCTAGAA 
TAATGGCAAA AAGCCAGATT ATCCTTTACA GTCTTTCTAT CTAGTGTAAC AAAAAAGCAG 
TAATTTTTCA ACTGCTCTTT CTTATTTATT TAGCATAATC TACTGCACGA AGCTCGCGAA 
TCACGGTTAC CTTGATATTT CCTGGATAAT CGAGATTGTT TTCAATTTTC TTACGAACTT 
TGTGAGCCAA GATTGTGACT TTGTCGTCCT TGATTTTTCC TGGATTGACC ATGATACGAA 
TTTCACGTCC TGCTTGAAGG GCAAAGCTAG TTTGCACTCC TTCAAAGCCG TTAGCAATTT 
CTTCCAAATC ATGGAGACGC TTGATGTAGC TTTCAAGAGA CTCACTACGA GCACCTGGAC 
GGGCTGCGCT CAAGGCATCT GCTGCAGCGA CGATAACTGC TATCACGCTC TCAGCTTCAA 
CATCTCCGTG GTGACTAGCA ATCGTATTCA CCACAACTGG GGGTTCCTTG TACTTACGGG 
CCAATTCCAT ACCGATTTCA ACGTGGCTAC CTTCAACCTC ATGGTCAATG GCTTTCCCGA 
TATCGTGAAG GAATCCAGCA CGACGGGCAA GAGCCGCATT TTCACCAAGT TCGCTCGCCA 



14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
.15300 
15360 
15420 
15480 
15540 
15600 
15660 
15720 
15780 
15840 
15900 
1S960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
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TCATACCACC CAACTTAGCA ACCTCAATCG AATGGCGCAA AACATTTTGT CCATATGAAG 16440 

TACGGAACTG CAAACGTCCC ATAATCTTCA TCAAGTCTGG ATGAAGGTTT GGCGCACCAA 16500 

TTTCATAGGC AGCAGCCTCA CCGTATTCAC GAATCTTATT GTCAATCTCT TGACGGTTTT 16560 

TCTCAACCAA CTCTTCGATA CGAGCTGGAT GTATACGACC ATCTTTGAGC AACATTTCCA 16620 

TAGTCATACG GGCAATCTCA CGACGAATCG GATCAAATCC TGACAAGGTC ACCACTTCTG 16680 

GTGTATCGTC GATAATCACA TCGACCCCTG TCAAACTTTC AAAGGTACGA ATGTTACGAC 16740 

CTTCACGACC AATAATGCGT CCCTTCATAG TATCGTCTGG CAGATGAACT GTTGAGTTTG 16800 

TTGACTCCGC TACATATTCA CCAGCGATAC GTTGCATAGC TTGAACCAAG ATGTCCTTGG 16860 

CCATTTTGTC AGAACGTTCC TTGACCTCTT GCTCAGCTTC GCGAATGCGA CTGGCAATCT 16920 

CCCTGGTCAA GTTTTCCTCT GTCTGAGCCA AGATAATATC TCGTGCTTCT GCCTGAGACA 16980 

GCGCACCAAT ACGCTCTAGT TCTGCTTCTT TTTGTCTTTC GACTTCCTCT AATTGCTCTT 17040 

CACGCGCATC AAGGTTTTTC GCTCTATCAG AAATACTTTG TTCTTTTTGT TCAAGTGTTT 17100 

GTTCTTTACT CGTCAAATTG TCGTCCTTAC GGTCAAGGCT AGTAGCTCTC TCTGTCAAAC 17160 

GACTTTCGAT TTGTTTGAGT TCTTGACGTT CTGATTTGAA TTCAGCGTCC ACTTCTTCAC 17220 

GGTATTTTCT GGCTTCTTCT TTGGCCTCCA ATAGTGCTTC TTTTTTAAGA GACTTGCTTT 17280 

CACGTTTGGC TTCATTAACA AGTAAATCCG CTTCACGCTC AGCTTGTCCA CGTAAATTAG 17340 

TTGCTTCTTG TTCAGCATTT AAAAGCATCA ACTCTGCAGC TTCCTGAGAT GATTTCATCT 17400 

TAGCTGAGAT GCTGACATAT CCAATGACTA AACCAATGAT GACGGCAAAA ACAGCAATCG 17460 

CAAGCGACAT GATTTCCATG TTTTTACCTC ATTTTATTGT TATTCCGAAT CACATACATT 17520 

CTTTTACATT CTACCATAAA AAAGTGATTT TCACAAACCT AAAATAGAAT ATGTTTTGAC 17580 

GAATTTGGAA CACATTTACC AAAATAAACT TGTTGTTTAG AAATAGTAGT TTAGTAGAGA 17640 

CTTGAGAAAA AGCCTACCTT TCAATAGACT TAGTAATGAT CTTTAAAGGA CAAGAAAGCC 17700 

ACGCTATCTC CATCCATCAT ATAAATCAAG CGATTTTCTG CATCAATACG CCGTGACCAG 17760 

GCTCCTTGGT AATCATATTT GAGTGGTTCT GGTTTACCTA TTCCTGTAAA GGGATCACGT 17820 

TGAATATCCT TGATTAGTTT ATTGATTCTT TTTAACGTTT TCTTATCCTG ATTTTGCCAG 17880 

TAGCAATAAT CTGCCCAGGC ATCTTCTGTA AACTTGAGCA GCATTTCTTA CTCCTCAATA 17940 

ACATGGACCT GAGTACTTCC AGCACGAACT TGAGCCATTC CTCGCAAAAC CTTATCAGAA 18000 

AGTTCCTTAT TTTGAGCAAT TCTCAGGGTT TCTTGGATAC TATCCCACTC ACTCTTTGAA 18060 

AGGACTACAA TGTCCTCATC TGGATTTTTA TTGACCACCG TCAAAGGCTC AAATTCATCA 18X20 
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TTTACCTTCT TCATGTAGTC CTTTAAATGA TTTCGGAATG TTGAGTAAAG GACTGCTTCC 
ATAACCATAC CTCGTTTTAG CTCTTTTCCA CTATTATACA CGAAAAGAAA GAAATTGTCA 
GGAACTTGTA CAAGATTTTC TTTTCTATCT ATTTATACTC AATGAAAATC AAAGAGCAAA 
CTAGGAAACT ACCCGCAGGC TGTACTTGAG TACGGCAAGG CGACGTTGAC GCGATTTGAA 
TTTGATTTTC GAAGAGTATT ATTCGTAAAA AATCTCAAAA AGCCTACCTT TCGGTAGACT 
TAGTTTGTTT CTATTC 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7001 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



18180 
18240 
18)00 
18360 
16420 
18436 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

ACGTAGAAAA ACTATTTCTA TCACAGATAA TATTCCGTAT GTTGTTGGAG GTATTGAAAT 60 

AAACGTCCTA GGTATCTTTC TCAGTCTATG TGACTTACAA GGGAAAACTC TTTTCGAGAC 120 

AGAAATTTTG AATGAAGATT ATCCTATTTC AGAAATCAAT TCCACCATTA CCAATATGAT 180 

AAAAACAGCT ATAGAGTACG TCCCTTTGGA AACAAAATTA CTTGGATTTG GCTTATCAAT 240 

ACCTGGACAT TATAACAAAG ACTCCGGAAG TATCATTACA AACAACCCCA TATGGGAATC 300 

TTTTAATTTA TTAAATGTAA TTAAAAGATT CAATTTTCCT TTTATTGTAA AAAATAATAT 360 

CGATTGTATG GCTATAGGAC AATACCTTTT TAATCCACAC AATACCCCCG ATAACTTTAT 420 

TTTCCTACAC GCTGGATTAG GTATTTACAC TTCCTTTTTC ACAAAACAAA AAATAGGAGC 480 

CTCTAAAAAT CCTTATATCG GAGAAATTGG ACACACCATT GTCGAATTGA ATGGGCAATA 540 

TTGTGAATGC GGAAAAAAAG GTTCTTTACA AACATATATT TCGGATGCTT GGTTAATCAA 600 

ACACGCCCAA TTATTATTTA AAAATTCCCA ACTAACTGTA CTAAAAAGCC TTGTAAAGAC 660 

TGAAAAAGAC ATTCATTTAG ACACCCTTTT AACGGCTTAT AATTTAGGCG ACTCCGCTTT 720 

ACCTCAACAA ATTGATAAAG GAGTCAATTT ATTAGCCACT TCTATTGCAA ATCTCCTCCT 780 

CATCAATCCT GCTGATAAAA TCTATATCAA CAGTCAATTG CTTAATTATC AACCTTTCAC 840 

TCATGAAGTC AGGGATAAAA TCCAAGACCA GCTCCACTTC GTTCCCTTTA CTCCTAATAT 900 

AGAAATTGAA ATTTTACCTT ACAACAAACA TCGTGGAAGT ATAGGAGCTT GTGCATTAGC 960 

TATCGTCGCT TTTTTCATAG AACATAGCAA TGTATTACAA GATATTATTT CACCTTAATA 1020 

TATTAGAAAT CTATAGACCT GTTTAAATCA ACTATAACCT GTAGTAGATA TCTCGTATTT 1080 
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AGACAATATG 
ATTTATATTA 
AACCCCCAGA 
TATGAAATTC 
TTTACATATA 
TTTGAATAGG 
AACAATCAAT 
GTAAAAGTGG 
TCACAATATA 
AAAAGAAACG 
AATCCCTAAA 
TAAAGTTACA 
TGATAGCATT 
GTGCTTTGGC 
AATACAGACA 
GATACAAGAG 
AAACGATTGA 
AATACTAGTG 
AAAATTTATC 
AGGATTGACT 
TTTACATTTC 



705 

AAAACAAGAC GACTTCCATA TAGGAAACCG CCTTCTCGCT ATGTTGAGTG 
AAATAACTTT TCTTCTAGCT GCATTTTATT ATTATAAAAA CATTCATCAT 
ACTTAAATAA CAATTTTTAT TCAAGATACA TACTCCTAGA ATAAACTTTA 
TCATTTTTGT TTTTACAATT CTCCTTAGTT AAATCTTGTT TAATATATGT 
GTATTTAGCG CCACATAGTA CTGAACTCTC TCCAAAAACG GTTATTCCTC 



CCGTTATCAC AAGAAAAGCA TCTCCACGTT TCAACTTCAT ATGGCTCAAA 
TGATGCTAAA ACCTGTACCT AGATGTTTCC GTTCATAAAA CCATGAAACT 



ATGAAATTGA 

TAAGTTCATC 

TCAAATGCTC 

TTTACTAAAG 

AAGATGTACG 

TCTACAAATA 

ATTCCTAGCC 

CTCGTAACAA 

TATTAATCCA 

TAATAAGCAA 

TAACTATCCT 

TCGTTAGGCA 



TAGCGATAGT CAAATCAAGA GGCATCATAA CTCTAAAAAG 
CTCGGAAAAA TATCATTCTA ATTGTTGAAA TGCCTACATG 
ATGAAACAAC GAATACAGGT ATCAAAACTA TGACAAAACA 
ACACTGCTCA ACTTTACACC TGTAAATGGT TGTTGTATAA 
ACCACACTGT TGTAAATCAT AGTGTTCGCG AATATATTAC 
CAAGTAAAGA GAGCGGATGA GATTCAAACG AAATATGTCA 
TTCATATCAT TTAAAGAATT CTATAGACAA AATTTTTTCC 
CTGCTTCATT TTTCTACCAA CATATTTAGG AACAGGATAA 
TAGCTCAGTT CTATACCAAT CTAAGACAAA TAAGCTAAAA 
ATAGATTCCA AATTTTCTCT ATCTGCTCAT TTTAATAAAC 
TCCAGTCAGA AGCTTGTCAA ATCACACCGA AAATTCTTCT 
ATCAAGCAAA AACTCGACGA TAGTACAAAC ATTATCATAC 



TCCTAAATTA TATACTTTAG TAAGCTTTTC GGATAAGAAA AAAGGTTCAT 
TAAACATTCT TTTCTAAGAT GAAAAACAGA ATTTTTCGAT TGTGATTTAA 



AGCAACAAGA AGATTTTCAG 
TTTTGAATAT AAACTACAAT 
TACGATTATA TGATACAGGC 
TATTTGCGAA AAGTTATTTT 
TTGCTAGACT CCAAAGAACT 
GAGCATACTC CAACTCATAA 
GTTCAAAAAT TGATGGAGAA 
AAATAAGGAA TTAATTCAGG 



TATCATCCTA TAGATACGAG CTAATTAAGA AAAACTACAT 
AATATAAACT AAATTTTATA GGAGGAAGAC AATGGATTGG 
ATCCAAACAA TCACAATTCA ACGCAAGCCA TTGGTTTCGC 
TGAAGACTAT TCTTATTTAA CAAACCAAGA TGTAGAAAAG 
AACCCGTTTT CAAAAAATTA GCTTGAAGTA TGCCTTTCAA 
ATATGTGATT TCATTAAATA AACCTGCTAA GTTAACCAAT 
ATACAAACAT GGATAAAATG AAACCGGTCT TCCAAGCCCT 
AAAATCTGAC TTTAACAATT ATCTGTGTCG GTGGTTATGT 



1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1630 

1740 

1800 

I860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

25B0 

2640 

2700 

2760 

2820 
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CTTAGAATAT CATGGTTTAC GTGCCACACA AGATGTTGAT GCTTTTATGG CTCTATAATA 2880 

TTTGTAGTGG GTAAATCCCC TATGGATATT ATGGAGCCTA TTTTTGTGTA GAAAAAAAGT 2940 

CCCATATGAC CTATAATGAA AAGCGACAAA ACAACTCATT AGAAAGAATC ATATGGAACA 3000 

ATTACATTTT ATCACAAAAT TACTAGACAT TAAAGACCCT AATATCCAGA TTTTAGACAT 3060 

CATCAATAAG GATACACACA AGGAAATCAT CGCCAAACTG GACTACGACG CCCCATCTTG 3120 

CCCTGAGTGC GGAAACCAAT TGAAGAAATA TGACTTTCAA AAACCGTCTA AGATCCCTTA 3180 

CCTCGAAACA ACTGGTATGC CTTCTAGAAT TCTCCTTAGA AAACGCCGTT TCAAGTGCTA 3240 

TCACTGTTCA AAAATGATGG TCGCTGAAAC TTCTATCGTC AAGAAGAATC ATCAAATTCC 3300 

TCGTATTATC AACCAAAAAA TTGCGCAAAA GTTGATTGAG AAGATTTCTA TGACCGATAT 3360 

TGCTCATCAG CTGGCCATTT CAACTTCAAC TGTCATTCGC AAGCTCAATG ATTCTCACTT 3420 

TGAGCATGAT TTTTCGCGTC TTCCTGAGAT TATGTCCTGG GACGTTGAAA CAGTCCGGGG 3480 

AGTGACTGTT TCAATCGGGA GATGGAGATG ACCTTTATTG CGCAAGATTT TGAAAAGCTC 3540 

GATATCATCA CTGTTCTTGA AGGTAGAACA CAAGCTGTCA TCCGAGATCA CTTTCTTAAA 3600 

TATGATAGAG CCGTCCGATG TCGCGTCAAA ATTATTACTA TGGATATCTT TAGTCCTTAC 3660 

TATGACTTAG CTAGACAACT TTTCCCGTGT GCTAAAATCG TTCTTGATCG CTTTCACATT 3720 

GTACAACATC TTAGCCGTGC TATGAGTCGT GTGCGTGTCC AAATCATGAA TCAGTTTCAT 3780 

CGAAAATCCC ATGAATACAA GGCTATCAAG CGCTACTGGA AACTCATTCA ACAGGATAGC 3840 

CGTAAACTCA GCGATAAACA TTTTTATCGC CCTACTTTTC GTATGCATTT AACCAATAAA 3900 

GACATTTTAG ACAAGCTTTT GAGCTATTCA CAAGACTTGA AACATCACTA TCAGCTCTAT 3 960 

CAACTCTTGC TGTTTCACTT TCAGAATAAG GAACCGGAGA AATTTTTCGA ACTTATCGAG 4020 

GACAATCTTA AGCAGGTTCA TCCTATTTTT CAGACTGTCT TTAAAACCTP CCTCAAAGAT 4080 

AAAGAAAAGG TTATCAACGC CCTTCAACTA CACTATTCTA ATGCCAAACT GGAAGCGACC 4140 

AATAATCTCA TCAAACTTAT CAAGCGCAAT GCCTTTGGTT TTCGAAACTT TGAAAACTTC 4200 

AAAAAACGGA TTTTTATCGC TCTGAATATC AAAAAAGAAA GGACAAAATT TGTCCTTTCT 42 60 

CGAGCTTAGC TTTTTTTCAA CCCACTACAG TTGACAAAGA GCCGGAAAAA GGAACAGCCT 4 320 

TAGCTTTCCT TTCATTTCTT TTTATTTCCC TCGTAGTAAA CGTGCTAGCT TCCACAAAAC 4380 

AAACAGGATT CCCAGAAATG CCAGTACCAC TAGCCCACGG TACAACCATT GAGAGGTTGC 4440 

AACACGCGAT ACAGATTGTC CTTCTTTCGT AAAAGCAACC CTCGCAACTG CAGCTGTTTG 4500 

TGGATCTGAT TTTTGATAAA CAGCGACTCG TTCAAAATTC ACTAATAAGC GTTTATTAAA 4560 

GGTAGGAATC GGATCGCAGG TTATCAAGGT CATGATATTT TTAGAGCTAA CCGATTCTAA 4620 
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TTTTTCCCAT TCCGACGGTA AAATAATCTC TGTGTCCATC ATCTGATATT CTACAATTTC 
CTGGCCATTA TCATAATAAA GAGCATCTCC AACTTTTAGC TGATCCAAAT GGCGGAAAAA 
GACATGGCTT GGCTCTGCAC GGTGCCCAGC AATCACTGAG CGAATCCCTG TACCATCCAG 
AGGCAGCGGT GTACCATCCA CATGAGCCAA GCCCATCCCT AAATGATGAT AATCTGCTCC 
CAAATAAACC GGCTCCATGA TTTCCAAACT TGGAATAGAC AAGTAACCAT AGACTGCATC 
AGGGTCGTCA GACACTTGGT AATTGACCTC ATATCCCTCC GCCAAAAAAG GATCTACAAT 
GCGATTTTGC GAAGCCAAGC GTTGATTGTA GGCGAGAGAA TGGTTCTGTT GTTCTTGGTA 
CATTTCAGTT GTCATGGATT TCACAAATGT AGCATGACCT TTCACCTGTC CAAGAGACTG 
CAACACCATC TGTCCAAAAC AATAAATAGG AATCAAACAG GCTACCAACA TCAACAAGTA 
TCCCAATAAG GCTCGTAGTT TAGTCCTTGA CATGACGCCC CTCCAATTGC TTTTCTAGTC 
CTTTGACAAT CCGTCGATTA CGATACACGC GATACAGCAA GAGAAGGATG ACCGCCATCG 
CTCCTAGTAA TAACCACAAC CAGAATTGCC CACGCTCTCT CACCGCTCGA TTCCGCTCTG 
CAATTGGTGC CGTATACGGA ATCCGCTTCC CACGTACCAA CAGACGATGA CTGTTAATCA 
TATACGGTGT ACAAGTCAAC AAGGTCGCAT AATCTTCCCC ATGTTGAATC AAGACAGGCT 
CAAAGTCATT CGGCTCCACC GTCACTATCT GATCCACTTG GTACGCCAAC ACCTGATCTA 
AAACGTGAAG ATAAAAGATA TCCCCTTTTT TCATCTTATC CAATTGACTG AACAATTCTG 
CCGTTGGCAA TCCTCTGTGA GCAGTGATCA CTGTATGGGT ATTTTCACCT CCAACAGGCA 
GCGAAGCCCC TTCTAACACC CCTGCCCCTT TCTGAAGAAT GTCCTCACTC CTTCCGACAT 
ACATCGGAAT TTCCTGATCA ATCGCAGGAA TTTCCACATA GCCAATCCGC TCATGGACCT 
TT AG CAT ATT GGCATATTCT GAGACGCCTT TCTTTTTCTC TTGCTCTGTA AAAGGATCAA 
GAATTTCAGA TGGTTTCAAG GTCGCATTGA AGGCTTGAGC CAAGCGCCAA CGCTCCTCAA 
GTTCTGCCTT ATCCATCTGG GAAACCGTCT CATCAAACTC TTTAATAACC TCGTTTGACT 
CAATACGATA ATAATAACGA GACACCAATG GATATATCGC AACGGCGAAT CCTACTAAGA 



AAATCAGAAG AAGGATCAGC GGATCTTTCT T( 



GCCTTTTTTT CGTGAACCTC 



TACTGTTGTC CATCCTCCAC CTTCACTTCC TTCCTTGCTG CTTTCAGCGC CTTCAAAGCC 
TTTTCCGGTT GTTTTTTCTT CTTGCGCAAG CGTCGAATAA TCCATAAAAG AATCACAATC 
AAACCAACTG CCACATAAAA CAGGTAGCGA TAGAGATGAC TGAGTTTGTT TGCTGCAATA 
AATTCTTCCT CAACCTCTGC TACGTACGGT ATCCGATGCC CCCGAACCAA TAGACGATGG 
GTATTGATCA TGTATGGCGT ACAAGTCAGC AAGGTCACAT AATCATGACC TGGTACAATC 



4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
S460 
5520 
5580 
5640 
5700 
5760 
5820 
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AATAAATCAT CAAAGTTCGT CGGCTCAATC ACCTTTACTT GATCCACTTG ATAGCCCATC 6420 

ACTTCCTTGA TATTGTGCAC ATAAAACTTA TCCCCAACTT TAAGTTTGGT CAAATCCGTA 6480 

AACATCTTAG CTGTTGGCAA ACCTGTATGT GCCGTAATCA CCGCATGGGT CGAATTGCCT 6540 

CCGATCGGCA GAGAAGTTCC CTCTAGATGC CCAGCCCCTT GCTGCAATAC CTCTTCAGCA 6600 

GTACCAGCAT AAACCGGCAA ATCCACGTCA ATAACGGGGA TTTCCACATG CCCCATCCGC 6660 

TCATGGATTT CTAACATACG TGCATACTCT GCTCGCCCTT TTTTCTTCAT TTCTTCCGAC 6720 

CAAGGATCGC CACTCACTAC ATTATTCAAA GAGTCATTGA AGGCTTGTGC CAATTTCATT 6780 

CGTTCATCAA TGTCAGCCTC ATCCAACGTT GCTTTTTCCT TATCAAAGTC AGCAATTTGT 6840 

TGATTTGATT CCACTCGATA ATACAAGCGA GACACCAGCG GATACGCCAT TACCGCCATT 6900 

CCAATGAAAA ATACCACTCC TAATAGGAGA TTATTTCGTT TTTGCTTTTT TGTTTTTACC 6960 

ATTTTTATCA GCATCCCTTT ATCTTCAAAC TTCAGGGTAT C 7001 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104X1 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

GAGGGAGCTT AAGAAGTTAC CACCGTCCTC TAGCGCCTTA TCCGCATCAA AGTTAAGGTT 60 

GATATTTTTA AAACTGTCGC CAGCTTGTGA TACGATCCTT TGTTTAAGGT CATTTAGGGT 120 

TTTAGTGAAA TCTGCATTGC TGAGGATATC ACTCTTTGAG AGATTCAAGG CAAAATTGAT 180 

GATGATATTG ATCTGGTTTC CTGTTATGAC CTGATCAACT TTGTAATTTT TTAAGGTATC 240 

TTCAACAATC TTGCGGATAT CTTCTTCTGT CAGATTTCCC TTACTTTCTT TAGCTTTGGC 300 

GAGTCCTGAC TTGATATCAG CTAGGGCAAC GTTTAATTTA TTAGCATCAT AGCCTGA1 ?T 360 

GTCCTTGTTT TCAGCATTGA TATCTGACAA AGCTTTTAGC TCTTCTTGAG CCAAATCTTT 420 

ATTAGCTTGT GGCACCTTGG CTCCATTAGC CTCTAGCGAA TAGTAAATCC CTGCTAAAGC 480 

ACTTTCTCCT GTAACTGGAA TAGGGGCTGC TACAGTGATT TTGGCATGTT CCATACCCAG 540 

CGTTACTGCT GCGTTTCGGT ACATATCCTG AGTCACCTTA GTGATATTTT CTGGTGTTTC 600 

AATCTTGACC TCAAGTGGCG ATTTGTCACC TAGCTTTTGA ATCTTGGCTG ATGAATACAA 660 

CTGTAAGCTA GAGTCATTGG CCACATTCAT GATTTTAGAA TAAACATCAG GTGTCATGGT 720 

CTTGAGTTCT TTGGTATCTG TTGAGGCATT GTAGCCCAGT TTTTTAAGAG TTTGATTTTT 780 
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TTGGTCTTCA GATAGGGAGG AACCTAGGAC ATATTCAGGT TGGACATAGG TTTCATCGAT 840 

AACTTTTTGA ACATCTGTTG CTGCATGGAC GCTATTCATA GCTGTTACTG CCCACAAGAT 900 

CGCAGCGCTA GTCAGAAAGA GTTTCTTTCT CATAGGGAAT TTCCTCCTTT ACTTCTTTAG 960 

AGTAATATAT CTATCTTAAA GAAAACTTAT AACAAAAACA CCTGGTCTAG CCAGATGTTG 1020 

AAAAGAGAGT GAAACATTTG ATGATGTAAA GGTTAAGTCG TACCTGTCTA GAATAATAAT 1080 

AGTTTCCTCC ATTTACATAG AGTTCAGCAC CGTGAAAAAT GGAAATGGGG TGAATATAAC 1140 

TATAAGTCTT TCCAGTCCTA TTACCAAGCA AGGGGGCAAC AGTCTCACGA GAGTACTGTT 1200 

TGGCTAGAGC CAGGGTATTT TCCTTGCCAT TTTGGGCGAT AAAATCGATA TAGGCAGGTC 1260 

CAAAATTATA GGCTTGAACA GCTGTCCAGA TATCTACCCC CTTCTTCTGC GCCAGATAGA 1320 

GATTGCCTGT CAGAGTTTGA ATGCCTTGCC GAATGCTAGA GGCATTATCA TTGATGGTGT 1380 

TGGTGGAACC ACTTGCAGAC TCACTAGACT GCATAACATC GCCTTCTTTT CCTTTTGTTT 1440 

CAGTATAAAT CATAGCAAGC ACAAGCTCTT CGTTTGCTGG GGTCTCTTGT TCACTCAATA 1500 

TTTCTCGCAC CATGGGTTGA TAGGTCATGA CTTGTTTGAC ATCTTGATGA ACGCGGTAAG 1560 

CTTTATAGCC AGCAAAAAGG AAGACTGCTA GTACAAGCAC TCTTCGAATT CGTTTAAACA 1620 

TTATTTACTT TGGATATCCT CGATATTTTT GATTAACATA GAGTAGGTTC CATTTTCGTT 1680 

TTGGATAAAC TCAACAGACT CGGCGTCTTG ATAGACGTTA TTGGGAACGA TGAGCTCAAT 1740 

TCCATTTGAT AAGGAGAGTT TTTGGTTTTC AAATTTCTTT AATTGGCGAC TGGCATCAAT 1800 

TTCATCAAAT TGAACAGGTT CTGGTACGGC TTCTTTGACT TGGTCAATAA AGCTCAAACG 1860 

AGCCGTCAGA TTGTTGTCAA AAAGGTCATT AGCCAATTTC TCAGGTGACA ATTCATTGCT 1920 

TTCTTCTAGG TTGTTGAAAA TAGCTGATTT GACCTTGGAT TGAAATTGAA AATCATCTGT 1980 

GTTAAAAGAT TTAGCAATTC TCTGGGCTGT TTTTTCCAGT TCCTTGATAG ATTTTTTAGC 2040 

AGAAATCTTA GGAGCGACAG CAAGAAGATT ATCTGAAAAA TAGTTCAAAA AAGTCCCGTT 2100 

GTACTTGATT CGTTTTTCAA TCAGGTGATA CTTCCTACTC TGAAGATTGA CCACCAAGGC 2160 

CTCATCAGCT CCTGTTCCAA ATCCAGGCAG GTTATTCTGA GTTAGCTTGA TTGGATTATC 2220 

AACTTCTCCT CCGAGGTGGG TCAAGGTCTC CCGCAGGGCA ATTCGCAAGA AAGCGAAATG 2280 

TTCTACACCT TCTTTAGAAA ATTGCACAAA AATCAAGTCA TTGGTCTTGA GATTTTCAGA 2340 

AATGCTAAAC TCCTCTTTCC AGAGATTAGC CAGCGTTACT GATGTCTCCA ACAAATCGTC 2400 

TGTAATATGA TTGAAGAAGG GATTTTCTTC TTCGAAAATC CCAGTCTTGG CTTCATCTGA 2460 

ATACACATGT TCAATTTTTT TACGCAGGTA TTCTTCCATT TTTGGAGTAA TATTGAGAAA 2520 
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CTTATCTGCT AAGAACACTT CGGTATCATC CGGACTGAAC TGGTGAATAA TGGCTTTCTT 2580 

AATATAAATG TCCATAAAAG TTTTAGTCCT CGTATAATGG GAAGGCATCT GTCAATTCTT 2640 

TGACTGCACT TCTCACTTCT TCTAATACAG CCTCATTTTC TGAATTCTTA AGGGTTTTAA 2700 

TGATGAGTTC AGCCACTTTG CGACTTTCTT CTTCACCAAA TCCACGTGCA GTAATGGCTG 2760 

CTGCTCCGAT ACGAATCCCA CTTGTCTTGA ATGGTGACAA GCTTTCGT AA GGGATTGAGT 2820 

TTTTATTTAA GGTAATATTG ACTTCATCCA ACAAGTTTTG AGCAACTTTG CCGTTTTCTA 2880 

CAACTTTAGT CACATCAACA AGGAAGAGAT GGTTTTCAGT TCCACCTGAA ATAATACGGA 2940 

AATCAGGGTC TTGCAAGAAG ACATCTGCCA ' TAGCCTTGCT GTTCTTAATT ACATTGGCAG 3000 

CATATTCCTT GAAGGCTGGA TCCAAAACTT CTTTGAAGGA AACTGCCTTA GCCGCCACAA 306 0 

CATGCTCTAA AGGACCGCCC TGAATACCTG GGAAAATAGC TGAATTGATT TTTTTAGCAA 3120 

GTTCTTCGTC ATTGGTCAAA ATCAAACCAC CACCAGGTCC ACGAAGGGTT TTGTGGGTCG 3180 

TTGTTGTTGT GATATGAGCG TATGGAACTG GGCTTGGATG AACGCCAGCC GCAACCAAGC 3240 

CAGCGATATG GGCCATGTCC ACCATGAGCT TCGCACCGAC AGCATCTGCG ATTTCACGGA 3300 

ATTTTGAAAA ATCGATAATT TGAGAATAGG CTGAAGCACC AGCTACAATC AG'ITTTGGTT 3360 

TTACTTCTTG GGCTTGTTTC AAGATAGCAT CAAAGTCTAA GAGTTCCGTT TTAGGATCAA 3420 

CACTATAAGA AACAAAGTTG TAGGTTTGAC CAGAGAAGCT AACAGGAGCC CCATGAGTCA 3480 

AATGACCACC TGATGCCAAA TCCATTCCCA TAACCGTATC ACCTGGCTCA ATCAAGGACA 3540 

TGTAAGCCGC ACAGTTAGCT TGGCTTCCTG AATGTGGTTG AACATTGGCA AATTTAGCAC 3 600 

CGAAAATTTC TTTTGCGCGT TCAATAGCAA GAGTCTCTAC AACGTCTACT ACATCAGTTC 3660 

CACCATAATA ACGGCGTCCT GGGTAACCCT CGGCATATTT ATTTGTCAAG ATAGACCCTT 3720 

GAGCTGCCAT AACAGCCTTG GAAACTACGT TTTCCGAAGC AATTAACTCG ATATTATTTT 3780 

GTTGGCGTTC TTCTTCTTTG GCAATAGCAT TCCAGAGATC AGCATCATAT GCTTTAAAAT 3840 

CATCTTTGTC AAAAATCATA GGTCTTCTCC TTTATTGTGT GACTAGTCCA TTAGTTTGAT 3900 

TTTACAATAA GAAAATCAAA CTAACAGATG CGAATAAACC GTTTCTGCAT TTTATCACAA 3960 

CTATAGCCAA CTTTTTCATA AAATGCATGA GCACCCAGAC GATGATTGGC AGAATTTAAG 4020 

CGGATAAACC CATAACCACA TCTTTTTGCT TCTTCTTCCA ACCCTTGTAG TAAACTTTTA 4080 

CCAATACCTT GACCTTGCGC TTGAGGTGAA ACTGCTAAAG CTAAGATATT AAATCCTGCT 4140 

TTGGAATAGA GTGATTCGTA AACTTCAGCG TGGACATATC CAAGTAAGAC ATGATTAGCT 4200 

GCATCCTCAT AGCCAAGTAG GAAATGATGG GAATCCTGAG ACAGTCTAGC TAGTTGGCTA 4260 

GCCGTTTCCT CTGGACTAAA AGTATAACCC AAAGCCTCTT GGTTGATGTC ACATATAGCT 4320 
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TTCACATCAG 


TTTCTCTTAA 


ATCTCTTAGC 


ATCTCATTCC 


TCCTCAAAAG 


AAATCTTTGG 


4380 


CAACCGAGCA 


AGAATATCTT 


CTCGCTTAAT 


GGCCCCTTGA 


CGTAAGATTT 


TCACCTTGTC 


4440 


TCCCGACAAA 


TTCAAAATAG 


TTGAATCCTG 


TCCAGTTAGA 


AAAGCATCGT 


CTTCCAGACC 


4S00 


CAGAACCTCT 


TGGTCAAAAT 


CCTCTAGAAT 


TTGATTAAAG 


GTCACTCCAC 


TCGCCTGACC 


4560 


TGAGATATTG 


GCAGACGGCC 


CAATCAAGGG 


ACCTGTCTCT 


CGAATCAAAT 


CAAGGGTAAT 


4620 


GGGATGACTA 


GGCATCCGAA 


ATCCAACAGT 


TGCAAGGCCA 


GAATTGACCC 


AATAGGGAAC 


4680 


TCGGTCATTA 


GCTTCGAGAA 


TAATGGTCAA 


GGGACCTGGT 


AAAAAGATCT 


CTACAAGTTT 


4740 


TTGAAGATAA 


GTTGGCTGAT 


TCTTTGAAAA 


GTACAAGATG 


TCCTCTAAAG 


AGGCAACATT 


480O 


GAGATTGAGC 


GCCTTGTCTC 


TACGTCGACG 


TTTAAGCTGG 


TAAACATGGT 


CAACTGCTTT 


4860 


TTCGTCTAGC 


GCCTTAGCAA 


AGAGACCGTA 


AACTGTCTCT 


GTAGGCAAAA 


CGACAGCTCC 


4920 


ACCATTTTCC 


AACTCTTGTC 


TAATCCTGTC 


CATCATCAAC 


GACAACCATC 


CTATCTTGAC 


4980 


CAAATTGGTC 


CTTGAGTGTT 


CGTACTCGCT 


TTTCAGGAAG 


ATGTTTCCTA 


AAAAGTTCAG 


5040 


GAACACTTTG 


ACCTTGCTTG 


TATCCAATTT 


CAAGGTAAAT 


CTTACCACCA 


TCTTTGAGAT 


5100 


AGTCTTTTGC 


ATCTTCCGCA 


ATTCTACGGT 


AAATAGCTAG 


GCCATCCTCA 


TCTGCAAACA 


5160 


GAGCTAGATG 


AGGCTCCGAA 


TACAAGACAT 


TCAAGCCTAC 


CTCTGACTCA 


TCTTCACGAG 


5220 


AGATATAGGG 


TGGATTGGAA 


ACAATTATAT 


CATATTTTTC 


AGAAATTTCT 


GTAAAACAGT 


5280 


CACATTTTTT 


TAAAAATATT 


TGAAGATTTT 


GATTTTTAGC 


ATTTTCGCTA 


GCTACATCTA 


5340 


AAGCATCTTG 


GGAAATATCT 


GCTGCCGTCA 


CTGACCAATC 


TGGTCTGTTT 


TTTGCTAGAG 


5400 


CGAGAGCAAT 


AGCTCCACTA 


CCTGTTCCGA 


TATCTAGGAC 


CATAAGATTT 


TTCACAGGAT 


5460 


TTTCAGCCAG 


GATAAGCTCC 


ACCAACTCCT 


CTGTTTCTGG 


ACGAGGAATC 


AAAACCCGTT 


5520 


CATCCACCTT 


TAAATGCATT 


CCATAAAAAT 


CTGCCTGTCC 


AATGATGTAC 


TGAGCTGGCT 


5580 


TGTGAGCTGC 


TAGTTGCTGG 


TAAATATCTT 


CTACAAATTG 


TTTTTCTTCC 


TCTGTTGTCA 


5640 


CCTCCTGCTG 


GAGGGCAAAA ATAAAGTCTG 


TAAAAGATAG 


ATTTTTCACA 


CTACGATAGA 


5700 


CAAAAGAGAG 


GCTTTCCGCT 


TCCTCTCCTT 


GTCTTATCAA 


CTCTTCTTCA 


AAATTTGAAA 


5760 


ATAATTGAGC 


TAATTTCATT 


ATTTGTITAA 


TTCTTCTAGT 


TTTTGTGTTT 


GGTCATAAAG 


5820 


CACCAAGGCA TCCACAACTT 


CGTCCAATTT 


ACCAGACAAA 


ATCGTATCTA 


GTTTTTGGAG 


5880 


GCTCAAGCCG 


ATACGGTGGT 


CTGTGACACG 


GTTTTGTGGG 


AAGTTATAAG 


TTCGGATCCG 


5940 


TTCTGAACGG 


TCACCAGTAC 


CGATTGTCGA 


CTTACGCTCA 


GCGTCCTGCT 


CATCTTGAGC 


6000 


AATCTGAGCA 


AAGTGGTCAG 


CAACACGGGC 


ACGGATGATT 


TTCATGGCCT 


TCTCACGGTT 


6060 
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CTTCTGCTGG GTACGTTCTT CCTGCATCTC AACCTTGATA TTGGTTGGCA ACTGAACGAT 6120 

ACGAACGGCA GTCGCAACCT TATTGACGTT CTGTCCACCA GCACCAGAGG CGTGATAGAT 6180 

GTCGACACGA AGGTCTTTTG GATCAATGTC GTATTCAACC TCTTCAACTT CTGGCATAAC 6240 

AAGAACTGTC GCTGTCGAAG TATGAACACG GCCTTGGCTT TCTGTCACAG GAACACGTTG 6300 

CACACGGTGG GCACCTGATT CATACTTAAG CTTAGAGTAT ACAGACTGAC CTGAAACCAT 6360 

AGCAACCACT TCTTTAAAAC CACCGACACC ATTCATAGAG GCTTCCATGA CTTCAAAGCG 6420 

CCAACCTTGG GCTTCCGCAT ACTTTTGGTA CATAGTTAGC AAATCTCCAG CGAAAAGTGC 6480 

CGCTTCGTCT CCACCAGCTG CTCCACGGAT TTCAAGGATG ATATTCTTGT CATCGTTTGG 6540 

ATCCTTTGGA AGGAGCAAAA TTTTCAGTTT TTCTTCATAT TCTTCTTTTT CAGCCTTGGC 6600 

ATCTTTGAGT TCTTGCTTGG CCAATTCTTC CAACTCCGCA TCTCCGCCTG ATTCCTTAAT 6660 

CATCTCTTCG GCATCGACGA TATTTTGAAG GACTTGTTTA TACTCACGGT AGGCTATTAC 6720 

GGTGTCACGA TTGGAAGCTT CTTCTTTTGA AACCTCCATA AAACGCTTGG TGTCTGAAAC 6780 

GACATCAGGG TCACTCAGCA ATTCTCCTAA TTCTTCATAA CGGTCTTCTA CAACTTGTAG 6840 

TTGATCATAG ATGTTCATTT TTTCTCCTTA TTTCTCAATT GTTAAATCAT AGATTGCTAC 6900 

TACTTCATTC TCGGATATTT CCCCAGTTTC TTTAAATCCA TAACTGAGGT AACAAAATCT 6960 

TGCCTGTTCA TTTTCTGGTT CATArGACAA CCAAAGTTTA TTGCTTAAAC CTGCTGGCGC 7020 

TGTTCGAACA TAGTCTAGTA CTTTATCCAT AATTGGTTTA AAATATCCTT GATTTTGAAA 7080 

ATTCTTATCA ATCATAAAAC GAAATAGTAA ATAATTTCCA CTACTAATTC CGATCTTTTT 7140 

ATCATAAGCT ATCATCACAA AACCTATAAT TGCATCATTA TCATAAACTG CCAATGGAGC 7200 

TACAAAATCT CCATTTTTAG TGTAGACGTA TCCTTCAGCT AAACTAATTG CGTTGGTTGC 7260 

AATGAATTGT TTTTGATATT CCTTGACATC CAAATTTAAA ACATCAAAAT AATTTTCCAT 7320 

TGTAACATCT CTTAGTTCAA TTGTCATAGT TTTGCTCCTT GTTAGAGGTT ATCATTGGCG 7380 

CAAAATAATG TTTACGGCAA ACTGAGATAT AGGTTTCGTT ACCACCAATC TGCATCTG'lT 7440 

CTCCATCGTA AACGGGCAGT CCATCCTGTG TTCGCAACAC CATGCTCGCC TTTTTCTTGC 7500 

AATACTGACA GATGGTCTTG ATTTCGTCAA TCTTGTCTGC TAAAAGCAAG AGATATTTGG 7560 

AACCTTCGAA CAATTCATTG CGAAAGTCAT TTTTCAAGCC AAAAGCCATG ACGGGTATGT 7620 

CTAACTCGTC CACAACACGA GCTAGGTCGT AAACATGGTG GCGTTTGAGA AACTGGGCTT 7680 

CATCGACCAA AACACAGTAA GGTTTTTCTG GTAGGTCTCG GATATAGCCA AAGATATCCG 7740 

TTGTTTCCTC AATCGCAAgG GCAGGGCGTT TCATGCCAAT TCGACTCGAC ACATAGCCAA 7800 

CGCCGTCACG CGTATCCAGA GCCGAGGTCA TAATCACAAC ACCTTTTCCT TGCTCCTCGT 7860 
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AGTTATAGGC CACTTTGAGA ATCTCAATCG TTTTACCAGA GTTCATGGTC CCATAACGAT 7920 

AGTACAACTG TGCCATGTTT CTTGCTTCAC GTCCATTTCT AAATTTTTGC TACATTCTAG 7980 

TATATCATAA TTTTCTTAAG CTTTAAACGG CAAAATGTGG TAAAATAGAA GAAATCAAAA 8040 

ACTAGTGGAG GAAGCTATTA TGCCATTTGT ACGCATCGAT TTATTTGAAG GACGCACGCT 8100 

CGAGCAAAAG AAAGCTCTTG CTAAGGAAGT AACGGAAGCA GTTGTCCGCA ACACTGGAGC 8160 

CCCTCAATCT GCTGTCCATG TCATCATCAA CGACATGCCA GAAGGAACTT ACTTCCCACA 8220 

AGGGGAAATG CGTACTAAAT AAGCTAGCTT AAGCAGAATT GCTTAGGCTT TTTCAATCTC 8280 

CAAGTAGCAT TCATTGAAGA AATATCCTAA ATTTGTTACA ATTTGAAAAG AAACTTGGAG 8340 

AATTTCCAAG AAAAGAGCTA TTAATTAAAG GAAACATTAT GATTACACGT GAATTTGATA 8400 

CCATCGCTGC TATCTCTACT CCACTAGGTG AAGGGGCTAT TGGTATTGTC CGCCTGAGCG 8460 

gAACAGACAG TTTTGCTATT GCGCAAAAGA TTTTTAAAGG AAAAGACTTG AACAAGGTTG 8520 

CCAGCCACAC TCTCAACTAC GGTCACATTA TTGATCCTCT GACTGGTAAA GTCATGGACG 8580 

AGGTTATGGT TGGGGCTATG AAGTCTCCAA AGACCTTCAC TCGTGAGGAT ATTATCGAGA 8640 

TTAACACCCA CGGTGGGATT GCGGTGACCA ATGAAATTCT CCAGCTAGCT ATTCGTGAAG 8700 

GGGCTCGGTT GGCAGAACCT GGTGAATTTA CCAAACGTGC TTTTTTAAAC GGTCGCGTAG 87 60 

ACTTGACACA GGCAGAGGCT GTGATGGATA TCATCCGTGC CAAGACTGAC AAGGCCATGA 8820 

ACATTGCGGT CAAACAATTA GACGGCTCCC TTTCTGACCT CATTAACAAT ACCCGTCAAG 8 880 

AAATCCTCAA TACACTTGCC CAAGTTGAGG TCAATATCGA CTATCCTGAG TATGACGATG 8940 

TTGAGGAAGC CACTACTGCT GTTGTCCGAG AGAAGACAAT GGAGTTTGAG CAATTACTAA 9000 

CCAAACTCCT TAGGACAGCA CGTCGTGGTA AAATCCTTCG TGAAGGAATT TCAACGGCTA 9060 

TCATTGGACG TCCCAACCTT GGGAAATCAA GCCTTCTCAA CAACCTCTTG CGTGAGGACA 9120 

AGGCTATCGT AACAGATATC GCTGGGACAA CACGAGATGT CATCGAAGAG TACGTCAACA 9180 

TCAATGGTGT ACCTCTCAAA TTGATTGATA CAGCCGGTAT TCGTGAAACG GATGATATCG 9240 

TTGAACAAAT TGGAGTTGAG CGTTCGAAAA AAGCTCTTAA GGAAGCTGAC CTAGTTCTGC 9300 

TAGTACTAAA CGCTAGTGAA CCACTAACCG CCCAAGATCG CCAACTCCTA GAAATCAGTC 9360 

AGGAGACTAA TCGCATTATT CT TC TTAACA AAACTGACCT GCCTGAAACG ATTGAAACTT 9420 

CGGAACTACC TGAAGATGTC ATCCGCATTT CAGTTCTTAA AAATCAAAAC ATCGATAAAA 9480 

TCGAAGAGAG AATCAACAAC CTCTTCTTTG AAAATGCTGC TTTGGTTGAG CAAGATGCTA 9540 

CCTACTTGTC AAACGCCCGT CACATTTCCT TGATTGAGAA GGCCGTTGAA AGCCTACAAG 9600 
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CTGTTAACCA AGGTCTTGAA CTAGGGATGC CAGTTGACTT GCTTCAAGTT GACTTGACCC 9660 

GTACTTGGGA AATTCTAGGA GAAATCACTG GAGATGCTGC TCCAGATGAA CTCATCACCC 9720 

AACTCTTTAG CCAATTCTGT TTAGGAAAAT AAGAAAAATC CATGATCCTT CATTCGGTCA 9780 

TGGATTTTAG GTTCTATAAT ATTTGTAGTG GGTAAATCCA CTATAGATAT TATGGAGCCT 9840 

ATTTTATTGT AGAAAAAAAG TCCCATATGA CCTATAATGA AAAGCGACAA AACAACTCAT 9900 

TAGAAAGAAT CATATGGAAC AATTACATTT TATCACAAAA TTACTAGACA TTAAAGACCC 9960 

TAATATCCAG ATTTTAGACA TCATCAATAA GGATACACAC AAGGAAATCA TCGCCAAACT 10020 

GGACTACGAC GCCCCATCTT GCCCTGAGTG CGGAAACCAA TTGAAGAAAT ATGACTTTCA 10080 

AAAAACCTTC TAAAATTCCT TATCTTGAAA CGACTGGTAT GCCCACTAGA ATTCTCCTTA 10140 

GAAAGCGTCG ATTCAAGTGC TATCACTGTT CAAAAATGAT GGTCGCTGAA ACTTCTATCG 10200 

TCAAGAAGAA TCACCAAATC CCTCGTATCA TCAACCAAAA GATTGCTCAA AAGTTAATTG 10260 

AAAAGATTTC TATGACTGAT ATTGCCCATC AGCTTTCCAT CTCAACTTCA ACTGTTATTC 10320 

GTAAGCTCAA TGACTTTCAC TTTAAACATG ATTTTTCTTG TCTTCCTGAG ATTATGTCTT 10380 

GGGATGAGTA TGCTTTTACA AAAGGGAAGA T 10411 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2393 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
tV) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 90: 



GTTTTGGGTT 


CTGGAAATTA 


TCAGATGGTT 


GGAAAAGCCG 


TCCACATCAA 


GATAGTGTTC 


60 


GGAGATTTAA 


GTTTAAATTG 


AAGAAACTAA 


CACAGAGGAA 


ATGGAGTATA 


GACCTAACAA 


120 


GACGTATTGA 


GCAACTGAAT 


TTGTCTATTC 


GAGGATGGAT 


AAACTATTGC 


TCATTGGGAA 


180 


ATATGAAAAG 


TATAGTCGCC 


AGCATAGATG 


AGCGCTTGCG 


TACTCGCCTA 


CGAGTGATTA 


240 


TCTGGAAGCA 


ATGGAAGAAG 


AAATCGAGAC 


GATTATGGGG 


ATTGCTTAAG 


TTAGGAGTTC 


300 


CTAAATGGAT 


AGCAGATAAG 


GTATCTGGCT 


GGGGCGACCA 


TTATCAATTA 


GTAGCTCAGA 


360 


AGTCGGTACT 


TAAACGTGCT 


ATATCAAAAC 


CAGTCCTGGA 


AAAACGTGGA 


CTGGTTTCGT 


420 


GTTTGGATTA 


TTACCTTGAA 


CGACATGCGT 


TAAAAGTTAG 


TTGAACCGCC 


GTATGCCAAA 


480 


CGGCACGTAC 


GGTGGTGTGA 


GAGGGGCTAG 


AGATTATCCC 


CTACTCGATT 


AACTCCCCTG 


540 


AAATTTATTT 


TAATTATGCA 


AATTTCACGT 


ATTTTTGATG 


CTGAGACGAC 


GATCCTGGGA 


600 
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ACTTTTCAGA TATTTTTTTG ACTATCTAAA TCTATCATTA GAAAAGCTTA GAGCGCCAAA 660 

GGATTTGAGC GTTTTTCTGA TTTTTAAGAC TTTTTCCAGT CTCTTTTTCG ATTGAAGATG 720 

TAATTATTCT ACTAACTAAC TAACTTCTTA GTACTAGCCA ACAACGATAA TCATAATTCC 780 

TCCTAAAATT AGGAATAATA AAGGCAATAG 'lTrTTGTTTT TTCATGTAAA AAACCTCACT 840 

TTTGTTTTCT GCTATTTTAT GCTAAAATAT TAAAAATCAA ATTTAATTCC AAAGTTTGTA 900 

ACTAAAGGGG GAGCGCTACA TGTCTAATTC ATTTGTCAAG TTGTTAGTCT CTCAATTATT 960 

TGCAAATTTA GCAGATATTT TCTTTAGAGT AACAATCATT GCTAACATAT ACATTATTTC 1020 

AAAATCAGTA ATTGCCACAT CACTAGTTCC TATCTTAATA GGAATATCCT CTTTTGTTGC 1080 

GAGTCTTTTA GTTCCGTTGG TTACTAAAAG GTTAGCGCTA AATAGGGTTT TATCTTTATC 1140 

TCAATTTGGA AAGACTATAT TATTGGCGAT ACTGGTAGGA ATGTTTACCG TAATGCAATC 1200 

CGTAGCGCCT TTGGTGACCT ATCTATTTGT TGTTGCAATT TCCATACTAG ATGGTTTTGC 1260 

AGCACCCGTT TCCTATGCTA TTGTGCCACG CTATGCGACC GATTTGGGTA AGGCTAATTC 1320 

AGCCTTATCA ATGACTGGTG AAGCTGTTCA ATTGATAGCT TGGGGATTAG GTGGACTCTT 1380 

GTTTGCAACA ATTGGTCTGT TACCTACCAC GTGTATCAAT TTAGTCTTGT ATATCATTTC 1440 

TAGCTTTCTG ATGTTATTTC TTCCTAACGC TGAAGTGGAG GTGTTAGAGT CAGAAACTAA 1500 

TCTTGAAATT TTGCTCAAAG GTTGGAAGTT AGTTGCTAGA AATCCTAGAT TAAGACTTTT 1560 

TGTATCAGCA AATTTATTGG AAATTTTTTC AAATACGATT TGGGTTTCTT CC ATT AT ACT 1620 

TGTTTTTGTA ACGGAGTTAT TAAATAAAAC GGAAAGTTAC TGGGCATATT CTAATACAGC 1680 

ATACTCTATT GGTATTATAA TTAGTGGCTT AATTGCTTTT AGGCTATCTG AAAAGTTCCT 1740 

TGCTGCTAAA TGGGAACCCC AATTATTCAC CCCAAATCTA AAAACCATCC AGAATCCTTG 1800 

CCTTAGCTTA GATCCTGGAT GGTTTCTTTT TTCACCCAAT GGCTCTTTTT TACTAGACAA 1860 

AAAAGAGTTT CCCCTTTATG GTATAAGTGT AGAAAAAAAC ACAAAAAGAA AGGAAACTCA 1920 

CATGAACAGT TTACCAAATC ATCACTTCCA AAACAAGTCT TTTTACCAAC TATCTTTCGA 19 BO 

TCGAGGTCAT TTAACCCAGT ATGGTGGTCT TATCTTTTTT CAGGAACTTT TTTCCCAGTT 2040 

GAAACTAAAA GAGCGGATTT CTAAGTATTT AGTAACGAAT GACCAACGCC GCTACTGTCG 2100 

TTATTCGGAT TCAGATATCC TTGTCCAGTT CCTCTTTCAA CTGTTAACAG GTTATGGAAC 2160 

GGACTATGCT TGTAAAGAAT TGTCAGCTGA TGCCTACTTT CCAAAATTGT TGGAAGGAGG 2220 

GCAGCTTGCT TCACAGCCAA CCTTATCCCG TTTTCTTTCC AGAACTGACG AGGAAACAGT 2280 

CCATAGTTTG CGATGCCTCA ACCTTGAATT GGTCGAATTC TTTTTACAGT TTCACCAGCT 2340 
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AAACCAACTC ATTCTACATA ACGATTCTAC CCATTTCACA ACTTATGGCA AGC 2393 
(2) INFORMATION FOR SEQ ID NO: 91: 

H) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4762 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EON ESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

TTTGTATCTT TTTAGGTCTC TTTCAATCCA AACCCTTTAA ACTATACGTC ATTTCGGTTC 60 

CTGCAAGTCT TGTGGTAATT TTAGGTTTGA TTTTACTTTT CTTTTCACAA GAGCCTCTGC 120 

ACGCTTCTTA TTTGATGGTC GTCTTCCCTG TTTTCCTACT TTTATTGGTA ACCAATATTA 180 

AGAGTCAACA GAGGGGGCGT AGTGCTAGAA GAACCCGAAG AGAAACGCCA TTATGCCTAT 240 

GGAGTCGTTT CTTCAAAGGA AATCTATATC TGCTAGTTTT TGGGTTTGTC TATCTTTTGT 300 

CTGTTCCTTT TTTGATGAAG TTTGTCCTTT ATCCAGTACC TTATCAAGAA CGTAATCGTC 360 

TTGCTGATTT GGTAAAAGAG GAGACAAATA CGGAAGATGC TATCTCATGC ATGGGATGAT 420 

ACTGCGACTC TTTATCGTAA GAGTGAGCGC TTGTCCCATC GGCCATTTTG TCCCCGTTGC 480 

ACTATACAGC AACTGAGGAA AATCGTAATA AGTTACTTAA TGACTTGAAA GAAAAACAAC 540 

CTAAGGTGAT TGTGGTAAAT GATAAGGTGG TAGTCTGGTC TGAACTGGAA ACACTCTTAA 600 

AAGAAAATTA CCAACAAGTA AAGACTGATT ACTCAGAGTT TAAAGTCTAT AAAATTAAAT 660 

AACCAAATCA ATATCTTGTG TATTTTTAAA AATTTTAGGA TTTTTAACAC AAGATATTGA 720 

TTTTTCTTTT TAGAGTGGTA TAATACTTTT TAGAAAGAAC ATTTTAGAAA AGAGCATGCA 780 

TATGATTGCA CTAGAAGAAA AAATTACAAT TTTGCCAACT CTCTTCGTCG AGAAACGAGA 840 

TGGGAGACGT GTTGTATTTG ATGTGGACAA GATTGACAAG GCTCTCCACA AGGCGGCTGA 900 

CAAGGTTATG GATGTGACAC CCCTGGTTGA AAAATGCCTC AATGATCTGA CTGAGCGAAT 960 

TATTACAGAA ATTCATAGTC GCTTTCCACA GGGAATTAAG ATTTACGAAA TTCAAAATAT 1020 

CGTAGAACAT GAACTCCTTG AAGCCAAAGA ATATGCGCTG GCTCAGGAGT ATATTACTTA 1080 

TCGGACACAG AGGGATTTTG AGCGCTCAAA AGCGACGGAT ATCAACTTTA GTATTCATAA 1140 

ACTTCTCAAC AAAGACCAGA CAGTTGTCAA TGAAAACGCT AATAAAGACA GTGATGTCTT 1200 

TAACACTCAG CGTGATTTGA CAGCAGGGAT TGTTGGGAAA TCAATCGGAC TGCAAATGCT 1260 

TCCTAAGCAC GTAGCCAATG CCCACCAAAA GGGGGATATC CACTATCACG ATTTGGACTA 1320 

CAGTCCCTAT ACCCCTATGA CCAACTGCTG TTTGATTGAT TTTAAGGGTA TGTTGGAAAA 1380 
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TGGTTTTAAG ATTGGAAATG CAGAGGTAGA GAGTCCCAAG TCTATCCAGA CTGCGACAGC 1440 
ACAGATTTCT CAAATCATTG CCAACGTTGC TTCTAGCCAG TACGGTGGCT GTTCAGCTGA 1500 
CCGTATCGAT GAAATTTTGG CGCCTTATGC AGAGAAGAAT TATCAAAAAC ATCTCAAAGA 1560 
TGCAGAAGAG TGGGTATTGC CTGAAAAACA GGAAGATTAC GCTTGGAAGA AAGCGCAAAA 1620 
GGACATCTAC GATGCCATGC AATCTCTTGA GTATGAAATC AATACTCTCT TCACTTCAAA 1680 
TGGACAAACA CCTTTTACTT CGTTAGGTTT TGGTCTGGGA ACCAGTCGTT TTGAACGAGA 1740 
AATTCAAAAA GCTATTTTAA ACATTCGCAT CAAGGGTCTT GGTTCAGAAC ACCGTACGGC 1800 
TATCTTTCCT AAACTTATCT TTACGCTTAA AAGAGGCCTC AACTTAGAGG AAGGAACTCC I860 
CAACTATGAC ATCAAGCAGT TGGCTCTAGA GTGTGCAACC AAGCGGATGT ATCCAGACGT 1920 
CTTGTCTTAT GATAAGATTG TTGATTTGAC AGGTTCTTTC AAGGTGCCTA TGGGCTGCCG 1980 
TTCTTTCCTT CAAGGGTGGA AGGATGAAAA TGGTGTAGAA GTCAATTCAG GTCGCATGAA 2040 
TCTGGGTGTT GTGACGGTTA ATCTGCCTCG TATTGCTCTT GAGTCTGAAG GTGATATGAA 2100 

TAAGTTCTGG GAAATCTTCA ACGAGCGAAT GAATATCGCA GAAGATGCTC TTGTTTACCG 2160 

TGTCGAACGC ACTAAAGAGG CGACACCAGC GAATGCTCCT ATTCTTTATC AGTACGGTGC 2220 

TTTTGGCCAT CCTCTAGGTA AAGAAGAAAG TGTTGACCAG CTCTTTAAGA ATCGTCGTGC 2280 

GACCGTTTCG CTGGGCTATA TCGGCTTGTA TGAAGTAGCG ACAGTTTTCT TTGGTAACAG 2340 

CTGGGAAAGT AATCCAGATG CTAAGGAATT CACCCTAGAC ATCATTCACG ATATGAAACG 2400 

CCGTGTAGAA GAGTGGTCAG ACCAATATGG CTACCATTTC TCTATCTACT CAACACCATC 2460 

CGAAAGTCTG ACAGACCGTT TCTGCCGACT AGATATAGAC AAGTTTGGCT CTATTCCTGA 2520 

TATCACAGAC AAGGAATACT ACACCAACTC TTTCCACTAC GATGTTCGTA AAAATCCAAC 2580 

ACCGTTTGAA AAATTGGACT TTGAGAAAGT CTATCCGGAA GCAGGTGCGT CAGGTGGTTT 2640 

CATCCATTAT TGTCAGTATC CAGTCCTTCA GCAAAATCCA AAGGCCTTGG AAGCTGTCTG 2700 

GGATTATGCT TATGACCGTG TAGGCTATCT ACGCACCAAT ACTCCGATTG ACCGTTGCTA 2760 

CAAGTGTGAC TTTGAAGGGG ATTTTGAACC AACTGAGAGA GGGTTTGCTT GTCCAAACTG 2820 

TGGCAATAGC GACCCTAAAA CAGTAGATGT GGTGAAACGA ACTTGTGGCT ACCTAGGTAA 2880 

TCCTCAACCA AGACCGATGG TCAACGGGCG TCACAAGGAA ATCGCTGCGC GTGTCAAACA 2940 

TATGAATGGT TCAACGATTA AAATAGCTGG GCATCAAGTA ACAAATTAGA AAGAAATGAA 3000 

ATGGGAAAAT ATCAACTAGA CGATAAGGGG CGCGCACAAG TGACCCGTTA TCACGAGAAA 3060 

CACTCTAAAG GTGGAGCTGG TAAGAAAGAA CGCTTGCTTA GCTTCAGAGA ACAATTTTTA 3120 
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AACAAGAACA AGAAAAAATA AAAGTGAGAG CCAGCTCTCG CTTTTCTCAT AGTCGGAGGT 3180 

AAGGATGGAA TTACGCAGAC CAAGATTAGC GGATAAGAAA GCTCTTTTAG ATATGATGAC 3240 

AGAGTTTGAA AAATTTCAGT CGCCTCACGA CGGCGGTTTC TGGGATACAG AGAACTTTGT 3300 

GTATGAAGAC TGGTTAGAAA GCAATCAGGA ACAGGAAATG GGGATTAATC TGCCTGAAGG 3360 

ATGGGTTTCT GCAATTCAGT TAGTGGCTTT TTCTGAGAAA GGTCAAGCAG TTGGATTTCT 3420 

TAATCTCCGG TTGCGCCTCA GTAACTTTCT ACTAGAAGAA GGTGGCCACA TTGGCTACTC 3480 

CATTCGTCCA TCTGAAAGAG GCAAGGGTTA TGCAAAAGAG ACTCTCCCTC AGGGCTTGCA 3540 

AGTTGCTAAG GAAAAGAACA TCAAGAAAGC TCTGGTGACC TGTAGTGTGA ATAATCCTGC 3600 

TAGCAGAGCA GTCATTCTAG CAAATGGTGG AATATTTGAG GATGCTCGCA ATGGACTCGA 3660 

GCGTTATTGG ATAGAGGTAG CGAATGAATA ATCCAAAACC ACAAGAATGG AAAAGCGAGG 3720 

AACTTAGTCA AGGTCGTATC ATTGACTACA AGGCCTTTAA CTTTGTGGAC GGCGAAGGCG 3780 

TGCGCAACTC TCTCTATGTA TCAGGCTGTA TGTTTCACTG CGAGGGATGT TATAATGTTG 3840 

CGACTTGGTC TTTTAATGCT GGCATTCCCT ATACAGCAGA ATTAGAAGAG CAGATTATGG 3900 

CAGACCTTGC CCAACCCTAT GTTCAAGGCT TGACTTTGCT GGGAGGGGAG CCTTTTCTCA 3960 

ATACTGGGAT TCTCTTGCCA CTTGTTAAGC GGATTCGGAA GGAATTGCCA GACAAGGACA 4020 

TCTGGTCCTG GACCGGCTAC ACTTGGGAAG AAATGATGTT GGAAACTCCA GATAAACTGG 4080 

AATTCTTGTC ACTGATTGAC ATTCTTGTCG ATGGAAGATA TGATCGAACT AAGAGAAATC 4140 

TTATGCTCCA GTTTCGAGGT TCATCTAACC AACGAATTAT CGATCTGCAA AAATCGCTCA 4200 

AAAGTGGGCA AGTAGTGATT TGGGACAACC TCAATGACGG AAAAGAAAGC TATGAACAGG 4260 

TGAAGAGAGA ATGAAGAAAA AGGACTTACT AGACCAACTA GTCTCAGAGA TCGAGACGGG 4320 

GAAAGTCAGG ACACTGGGAA TATACGGTCA TCGAGCTTCA GGTAAATCAA CCTTTGCACA 4380 

GGAATTGTAC CAAGCTTTAG ATTCTACTAC AGTAAATTTG CTAGAGACAG ATCCTTATAT 4440 

CACCTCAGGA CGCCATCTGG TAGTACCCAA GGACGCGCCG AATCAAAAGG TGACAGCCAC 4500 

TCTGCCAGTG GCGCATGAAC TGGAGAGTTT GCAGAGAGAT ATCCTTGCTT GCAGGCGGGT 4560 

ATGGATGTCT TGACAATTGA AGAACCTTGG AAGGCTAGTG AGGTCTTGTC TGGAGCCAAA 4620 

CCAATTTTGA TTGTCGAAGG CATGTCTGTT GGCTTTCTAC CCAAGGAACT CTTTGAAAAA 4680 

ACCATCTGTT TCTACACGGA TGAGGAGACC GAATTAAAGC GACGCCTTGC TAGAGATACG 4740 

ACTGTGAGAA ATCGCGATGC GG 4752 
(2) INFORMATION FOR SEQ 10 NO: 92: 
<i> SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 3832 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 



GATGCAGGTT 


TCGACCCACA 


TATTCCAGAA 


AATTACTTTA 


A A/"* A flV^ A TV A 

AAGATGATUA 




ou 


GTACCTTGTC 


TTTGTTGGTC 


TTCATCTGCA GCCCTCTTTT 


TCAGT AAT1 C> 






GCGGTCTATC 


AGGAGACGCC 


TTTTGATTGG 


AGAAAGATAG 


AAGATGATvjL. 


AIL. iu(.AlAI 




GGGTATTTAT 


AAGAGGAATT 


ATGACATATT 


TAGACGCTTT 


m ft ft n x /** /"**^ 

TAAATCAGO T 




Tift 


TACCGAGTGC 


CCTGCTCTTG 


CATTTTAAGG 


AACTCTTTCC 


TTCTAGCGAC 


GATTTTCTGG 


JUU 


TTTGGCAATT 


TTTCTATTTG 


CAAAATACGA 


CAGGCTTAGA 


AGAAATGTCG 


f+f+H ft ft. A* ft 

CCAAGCCAGA 


JDU 


TTGCTGAAAG 


GATTGGCAAG 


GAAATTTCGG 


ATGTCAACCA 


GTCCATTTCT 


AATCTGACGC* 


4 4 U 


AAAGGGGACT 


GCTCCAGTAT 


CGTACTATCG 


AATTAAATGG 


CGAAATTGAA 


TTGtrTuTTTv* 


a on 


ATGCTAGTTT 


GGCCTTGGAA 


CGTTTGGATG 


ACCTGTTTGG 


AGCAGTTCAT 


TCAAO i IX. AU 


34 U 


ACCAGCTAAC 


ACCTCAAAAC 


CAGCTCAAGG 


ATTTGGTGGA 


AACCTTCCAG 


A /"V* A PflVB/V' 

CAQGAiil IGL* 


QUU 


GACGATTGTT 


GACGCCTTTT 


GAGATTGAGG 


ATTTGACCAA 


GACACTAAAG 


GAAGATGGAA 


CCA 

sou 


CCAGTGCTGA 


CTTGATTAAG 


GAGGCTCTTC 


GTGAAGCTGT 


TTTGAATGGA 


AAACXAAAL I 




GGAAGTACAT 


TCAGGCGATT 


TTGAGAAACT 


GGCGCCATGA 


& ft p^n^^ ft ft 

AGGAATCAAG 


AGTGT\jot_ l\_ 




AAATTGAGGC 


CAAGAGAGCA 


GAAAGAGAAG 


CAAGCAATCC 


TCAGTTGACA 


CAGGTATCTG 


840 


CAGATTTCAT 


AAATGCCATG 


GATCTCTGGA 


AGGATTAATC 


CATGCAAGTA 


GGCTTGAAAT 


900 


CCGAGTAAGA 


TTTGCAAGCT 


GTGTATAATT 


GTGATAGAAT 


AAATAGAAAA 


TAAATTGAAA 


960 


AAAGAGGTAT 


GTGAAATGTC 


ACGTAAACCA 


TTTATCGCTG 


GTAACTGGAA 


AATGAACAAA 


1020 


AATCCAGAAG 


AAGCTAAAGC 


ATTCGTTGAA 


GCAGTTGCAT 


CAAAACTTCC 


TTCATCAGAT 


1080 


CTTGTTGAAG 


CAGGTATCGC 


TGCTCCAGCT 


CTTGATTTGA 


CAACTGTTCT 


TGCTGTTGCA 


1140 


AAAGGCTCAA 


ACCTTAAAGT 


TGCTGCTCAA 


AACTGCTACT 


TTGAAAATGC 


AGGTGCTTTC 


1200 


ACTGGTGAAA 


CTAGCCCACA 


AGTTTTGAAA 


GAAATCGGTA 


CTGACTACGT 


TGTTATCGGT 


1260 


CACTCAGAAC 


GCCGTGACTA 


CTTCCATGAA 


ACTGATGAAG 


ATATCAACAA 


AAAAGCAAAA 


1320 


GCAATCTTTG 


CGAACGGTAT 


GCTTCCAATC 


ATCTCTTGTG 


GTGAATCACT 


TGAAACTTAC 


1380 


GAAGCTGGTA 


AAGCTGCTGA 


ATTCGTAGGT 


GCTCAAGTAT 


CTGCTGCATT 


GGCTGGATTG 


1440 


ACTGCTGAAC 


AAGTTGCTGC 


CTCAGTTATC 


GCTTATGAGC 


CAATCTGGGC 


TATCGGTACT 


1500 
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GGTAAATCAG CTTCACAAGA CGATGCACAA AAAATGTGTA AAGTTGTTCG TGACGTTGTA 1560 

GCTGCTGACT TTGGTCAAGA AGTCGCAGAC AAAGTTCGTG TTCAATACGG TGGTTCTGTT 1620 

AAACCTGAAA ATGTTGCTTC ATACATGGCT TGCCCAGACG TTGACGGTGC CCTTGTAGGT 1680 

GGTGCGTCAC TTGAAGCTGA AAGCTTCTTG GCTTTGCTTG ACTTTGTAAA ATAATCAGTA 1740 

AGTAGCAAAA GCTAGGTGGA ACAGCATTCA GATGTCTGTT ACATTTTTTA TAGGAGAGAA 1800 

AGATTGAAAA CAAAAATTGG ATTAGCAAGT ATCTGTTTAC TAGGCTTGGC AACTAGTCAT 1860 

CTCGCTGCAA ATGAAACTGA AGTAGCAAAA ACTTCGCAGG ATACAACGAC AGCTTCAAGT 1920 

AGTTCAGAGC AAAATCAGTC TTCTAATAAA ACGCAAACGA GCGCAGAAGT ACAGACTAAT 1980 

GCTGCTGCCC ACTGGGATGG GGATTATTAT GTAAAGGATG ATGGTTCTAA AGCTCAAAGT 2040 

GAATGGATTT TTGACAACTA CTATAAGGCT TGGTTTTATA TTAATTCAGA TGGTCGTTAC 2100 

TCGCAGAATG AATGGCATGG AAATTACTAC CTGAAATCAG GTGGATATAT GGCCCAAAAC 2160 

GAGTGGATCT ATGACAGTAA TTACAAGAGT TGGTTTTATC TCAAGTCAGA TGGGGCTTAT 2220 

CCTCATCAAG AATGGCAATT GATTGGAAAT AAGTGGTACT ACTTCAAGAA GTGGGGTTAC 2280 

ATGGCTAAAA GCCAATGGCA AGGAAGTTAT TTCTTGAATG GTCAAGGAGC TATGATGCAA 2340 

AATGAATGGC TCTATGATCC AGCCTATTCT GCTTATTTTT ATCTAAAATC CGATGGAACT 2400 

TATGCTAACC AAGAGTGGCA AAAAGTGGGC GGCAAATGGT ACTATTTCAA GAAGTGGGGC 2460 

TATATGGCTC GGAATGAGTG GCAAGGCAAC TACTATTTGA CTCGAAGTGC TGCCATGGCG 2520 

ACTGACGAAG TGATTATGGA TGGTACTCGC TATATCTTTG CGGCCTCTGG TGAGCTCAAA 2580 

GAAAAAAAAG ATTTGAATGT CGGCTGGGTT CACAGAGATG GTAACCGCTA TTTCTTTAAT 264 0 

AATAGAGAAG AACAAGTGGG AACCGAACAT GCTAAGAAAG TCATTGATAT TAGTGAGCAC 2700 

AATGGTCGTA TCAATGATTG GAAAAAGGTT ATTGATGAGA ACGAAGTGGA TGGTGTCATT 2760 

GTTCGTCTAG GTTATAGCGG TAAAGAAGAC AAGGAATTGG CGCATAACAT TAAGGAGTTA 2820 

AACCGTCTGG GAATTCCTTA TGGTGTCTAT CTCTATACCT ATGCTGAAAA TGAGACCCAT 2880 

GCTGAGAGTG ACGCTAAACA GACCATTGAA CTTATAAAGA AATACAATAT GAACCTGTCT 2940 

TACCCTATCT ATTATGATGT TGAGAATTGG GAATATGTAA ATAAGAGCAA GAGAGCTCCA 3000 

AGTGATACAG GCACTTGGGT TAAAATCATC AACAAGTACA TGGACACGAT GAAGCAGGCG 3060 

GGTTATCAAA ATGTGTATGT CTATAGCTAT CGTAGTTTAT TACAGACGCG TTTAAAACAC 3120 

CCAGATATTT TAAAACATGT AAACTGGGTA GCGGCCTATA CGAATGCTTT AGAATGGGAA 3180 

AACCCTCATT ATTCAGGAAA AAAAGGTTGG CAATATACCT CTTCTGAATA CATGAAAGGA 3240 

ATCCAAGGGC GCGTAGATGT CAGCGTTTGG TATTAAGCGA TGATTTGAAA GAGGGATGTG 3 300 
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ATAGTAGCAC CCTCTTTTTC TTTGTTTTAT GATAGTTCAT CCTCGAGTAA ATTCAAGTTC 3360 

TTGCTCGGAA ATGAAGCTTA TATAGTAGAT TGAATATAGA CAAATACCTT GTGATTGGTA 3420 

AAACATTTTA GAAATTCATT TACCTTTCCT AATCGACTTG GTTTCATCTT ATTTCAATCT 3480 

ATTATAGTAT TGGGGAATTT CTTCAAACCA CATCAGCTTG GTCAGTTCTA CCTGCGACCT 3540 

CAAAACTTGT GCTTTGGTCA AGCTGGGTTT AGTTTCCTAG TTTGCTGATG GATTTCCATT 3600 

GACTATAAGC ATCCAACCCT CTTTTTGTCT TCTAAAGAAT TCTTAAATTA TCAGTCTATT 3660 

GCAACTTTTC TCATATAAGT TCTTTGTCTT GCTATTGGTT TTCCTTAGTA GTATACTAAG 3720 

GTAGTAATCA TTAAGAAGTG GTTACAAAAA ATAATGAATG AGGTAAAGAA AATGGTAGAA 3780 

TTGAAAAAAG AAGCAGTAAA AGACGTAACA TCATTGACAA AAGCAGCGCC GG 3832 
(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY ; linear 

Ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 



TGAAAAAATC 


CTCATGAACC 


TGGCGCCAAT 


AGACAAGTGT 


CTTGTTTCCC 


TCACCTTCCT 


60 


TATAGGCATG 


GTCAGCTGAC 


ACTCGATTGA 


AGGGTTTAAC 


AGAAACCTTT 


GTAATTTCGA 


120 


CAATGCAGAC 


AGCCTGATTT 


TGACTATCTA 


AAATGACATC 


GAAGGTCCCT 


ACTTGGGGAA 


180 


GTGGTTCGTC 


TTCTAGCACA 


TAGAGGTCAT 


AGGCTGATGC 


TGTTGCTGTC 


TTTTCTCCTT 


240 


TAAACACCAA 


ATCCGCTAAA 


AGGTCTGGTT 


CAACTCCAAA 


AGCCCAGGCA 


TCGATTTCAT 


300 


CTCCGATCAA 


AGGATTGATT 


TGCTTGTATT 


TATTCCACAT 


TTCTTGCGGT 


ATCATGGGTG 


360 


CTCCTTTGTA 


ATTTTTTACT 


TTCVIVITTT 


ATGTGTTTAA 


GATGATCTGG 


ATCGTCAATC 


420 


TCTAAATCAA AAATCTCTGG 


AATAGAACTG 


TAGTGGATAA 


TGCACTTGAT 


ACCCAACTGA 


480 


TTCATTTTTT 


GTATGAAAGA 


AGTATTCAGA 


TAGCCTGCTA 


CAGCAAAATC 


AATCTTGTTC * 


540 


TTTCTTGCTT 


TATCCTGCAT 


ATCTCTTAGC 


ATATCTAACA 


TTATTGGACT 


TTCCATATCA 


600 


TGCCATTGAC 


TGTTTCTCAT 


AGTCGCAAAA 


ACAAAGGAAG 


TCAAATCATT 


CATTCCAACT 


660 


ACAATCTTTG AAATGCCCGT 


TTCCAGTATA 


CTAGATAACT 


CAAAATACGC 


TGACGGTAAT 


720 


TCAATCATCG 


TTCCGACTTT 


CCCAGTAAAA 


CCCTGCTGAC 


GCAATACTGT 


AATACCTTGT 


780 


TTTAATTGGT 


CGGCATCATT 


GACAAAAGGA 


AAGATAACAG 


ATAGATTGGG 


GTTGGTTTGA 


840 
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TAAACTTCTG TAACGACATG TGCTTCAGCC TGAAATTCAT CCAAACACGC CAGTAAACGC 900 

CTAGTTCCTC TATAGCCAAA CAAGGGATGC CCTTCGTCAA AAAACTCTTT AGTCCCCACT 960 J 

AAACAATTCG CTTCTGTATT CGTTAATTCA GTAAAACGAT ACCAAACTTC CTTACCTAAG 1020 

TAAAAGGAGC AAATAGTATC AAGATAATCT TTCACAAATT CCTGACAACT TTGTAATAGT 1080 

ATATTTTGAT TGAGCTCTCT CAATAAGTAT TCCCCACGAA - TCATGCCGAC GTGGTGAAAT 1140 

AGTTGAGGAT AAATTTTTTC AAGAATTTTT TCGCCACTAA GGGCAAGTTG ATTTCTCATC 1200 

ATTCACCTTC CAATTCATGT AAGAAGTCTT GTCCAGTTCT GGAAATCCTA ATAATTCAGA 1260 

CTTAACCTTC AAGACTAATG GCGATGCATT TTCTTCTGTA ATCTCTTGAA TATCCATCCA 1320 

AATATATCCA AGTGAATCAT TCGCACCATC AGACACAGCT TCCGAAATCG TAACTTGAGG 1380 

TGCACTCTCA TTCATTTCAA CATCATACAA GGCTATGACA TGGTGAACCA TAAAATTTTT 1440 

TAACTCTTCC CTGACGAAAA CATCGTAGAT TCGAGGATTA GAGTAGCTTC TAACAGTAAA 1500 

TCCCGTCTCT TCCATAACTT CTCTAGTCAG CGTTTCCGTC AGTCCTTCAC CAAGTTGCTG 1560 

ACTGCCTCCA GGTAGATCAT ACCGATGTTG ATAAGGGCCT CTCGTTTTTT CAATGCAAAG 1620 

TAACTTTCCA TTTTCAAAGC AAACACAGTA GACCCCAAAG TCATTTTTGA TTTCCATCCA 1680 

ACTCCTCCTA CTTCAAAGAC CAGCCACCAT CTATTGTCAA GATTTGTCCT TGCATGGCGC 1740 

TCGCTTTTCC ACTTGCTAAA AAAAGACTAA GCTCTGCTAT TTCCTCTGGC TCAATCCAGC 1800 

GCTTGATTGG GGTTTCACTA GCCACCCAGT CACCCAAACC ACCTGGTTCA AAATCCGCAG I860 

CGGTCATAGC TGTCTTGACT GCTCCTGGAG CGATACCAAA GACCTGAATC CCAGCTTCAG 1920 

CATAGTCTAG AGCCAACTGC TTGGTGAAGC CAGCCAAGGC ATGCTTGGAT GAAGTATAGG 1980 

CGTGACCACC TCCACCTGCT AGGCTAGAAG CAATGGAACA CATATTGATG ATGATTCCCT 2040 

TTTTATTTTC CAGCATTTGT GTCAAATAAT ACCGAGTCAA CTCTACTGGA ATAATGTAGT 2100 

TGATTTCAAA AATCTCTTGA ATGTCCTGCG CCGTTTGTTC CAACAGTGGT TTGTAATCAT 2160 

CCAAAACTCC AGCAGTATTA CACAAAACAT CCACCTGAGG GCACCAGTCA AAAATAGGTT 2220 

CCAAGTCCAA GGTCAAATCT CTCTGTAAAA AGCGAAAATC ACCCTCTAAG AGTGCCTTTT 2280 

CACCTTGGTC AACTCCATAA ACTTGATAGC CCTTCTCTAA AAAGAGGCGA GCTTGAGCCA 2340 

ATCCGATCCC TGAACTCACT CCTGTAATGA GTACACGTTT AGTCATGCAC TTCTACCCAA 2400 

TCCGTTGCCA AAACATCACA AACTGTCGGG CTCCACATGG AAAAACCTTC TCCTTCGCCA 2460 

GAAACGTTGA TTAGGAAATA AGGTGTCATT TCAAGTGCAA GCCCATTTTG CTCGATGGTA 2520 

TCAAAGAGTT GGACATAGTT TTCCGCACCT CCCCAACCAG TTCGTACATA TTTTCTCTTA 2580 

GCCTTTAACC CAGGCAGGAT CTCTTCAAAT GTCATGTTTT TCTCCTTTAA TTCTACATTC 2640 
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TTCATTTAAT TATAGCAAAA AACCGCTTTA TACGGCTTTT TGAATGTGAG TTATTCAAAC 2700 

CTGCTACTAC TTACGGCAAA TTATTCCCTG CAGCAAGATA AATTTCATAC CATTCTTTTC 2760 

TTGTTAAGCT AAAGTTTGCC GCTCGGCTAA CTTCTCTCAA GTGCTTAGGA TTTGTTGTAC 2820 

CTACGACTGC CTGCATTTTT GCTGGATAAC GCAATATCCA AGAAATGGCA ATAGTTGAAG 2880 

AGGTTACTCC ATATTTAATA GCTAAACGAT CAAGTACTTG ATTTAAAGCT TGAAATTTCT 2940 

CATTTCCAAC AAAATTCCCT TTAAAATACC CGAATTGTAA GACAGACCAT GCTTGAATGA 3000 

CCACATCGTG TAATTGGCAA TATTCAAAAA TGCTGCCATC TCGCATAGCT GCTTGACTAT 3060 

CTTCCATATT AACATGAAAA GCTCATTCAA ATCCTGGAGT AAAAGCCGCA CTCAATTGTA 3120 

CCTGATTAAC AGCTAACGGC TGCTTGACAT CTTTTTTAAG CAACTCCATC ATCATAGGAT 3180 

TTTGATTAGA AACTCCAAAA TCTCGAACTT TACCTTGTTT ATAAAGGAGA TTAAAGGCTT 3240 

CTGCTACTTG GTCAGATTCC ATCAAAGCAT CTGGTCCATG AAGGAGCAAG CTATCTAGAT 3300 

GATCAATCTT CAATCTTTGC AAAATACCGT CTACTGATTT TATAATATAG TCCTTAGAAA 3 360 

AATCAAAATA GGTAAATTCT TCAATGCGAA TGCCACATTT GGACTGAATC CACATCTTTT 3420 

CTCTTAAATC TGGACGATTT TTTAGGACAA GACCTAACAG TTCTTCACAA CGACCACGAC 3480 

CATAAATATC AGCCAAGTCG AAGGCATTGA TTCCAACAGA AAGTGCTGTT TCTACAAGCT 3540 

CTTCAACTTC TTTTACAGAT TTATCTTTTA TTCTCATCAT TCCGAGAACA ATTTCTGATA 3600 

ATTCTTTGTC ATCTTGACCA AGAGTTATGT ATCTCATCAA ATTTTTCTCC TTTAATTTCT 3660 

AACATTCTTC CCTTCATTAT AACAAAAAAC CGCTTTGCAA CGACTTTTTG ACTATACTTC 3720 

ACTCCATTTT ATCTTCTTAA ACCCACGGAA CAAGACAAAG ATTCCAATAA AGAGGACAGC 3780 

TAAAGGAATA ACTTTTGTAA GGAAAACATT TGAAATTCCC ATCCACTCAT AATAACGGAG 3840 

CAGAGAACCC ACCACAAGAT GGGCAATAAT CATACTGACA AATGGACGAA AGACCCCTTC 3900 

TTTCCAATTC CAAATACCGA TAACTAGCGA AATCGTAAAG ACAGACAAAC TATCCCAGGG 3960 

AGCCGGAATA TAAAAGGCTC CTTCTTGTAT GAAGCTTGCC ATTCCTACAT ATCCTAAAAC 4020 

AACTAGAAGA ACTATAGTCC CAACAACAAT GTAAGTGCCA ATTTTCATTT TAGGAGAATC 4080 

TTGGACTAAA CTTCTTCGTA AAATTGTGGC CACAAGTCCA AATCCAATCA GAAAAATAAG 4140 

AAGTTGCCCT AAAAATGTGA GCAAATTGAC TGTTAAGAGA GGACCTTTAG AAAAATCACT 4200 

TAGTAGTTGA TAATAACGTA ATACCGCCAG GACAAGAATT GGCGTCAAAA GGGACTCTTT 4260 

GATAGAACTG CGAGGTGCTC CCTTGAGAAT CTCTTTCATT ATTTTTTTAG GATTCTTACC 4 320 

TAGATAATCC TCTGCACTCA TGCCATCTCG TTCTGCTTCT GAGAAATCTA GCATCATCAA 43 BO 
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ATAGATCTGC TCTCTGAGAT AGTCTTCATC ATAGAGAAAT CCAGCAAGAT TAAAACTTTC 4440 

CCACAACTCC TCAAAATACT TTTGATTCTC CTCAGAAAAC TCATGTAGCA AAGCGCTTCT 4S00 

TTCTTCGTAA TACTTCATTT TCTTCATGGT TTAACCCCCA TTCTTAATCC CTTCTACTTT 4560 

TTGACTCAAA TCGTCCCATT GTTGCCAAAA GACTGAGACA CGCTCTTCTC CTTCTTTCAT 4620 

TAATGAAAAA TACTTCCGAT CTGGACCATC TGGCGACGGG CGCATGTCGC CTCTTATCCA 4680 

TTGATTTTTT TCTAACTTTT GCAACAAAGG ATAAATAGTT CCTGGAACGA TAGTATCAAA 4740 

TCCAGCCTCT CGCAAAGTCT GAACCAACTC ATAACCATAC CGCTCTTTTT GACCAATCAT 4800 

ATCCAAGACA CAACCTTCAA GAACACCTTT TAATAGCTGA GTTTCTTTCA TCACTTCTCC 4860 

CTTCTAATCT ATTTTGTAAT ACCTACTAGT GACTTCACCT ATAGTATATC ACTTCTACAC 4920 

TAGTTTGTAA AGCATAATAG TTAATACTCT TCGAAAATCT CTTCAAACCA CGTCAGCGTC 4980 

GCCCTACCGT ATGTATGGTT ACTGACTTCG TCAGTTTCAT CTACAACCTC AAAAACATGT 5040 

TTTGAGCTGA CTTCGTCAGT TTCATCTACA ACCTCAAAAC AGTGTTTTGA GCTCACTTCG 5100 

TCAGTTTCAT CTACAACCTC AAAACAGTGT TTTGAGCTGA CTTCGTCAGT TTCATCTACA 5160 

ACCTCAAAAA CATGTTTTGA GCTGACTTCG TCAGTTTCGT CTACAACCTC AAAACAGTGT 5220 

TTTGAGCAAC CTGCGGCTAG CTTCCTAGTT TGCTCTTTGA TTTTCATTCA GTATAAATAA 5280 

AAAAACAGAA CTAGCCTGAA CTAGTCCTCT CTACTTTTAC CCAATCACAC TTCCATTTGG 5340 

TACAGCTGGA TCAACTGTGA GAAGGGTTAA TTTGCCATCA TGTTCAGCTG AGAGAATCAT 5400 

ACCCTGGCTG ACATATTTTT TCATCATTTT ACGTGGTTTG AGGTTAGCAA CGATTTGAAC 54 60 

TTTCTTGCCG ACCAATTCTT GTTCATTTGG ATAGTATTTT GCAATTCCTG AAAGAATCTG 5520 

ACGATCTTCT CCATCACCAG CATCCAACCG GAATTGAAGC AACTTATCTG AACCTTCTAC 5580 

TTTAGACACT TCTTTGACTT CTGCGACACG GATTTCAACC TTGTCAAAGT CTTCAAACTT 564 0 

GATTTCATCC TTGTTTAGTT TGAGCTCAAC TTCGTCCGGA TTCCATTCTT TTTCGACTGC 5700 

TGGTTTATTG CCTTCCATTT GTTCCTTGAT ATAGGCGATT TCTTCTTCCA TATTTAGACG 5760 

TGGAAAGATA GGTGTTCCTT TGGCAACTAC AGTCACATCT GCTGGGAAGT CAGCCAAACT 5820 

CAAGTTTTCA AGACTAGAAA CTTCTTCCAA ACCAAGTTGA GTCAAAACTG CACGACTACT 5880 

TTCCATCATA AATGGTTCAA TCAAGTGAGC AACTACACGA ATGCTGGCTG CCAAGTGGCT 5940 

CATGACACTT GCCAATTGGT CACGAAGAGC TTCATCCTTG GCCAAGACCC ATGGTGCGGT 6000 

CTCATCGATG TATTTATTGG TACGAGAGAT CAGAGTCCAG ACTGCTTCAA GCGCACGTGG 6060 

ATAGTCAACT GCTTCCATGT GTGTATGGAA GTCTGCGATT GATTGTwCTG CAACCTCAGC 6120 

AAGAACATGA TCATATTCAG TCACACCTTC TACATAGCCA GGGATTTGTC CATCAAAGTA 61 BO 
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CTTATTAATC ATGGAAACCG TACGGTTAAG GAGCTTCCCA AGGTCATTAG CCAATTCATA 6240 

GTTGATACGG CCGACATAGT CTTCAGGAGT AAAGGTTCCC TCTGAACCAA CTGGAAGGTT 6300 

ACGCATGAGG TAGTAACGAA GTGGATCTAG TCCATAACGC TCTACCAACA TTTCAGGGTA 6360 

AACGACATTC CCTTTTGACT TAGACATTTT TCCGTCTTTC ATGACAAACC AACCATGGGC 6420 

AATCAAACGA TCAGGTAATT TAACATCCAA CATCATAAGA AGGATTGGCC AGTAGATAGA 6480 

GTGGAAGCGA AGGATATCTT TTCCTACCAT ATGGAAGACT GTTCCATTCC AGAACTTGTC 6540 

AAAGTTACCA TGTTCGTCTT GAGCGTAGCC AAGAGCTGTC GCATAGTTAA GAAGGGCATC 6600 

AATCCAAACG TAGACAACGT GTTTTGGATT TGATGGGACA GGCACTCCCC ATGTAAAGGT 6660 

TGTACGAGAT ACCGCCAAAT CTTCCAAGCC TGGCTCGATG AAGTTGCGTA GCATTTCATT 6720 

AAGGCGACCA TCTGGCGTGA TAAATTCAGG ATGAGCTTTG AAAAATTCGA CCAAACGGTC 6780 

TTGGTATTTG CTAAGGCGAA GGAAGTATGA TTCTTCAGAA AcCCATTCAA CCTCATGACC 6840 

TGATGGAGCA ATACCACCAG TCACATTTCC AGCTTCATCA CGGAAAACTT CTGCCAGCTG 6900 

CCTTTCTGTA AAGAATTCTT CGTCTGATAC TGAATACCAA CCAGAGTATT CACCCAAGTA 6960 

GATATCATCT TGAGCAAGTA AGCGTTCAAA GACTTGTGCG ACAACTTTTT CATGGTAGTC 7020 

ATCAGTTGTA CGGATAAATT TATCGTATGA GATATCTAGT AATTGCCAGA GTTCTTTAAC 7080 

TCCAACCGCC ATTCCATCAA CATAGGCTTG AGGTGTAATA CCAGCTTCTT CCGCTTTCTG 7140 

CTGGATTTTC TGACCATGTT CATCAAGACC TGTCAGATAA AATACATCGT AGCCCATCAG 7200 

GCGTTTGTAA CGTGCTAGGA CATCACATGC GATAGTTGTG TAGGCACAAC CGATATGAAG 7260 

TTTCCCAGAT GGATAGTAAA TCGGCGTTGT AATATAAAAA TTTTTTTCAG ACATAATTTT 7320 

TCCTTTCCAG GCAAATGAAA CCTGTTTTTC TAACACTTCA TTATATCACA TTTTTAATGA 7380 

ATTTCAATAG GGAAATCCAT ACAAAAACAA GATAGACGAG TGTCCATCTT GTTGATCTCA 7440 

TTCATAACGA AGGGCTTCAA TTGGATCAAG TTTCGATGCC TTGTTGCCTG GCAAGACTCC 7500 

AAAAATCATA CCAACACTAG CCGAAACTGC AAGACTAAAT AGGGCGACTG GGATTGATAC 7560 

TCCAACTTCT ATACCTTCTA TTAAACCTTG CAGTAACAAA CCTGCTAAGg CAGTTAAACC 7620 

ACTTGCAATT GTCAAGCCAA TTAAGCCACC TAACAAGGTC AAAATCATGG ATTCAATCAA 7 680 

AAACTGAATT AAAATATTGG CACGTGTTGC ACCCAAAGCC TTACGAAGAC CAATCTCACG 7740 

AGTGCGCTCT GTCACCGAAA CCAGCATGAT GTTCATGACA CCAGTTCCTC CAACAAAGAG 7800 

AGAAATCCCT GCGATGGAAC TAATAATCGT CGTCATAAAA CTAAACGATT GTTGAATTTC 7860 

TGCAAATACA ACGGACTCAT CTGCCACCTG GTATTCTCCC TGTTGTAAGC CTGCAAGCTC 7920 
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TGTCATTTTT 


CGTGCCAGTT 


CTGGACCCAG 


AGTTGGGGTT 


AAACTGGTAT 


CATTCACTCG 


7980 


AAAGACAATA 


TTAGCTATTT 


CATCTACATT 


AAAATTCGCA 


GCAAGGGAGA 


TATTGGTAGT 


8040 


AATAGGCAAG 


CCACCAAACC 


CATATATTTT 


TGATCTTTTA 


GCCTCCGGAC 


TAGTATAAAC 


8100 


CCCAATGACC 


CGGTAACTAA 


ATCCATTGAC 


TTCTACAACC 


TTGTTAATAG 


CCTCTTGAGG 


8160 


AGATTCAAAT 


AAACTAATGG 


ACAATTCCTC 


ATCTAGCAAA 


ATGACACTTG 


CAAACTCTTT 


8220 


GAAATCTTGC 


TCTCTCAGAC 


TACGACCTGC 


AATAATTTCA 


TTCTTAACAG 


CGTCCATGTA 


8280 


AGTTCTGTTT 


CCACCTGTCA 


AATTAGCATT 


CTCAACCTTT 


TTATCTTGAT 


AGGTCAAGAT 


8340 


GGCATTCGTT 


GAATTGGTTA 


CATAGTAACT 


ATCCACTCCC 


TTCAGTTTAG 


CTGCCTCTTG 


8400 


GACCCAGGAT 


TCTTGCGGTT 


TTGGCGGTTC 


AACAGGAACT 


TCCTCTTCCT 


TTCCAGAAAC 


8460 


CGTAAAAGCT 


GATTGTTTCT 


GAGTAAAAGA 


CCCGTCTTTA 


CTTTTTTTAG 


GAGAGAAAAA 


8520 


GACGCTAATA 


TTTTTCTGAG 


ATTTAGTCAT 


ATCTTTATTG 


ACTTGACGAG 


ATAGCGAATC 


8S80 


ACCCAAAGCC 


ATAATCACAA 


CAACTGATGA 


AACACCGATA 


ATAATCCCAA 


TCATAGTAAG 


8640 


CAAAGAACGC 


ATCTTGTGAG 


CCATGATAGA 


TGAAAAGGCA 


AATTTCAGAT 


TCTCCATCTT 


8700 


AGTTTTCCTC 


CTTTCCTAAC 


TGAGCACTGT 


CAGACGAAAT 


GACCCCATCC 


CGAATGACAA 


8760 


TCTGACGTTT 


GGCATAGGCA 


GCAATCTCAC 


GCTCATGCGT 


TACCATGATA 


ATGGTTTTTC 


8820 


CTTCTTTATT 


CAAATCAACC 


AATAATTGCA 


TAATTTGGTT 


ACCTGTTTTG 


GTATCCAAGG 


8860 


CTCCTGTCGG 


TTCATCCGCT 


AGGATAATAG 


AAGGATTGTT 


TACCAAGGCA 


CGCGCAATGG 


8940 


CTACACGTTG 


CTTTTGACCA 


CCAGATAATT 


CTGAAGGTAA 


ATGGTGACTA 


CGTTCTGTCA 


9000 


ATTCAACCTT 


GTCTAAATAT 


TCCTCAGCCA 


ACTTGCGACG 


TTTTGAAGAC 


GAAACTCCTG 


9060 


CGTAAATCAA 


GGGCAATTCT 


ACATTTTGCA 


GAGCATTGAG 


CTTCGATAGA 


AGAAAGAACT 


9120 


GCTGAAAGAC 


AAAACCGATT 


TGTTGGTTAC 


GGACCTTAGC 


TAGTTGTTTT 


TCACCAAGCC 


9180 


CAGCCACTTC 


TTGACCTTCA 


AGATAATATT 


CTCCACTGGT 


TGGTGTATCC 


AACATGCCAA 


9240 






TTACCAGACC 


CAGATGGTCC 


CATGATGGCT 


ACAAATTCAC 


9300 


CCTCATTCAC 


TTCTAGATTG 


ATATTTTTGA 


GAACCTGCAG 


TTCTTGGTCA 


CCATTACGGT 


9360 


AACTTCTGAA 


GATATTTTTT 


AGACTAATTA 


GTTGCTTCAT 


CAGCCTTCAC 


CTCTTTTCCT 


9420 


TCTTCCAAGG 


AAGATGTTGG 


ATTACTGATG 


ACCTTAGCAC 


CGTTCGTTAA 


ACCAGAAGTG 


9480 


ATTTCTTGAT 


TTTCTGCGTC 


AGCATTTCCC 


AATGAAACCT 


CAACTTTTTT 


AGCCTTTTGT 


9S40 


TGTTCATCCA 


CAATCCAGAC 


ATAATTTTTA CTATCATCCA TTACTAGACT 


GCTAACAGGA 


9600 


ACAAGAATAG 


CCTTAGTTTT 


GCTTTTAACC 


TCAATG1TGA 


CAGAAAAACC 


TTGTTTCAAA 


9660 


TCACCAACCT 


CGCCTGTCAC 


ATCAATAGTA 


TAAGGGTATT 


TAGAACCTGT 


ATTATTCCCG 


9720 
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GCTGCTGGAC TAGCTGCTTC ACCATTGTTT TTAGGATAGT CACAAATATA GCTTAATTTC 9780 

CCAGTCCATT TTTTATCAGG ATACACTTTA GAAGTAAAGC TTACTTCTTG ACCTACAGAA 9840 

AGGTTGGCTA GATTGTACTC AGACAATTCT CCCTTGACTT GTAAATTTTC ATTGCTGACA 9900 

ATATGAACCA TAACTTGACT CGCCCCTCTT GGAGATTTAG AAACATTGCT ATTGACTTCG 9960 

ACCACAGTTC CCTCTAGGGT ACTGAGAACA CTTCTTGCAT CCAATTGACT TTGAGCCTTG 10020 

CTTAATTGCG CCGCAGCATC TGCACGCGCA TCACGGGCAT CACCCAATTG ACCGTCAATA 10080 

GAAGCAACAG AATTTCCAGC CACTGGAGTT GGGCTTTGCA CCGTTGCATC TTCTCCTCCT 10140 

ACTGGCGCTG GTAACTGTGG AGCCGGAGCT GAAGCGGCTT CATTTCGTGC TTGATTGAGT 10200 

TCATTGATAT GACGATCTGC CCTAGCTACT GCTCGACTAG CTGAATCATA GGCCGCCTGC 10260 

GCTTCTGAAC TACTGTACTT GACTAAAGCC TGCCCTTCGC TGACCTTATC GCCCACAGAA 10320 

ACAAGGATTT CATCTAAATC ACCCTTACTA GCATCAAAAT AAACATATTG TTCATTTTTT 10380 

GCTGTTACTG TCCCTGACAA TAAAACAGAG GAGGCCACGC TTCCTTCCTT GGCAACAACA 10440 

AGATGAGTAG GCTCATCTTT TAGAGCAGTC TGAGAAGGTT GTCTAAAGAG TAAAATCCCC 10500 

CCAGCACCCA ATACAACTAC ACTCGCAGCA CCGATTGCTG CATACAGTTG CCACTTTTTA 10560 

GCTTTACCAT TCTTTTTCTT CATAATGAAA CTCCTTTTCT TTTTTACAAT ACTTTGCTAT 10620 

TATACCAAAT TTCCCTCCAG CAAACAATAC AGTTCAGGAT TAAACAATCG TTCGGAATTT 10680 

TGCTTTTCGG 10690 
(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8195 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

GAGAAAGCGC CCACGTTTCC CCGAAGGGAG AAAGGCGGAC AGGTATCCGG TAAGCGGCCA 60 

GGGTCGGAAC AGGAGAGCGC AACGAGGGAG CTTCCCAGGG GGAAACGCCT GGTATCTTTA 120 

TACTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG 180 

GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TTACGGTTCC TGGCCTTTTG 240 

CTGGCCTTTT GCTCACATGT TCTTTCCTGC CTTATCCCCT GATTCTGTGG ATAACCGTAT 300 

TACCGCCTTT GAGTGAGCTG ATACCGCTCG CCGCAGCCGA ACGACCGAGC GCAGCGACTC 360 
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AGTGAGCGAG GAAGCGCAAG AGCGCCCAAT ACGCAAACCG CCTCTCCCCG CGCGTTGGCC 420 

GATTCATTAA TGCAGCTGGC ACGACAGGTT TCCCGACTGG AAAGCGGGCA GTGAGCGCAA 480 

CGCAATTAAT GTGAGTTAGC TCACTCATTA GGCACCCCAG GCTTTACACT TTATGCTTCC 540 

GGCTCGTATG TTGTGTGGAA TTGTGAGCGG ATAACAATTT CACACAGGAA ACAGCTATGA 500 

CaTGATTACG AATTCGAGCT CGGTACCCGG AAAATCCAGA AAATGCTTGA AAAAAATCCT 660 

AGAAGATGGT ATAATACTAA ATTGTAAGGG TTATCACATA TAACTCAAAA AAAGAAAGAA 720 

CAAAAGGAGA GTCAAACTAT GGCTTCTAAA GATTTCCACG TAGTGGCAGA AACAGGTATT 780 

CACGCACGTC CAGCAACATT GTTGGTACAA ACTGCTAGCA AATTTGCTTC ACATATCACT 840 
CTTGAGTACA AAGGTAAATC AGTTAACCTT AAATCAATTA TGGGTGTTAT GAGTCTTGGT . 900 

GTTGGCCAAG GTGCTGACGT AACTATCTCA GCTGAAGGTG CAGATGCAGA TGACGCTATC 960 

GCTGCAATCT CAGAAACAAT GGAAAAAGAA GGATTGGCAT AAGGGAAATG ACAGAAATGC 1020 

TTAAAGGAAT CGCAGCATCT GACGGTGTTG CAGTTGCAAA AGCATATCTA CTCGTTCAGC 1080 

CGGATTTGTC ATTTGAGACT ATTACAGTCG AAGATACAAA CGCAGAAGAA GCTCGCCTTG 1140 

ATGCCGCTCT ACAGGCATCA CAAGACGAGC TTTCTGTTAT TCGCGAGAAA GCAGTAGGTA 1200 

CGCTCGGTGA AGAAGCAGCT CAAGTTTTTG ATGCTCACTT AATGGTTCTT GCTGACCCAG 1260 

AAATGATCAG CCAAATCAAG GAAACTATCC GTGCGAAGAA AGTGAATGCA GAAGCAGGTC 1320 

TGAAAGAAGT TACAGATATG TTTATCACTA TCTTTGAAGG CATGGAAGAC AACCCATACA 1380 

TGCAAGAACG CGCAGcGGAT wTCCGCGACG TGACAAAACG TGTATTGGCA AACCTTCTTG 1440 

GTAAAAAATT GCCAAACCCA GCTTCTATCA ATGAAGAAGT GATTGTGATT GCGCATGACT 1500 

TGACTCCTTC AGATACAGCT CAATTGGACA AAAACTTTGT AAAAGCTTTT GTAACCAACA 1560 

TTGGTGGACG TACAAGCCAC TCAGCTATCA TGGCACGTAC ACTTGAAATT GCTGCTGTAT 1620 

TAGGTACAAA TAACATCACT GAAATCGTTA AAGACGGTGA CATCCTTGCT GTTAACGGGA 1680 

TCACTGGAGA AGTGATTATC AACCCAACAG ATGAACAAGC GGCAGAATTT AAAGCAGCTG 1740 

GTGAAGCCTA TGCGAAACAA AAAGCTGAAT GGGCACTTTT CAAAGATGCT CAAACAGTGA 1800 

CTGCTGACGG TAAACACTTC GAGTTGGCTG CTAATATCGG TACTCCAAAA GACGTTGAAG 1860 

GTGTTAACAA CAACGGTGCA GAAGCTGTTG GACTTTACCG TACAGAGTTC TTGTACATGG 1920 

ATTCTCAAGA CTTCCCAACT GAAGATGAGC AGTATGAAGC ATACAACGCT GTTCTTGAAG 1980 

GAATGAACGG TAAACCTGTT GTCGTTCGTA CAATGGATAT CGGTGGAGAT AAGGAACTTC 2040 

CTTACTTCGA TATGCCTCAC GAAATGAACC CATTCCTTGG ATTCCCTGCT CTTCGTATCT 2100 

CTATCTCTGA GACTGGAGAT GCTATGTTCC GCACACAAAT CCGTGCTCTT CTTCGTGCGT 2160 
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CTGTTCACGG TCAATTGCGT ATCATGTTCC CAATGGTTGC GCTCTTGAAA GAATTCCGTG 2220 

CAGCGAAAGC AGTCTTTGAT GAAGAAAAAG CAAACCTTCT TGCTGAAGGT GTTGCAGTTG 2280 

CGGATAACAT CCAAGTTGGT ATCATGATCG AGATTCCTGC AGCGGCTATG CTTGCAGACC 2340 

AATTTGCTAA AGAAGTTGAC TTCTTCTCAA TTGGTACAAA CGACTTGATC CAATATACAA 2400 

TGGCAGCAGA CCGTATGAAC GAACAAGTTT CATACCTTTA CCAACCATAC AACCCATCAA 2460 

TCCTACGCTT GATTAACAAT GTGATCAAAG CAGCTCACGC TGAAGGTAAA TGGGCTGGTA 2520 

TGTGTGGTGA GATGGCTGGT GACCAACAAG CTGTTCCACT TCTTGTCCGA ATGGGCTTGG 2580 

ATGAGTTCTC TATGTCAGCA ACATCTGTAC TTCGTACACG CAGCTTGATG AAGAAACTCG 2640 

ACACAGCTAA GATGGAAGAG TACGCAAACC GTGCCCTTAC AGAATGCTCA ACAATGGAAG 2700 

AAGTTCTTGA ACTTCAAAAA GAATACGTTA ATTTTGATTA ATCGAAAAGT CCCTGCAACT 2760 

CAGTTACAGG GATTTTTTTG ATATTTTAAA AAGAATTTTC AAGAAAATCT TTCTTATAGA 2820 

AAGTCCAACC TTGAAAAAGT AGTGGTCAGA ACAAAAAATA CTTAAATGGT TCATAAAATT 2880 

CTTGACAAGT TGGATATTTA GGAGTAAACT ATTAACCAGT TAAGTAATAG AGAGGAGTTT 2940 

CTGCAATTTA GAAATGAATT GCAACTAGAA ATATCAAATA GAAACAGAGT TTCGATGAAA 3000 

ATTAATAAGA AATACCTTGT TGGTTCTGCG GCACTTTGAT TTTAAGTGTT TGTTCTTACG 3060 

AGTTGGGACT GTATCAAGCT AGAACGGTTA AGGAAAATAA TCGTGTTTCC TATATAGATG 3120 

GAAAACAAGC GACGCAAAAA ACGGAGAATT TGACTCCTGA TGAGGTTAGC AAGCGTGAAG 3180 

GAATCAATGC TGAGCAAATC GTCATCAAGA TAACAGACCA AGGCTATGTC ACTTCACATG 3240 

GCGACCACTA TCATTATTAC AATGGTAAGG TTCCTTATGA CGCTATCATC AGTGAAGAAT 3300 

TACTCATGAA AGATCCAAAC TATAAGCTAA AAGATGAGGA TATTGTTAAT GAGGTCAAGG 3360 

GTGGATATGT TATCAAGGTA GATGGAAAAT ACTATGTTTA CCTTAAGGAT GCTGCCCACG 3420 

CGGATAACCT CCGTACAAAA GAGGAAATCA ATCGACAAAA ACAAGAGCAT AGTCAACATC 3480 

GTGAAGGTGG AACTCCAAGA AACGATGGTG CTGTTGCCTT GGCACGTTCG CAAGGACGCT 3540 

ATACTACAGA TGATGGTTAT ATCTTTAATG CTTCTGATAT CATAGAGGAT ACTGGTGATG 3600 

CTTATATCGT TCCTCATGGA GATCATTACC ATTACATTCC TAAGAATGAG TTATCAGCTA 3660 

GCGAGTTGGC TGCTGCACAA GCCTTCCTAT CTGGTCGAGG AAATCTGTCA AATTCAAGAA 3720 

CCTATCGCCG ACAAAATAGC GATAACACTT CAAGAACAAA CTGGGTACCT TCTGTAAGCA 3780 

ATCCAGGAAC TACAAATACT AACACAAGCA ACAACAGCAA CACTAACAGT CAAGCAAGTC 3840 

AAAGTAATGA CATTGATAGT CTCTTGAAAC AGCTCTACAA ACTGCCTTTG AGTCAACGAC 3900 
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ATGTAGAATC TGATGGCCTT GTCTTTGATC CAGCACAAAT CACAAGTCGA ACAGCTAGAC 3960 

GTGTTGCAGT GCCACACGGA GATCATTACC ACTTCATCCC TTACTCTCAA ATGTCTGAAT 4020 

TGGAAGAACG AATCGCTCGT ATTATTCCCC TTCGTTATCG TTCAAACCAT TGGGTACCAG 4080 

ATTCAAGGCC AGAACAACCA AGTCCACAAC CGACTCCGGA ACCTAGTCCA GGCCCGCAAC 4140 

CTGCACCAAA TCTTAAAATA GACTCAAATT CTTCTTTGGT TAGTCAGCTG GTACGAAAAG 4200 

TTGGGGAAGG ATATGTATTC GAAGAAAAGG GCATCTCTCG TTATGTCTTT GCGAAAGATT 4260 

TACCATCTGA AACTGTTAAA AATCTTGAAA GCAAGTTATC AAAACAAGAG AGTGTTTCAC 4320 

ACACTTTAAC TGCTAAAAAA CAAAATGTTG CTCCTCGTGA CCAAGAATTT TATGATAAAG 4380 

CATATAATCT GTTAACTGAG GCTCATAAAG CCTTGTTTGA AAATAAGGGT CGTAATTCTG 4440 

ATTTCCAAGC CTTAGACAAA TTATTAGAAC GCTTGAATGA TGAATCGACT AATAAAGAAA 4500 

AATTGGTAGA TGATTTATTG GCATTCCTAG CACCAATTAC CCATCCAGAG CGACTTGGCA 4560 

AACCAAATTC TCAAATTGAG TATACTGAAG ACGAAGTTCG TATTGCTCAA TTAGCTGATA 4620 

AGTATACAAC GTCAGATGGT TACATTTTTG ATGAACATGA TATAATCAGT GATGAAGGAG 4 680 

ATGCATATGT AACGCCTCAT ATGGGCCATA GTCACTGGAT TGGAAAAGAT AGCCTTTCTG 4740 

ATAAGGAAAA AGTTGCAGCT CAAGCCTATA CTAAAGAAAA AGGTATCCTA CCTCCATCTC 4800 

CAGACGCAGA TGTTAAACCA AATCCAACTG GAGATAGTGC AGCAGCTATT TACAATCGTG 4860 

TGAAAGGGGA AAAACGAATT CCACTCGTTC GACTTCCATA TATGGTTGAG CATACAGTTG 4920 

AGGTTAAAAA CGGTAATTTG ATTATTCCTC ATAAGGATCA TTACCATAAT ATTAAATTTG 4980 

CTTGGTTTGA TGATCACACA TACAAAGCTC CAAATGGCTA TACCTTGGAA GATTTGTTTG 5040 

CGACGATTAA GTACTACGTA GAACACCCTG ACGAACGTCC ACATTCTAAT GATGGATGGG 5100 

GCAATGCCAG TGAGCATGTG TTAGGCAAGA AAGACCACAG TGAAGATCCA AATAAGAACT 5160 

TCAAAGCGGA TGAAGAGCCA GTAGAGGAAA CACCTGCTGA GCCAGAAGTC CCTCAAGTAG 5220 

AGACTGAAAA AGTAGAAGCC CAACTCAAAG AAGCAGAAGT TTTGCTTGCG AAAGTAACGG 5280 

ATTCTAGTCT GAAAGCCAAT GCAACAGAAA CTCTAGCTGG TTTACGAAAT AATTTGACTC 5340 

TTCAAATTAT GGATAACAAT AGTATCATGG CAGAAGCAGA AAAATTACTT GCGTTGTTAA 5400 

AAGGAAGTAA TCCTTCATCT GTAAGTAAGG AAAAAATAAA CTAATGAAAA ATGAAAGTCT 5460 

CGATAAAGAG GCTTTCATTT TTATTATGTA TATATCTAAA ATTCTTGACA AGCAATATTA 5520 

AAAAGAGTAA ACTATTAACT AGTTAATTAA CCGGTTTATT ACTTTATAGT GAATCAAATA 5580 

TACTTAAGAA AAGAGGAAAG AATGAAAATT AATAAAAAAT ATCTAGCAGG TTCAGTGGCA 5640 

GTCCTTGCCC TAAGTGTTTG TTCCTATGAA CTTGGTCGTC ACCAAGCTGG TCAGGTTAAG 5700 



WO 98/18931 



PCT/US97/19588 



AAAGAGTCTA 
TTGACACCAG 
ATTACCGATC 
GTCCCTTATG 
AAGGATTCAG 
TACTATGTTT 
AAACGTCAGA 
GTTGCAGCCA 
GATATCATTG 
ATTCCTAAGA 
AAGCAGGGAT 
TTGTCAGAGA 
ATTTCAAGCC 
GATGGCCTTA 
CCTCATGGTA 
ATTGCTCGTA 
GAACAACCAA 
CCTCAACCAG 
GTAGGCGATG 
CTTTCAGCAG 
CATAAGCTAG 
GCTTATGACT 
GATTTTGAGG 
AAGTTAGTGG 
AAACCAAATG 
AAGTACACAA 
GATGCCTATG 
GAAGCTGAGA 
ACAGACCATC 



ATCGAGTTkC TTATATAGAT 
ATGAAGTCAG TAAGAGGGAG 
AAGGTTATGT GACCTCTCAT 
ATGCCATCAT CAGTGAAGAG 
ACATTGTCAA TGAAATCAAG 
ACCTTAAGGA TGCAGCTCAT 
AGCAGGAACA CAGTCATAAT 
GAGCCCAAGG ACGCTATACA 
AGGACACGGG TGATGCTTAT 
ATGAGTTATC AGCTAGCGAG 
CTCGTCCTTC TTCAAGTTCT 
ACCACAATCT CACTGTCACT 
TTTTACGTGA ATTGTATGCT 
TTTTCGACCC AGCGCAAATC 
ACCATTACCA CTTTATCCCT 
TTATTCCCCT TCGTTATCGT 
GTCCACAATC GACTCCGGAA 
CTCCAAGCAA TCCAATTGAT 
GTTATGTCTT TGAGGAGAAT 
AAACAGCAGC AGGCATTGAT 
GAGCTAAGAA AACTGACCTC 
TACTAGCAAG AATTCACCAA 
CTTTGGATAA CCTGTTGGAA 
ATGATATTCT TGCCTTCTTA 
CGCAAATTAC CTACACTGAT 
CAGAAGACGG TTATATCTTT 
TAACTCCACA TATGACCCAT 
GAGCGGCAGC CCAGGCTTAT 
AGGATTCAGG AAATACTGAG 



731 

GGTGATCAGG CTGGTCAAAA 
GGGATCAACG CCGAACAAAT 
GGAGACCATT ATCATTACTA 
CTCCTCATGA AAGATCCGAA 
GGTGGTTATG TTATCAAGGT 
GCGGATAATA TTCGGACAAA 
CACGGGGGTG GTTCTAACGA 
ACGGATGATG GTTATATCTT 
ATCGTTCCTC ACGGCGACCA 
TTAGCTGCTG CAGAAGCCTA 
AGTTATAATG CAAATCCAGC 
CCAACTTATC ATCAAAATCA 
AAACCCTTAT CAGAACGCCA 
ACAAGTCGAA CCGCCAGAGG 
TATGAACAAA TGTCTGAATT 
TCAAACCATT GGGTACCAGA 
CCTAGTCCAA GTCCGCAACC 
GAGAAATTGG TCAAAGAAGC 
GGAGTTTCTC GTTATATCCC 
ACCAAACTGG CCAAGCAGGA 
CCATCTAGTG ATCGAGAATT 
GATTTACTTG ATAATAAAGG 
CGACTCAAGG ATGTCyCAAG 
GCTCCGATTC GTCATCCAGA 
GATGAGATTC AAGTAGCCAA 
GATCCTCGTG ATATAACCAG 
AGCCACTGGA TTAAAAAAGA 
GCTAAAGAGA AAGGTTTGAC 
GCAAAAGGAG CAGAAGCTAT 



GGCAGAAAAC 5760 

CGTCATCAAG 5820 

TAATGGCAAG 5880 

TTATCAGTTG 5940 

AGATGGAAAA 6000 

AGAAGAGATT 6060 

TCAAGCAGTA 6120 

CAATGCATCT 6180 

TTACCATTAC 624 0 

TTGGAATGGG 6300 

TCAACCAAGA 6360 

AGGGGAAAAC 6420 

TGTGGAATCT 6480 

TGTAGCTGTC 6540 

GGAAAAACGA 6600 

TTCAAGACCA 6660 

TGCACCAAAT 6720 

TGTTCGAAAA 6780 

AGCCAAGGAT 6640 

AACTTTATCT 6900 

TTACAATAAG 6960 

TCGACAAGTT 7020 

TGATAAAGTC 7080 

ACGTTTAGGA 7140 

GTTGGCAGGC 7200 

TGATGAGGGG 7260 

TAGTTTGTCT 7320 

CCCTCCTTCG 7380 

CTACAACCGC 7440 
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GTGAAAGCAG CTAAGAAGGT GCCACTTGAT CGTATGCCTT ACAATCTTCA ATATACTGTA 7500 

GAACTCAAAA ACGGTAGTTT AATCATACCT CATTATGACC ATTACCATAA CATCAAATTT 7560 

GAGTGGTTTG ACGAAGGCCT TTATGAGGCA CCTAAGGCGT ATACTCTTGA GGATCTTTTG 7620 

GCGACTGTCA AGTACTATGT CGAACATCCA AACGAACGTC CGCATTCAGA TAATGGTTTT 7680 

GGTAACGCTA GCGACCATGT TCGTAAAAAT AAGGTAGACC AAGACAGTAA ACCTGATGAA 7740 

GATAAGGAAC ATGATGAAGT AAGTGAGCCA ACTCACCCTG AATCTGATGA AAAAGAGAAT 7800 

CACCCTGGTT TAAATCCTTC AGCAGATAAT CTTTATAAAC CAAGCACTGA TACGGAAGAG 7860 

ACAGAGGAAG AAGCTGAAGA TACCACAGAT GAGGCTCAAA TTCCTCAAGT AGAGAATTCT 7920 

GTTATTAACG CTAAGATAGC AGATGCGGAG GCCTTGCTAG AAAAAGTAAC ACATCCTAGT 7980 

ATTAGACAAA ATGCTATGGA GACATTGACT GGTCTAAAAA GTAGTCTTCT TCTCGGAACG 8040 

AAAGATAATA ACACTATTTC AGCAGAAGTA GATAGTCTCT TGGCTTTGTT AAAAGAAAGT 8100 

CAACCGGCTC CTATACAGTA GTAAAATGAA TGGAGCATAT TTTATGGAGA AGTAACCTTT 8160 

CGTGTTACTT CTCTTTTTTA GAAAAACGTA ACAGA B* 95 

( 2 > INFORMATION FOR SBQ ID NO: 95: 

<il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2004 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 



TTTACTAAAA 


GGAAAAAAGA 


ACTGATTTCT 


CAGTCCTTCA 


TTAATCTTAT 


TCCACACTAA 


60 


ATAGGTATGG 


GTAAACAGGT 


TGTTGACCTT 


GGTGAATCTC 


GACTTCAACG 


TCTTCGAATT 


120 


CTTCTACGAT 


TTCTTGAGCG 


ATTTCATTGG 


CAAGTTCTTC 


GCTTCCGTCT 


TCACCTACAT 


180 


AGAAGGTTAC 


GATTTCACTG 


TCTTCATCCA 


ACATATGTTT 


CAAGGTTTCA 


gtcaatgi rr 


240 


GGTCCATATC 


AGGGTTTGAC 


ACAAGAATTT 


TTCCATCCAC 


CATACCTAAA 


TTATCGTTTT 


300 


CATGGATTTC 


TAAGCCATCG 


ATCGTTGTAT 


CACGCACGGC 


TGTTCTGACG 


CTTCCGCTAA 


360 


CGACATCGCT 


AAGAGCAGCT 


CTCATACGCT 


CTTGGTTTTC 


TTCAATGGAC 


TTGCTTGGAT 


420 


CAAAGGCAAG 


AAGACTTGTC 


ATACCTTGAG 


GAAGAGTGCG 


AGCCTCTACC 


ACTACCGCTG 


480 


GTTGCTCCAA 


AACTTCTGCC 


GCAGATTGAG 


CTGCCATGAA 


GATGTTCTTG 


TTGTTTGGCA 


540 


AGAAGATGAT 


GTTACGGGCA 


TTAACCTGTT 


CAACAGCCTT 


GATAAAGTCT 


TCTGTTGAAG 


600 


GGTTCATGGT 


TTGACCGCCT 


TCGATAACAT 


AATCCACCCC 


TTGAGAACAG 


AAGATATCTG 


660 
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CTAGACCTTT ACCAGCCACC ACAGCAATCA AAGCATACTC TTTTTCTTCA GCCGACTTGA 720 

TAACTTGAGT AGCTTCTTTC TCAACCTGTG CTTCGTGTTG GTTACGCATA TTGTCAACTT 780 

TTACCTTGAC CAAGCTACCA TATTTGACAC CTTCTTGCAT AACAAGTCCT GGATCTTCTG 840 

TATGAACATG GACTTTGACA ATTTCATCAT CGTTAACAAC AAGGAGAGAA TCTCCAAGCT 900 

CATCCAAGTA GTTACGGAAT TCATCGTAGT CAAAATCTTT AGCATAGGTT GGACCTTGCT 960 

TAAGAGCTAC CATGATTTCA GTACAGTAAC CAAACGTGAT GTCCTCAGTC GCTACGTGAC 1020 

CACCTACAGA CTTATGATGC TCTACATTGA TCATCTCACT CATGTTGGCA GGAGTCGCTA 1080 

CAAAGTCCTC AGATGCAATA TATTCGCCAG TAAGGGCTGA AAGGAAACCT TCGTAGATGA 1140 

AGACCAATCC TTGACCACCT GAGTCCACAA CGCCAACTTC TTTCAATACT GGAACCATGT 1200 

CTGGTGTTTT AGCTAGAGCT GTTTTAGCAC CTTCCAAGGC TGCGCGCATG ACTTCAACAG 1260 

CGTCATCTGT TTGCTCAGCT TTTTTCTTAG CACCGATAGC AGCTCCACGA GAAACTGTTA 1320 

AAATCGTTCC TTCAACAGGT TTCATCACTG CCTTATAGGC AACTTCCACA CCTGATTGGA 1380 

AGGCCAGAGC CAAGTCTTGA CCTGTTAACT CGTCTTTATC CTTGATAGCT TGGGAAAATC 1440 

CACGGAAAAG CTGAGACGTA ATCACTCCTG AGTTCCCACG CGCACCCATC AAAAGCCCTT 1500 

TGGCAAGAAT GCTCGCTACT TCTCCAACTG TAGAAGCTGG CTTGTCTGCA ACTTCTTTAG 1560 

CACCATTTTC AATGGTCATT CCCATATTTG TCCCAGTATC TCCATCTGGA ACTGGAAAGA 1620 

CGTTTAATGA ATTGACATAT TCAGCTTGCT TATTCAAGCG AGTTGATGCA GCCTGCACCA 1680 

TTTCTTGAAA TAAGCTAGTA GTAATTTTTG ACACGGTTAT TCTCCTACAA CTTTGATATT 1740 

TTGAATGTAG ACATTTACAG TCTGAGCAGT AATTCCAAGC TGGTTTTCCA AGCTAAAGGC 1800 

AACACGCTCT TGAATGTTTT TTGACACTTC ACTAATCTTT GTTCCGTAGC TTAACACGGT 1860 

ATATACATCA ACTGCAATAC TGCCATCTTC GGCTGCCTTT ACGACGACAC CTTTAGAATA 1920 

ATTTTCCTTA CCTAGCAGGG CTTGGAAATT ATCTTTGAGG GCATTTTTAC TAGCCATACC 1980 

GACCACACCA GAAATCTCAG TTGC 2004 



(2) INFORMATION FOR SEQ ID NO: 96: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11915 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
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CCGGGTTGGG 


CTGTTCGCCC 


ATTAAAGCGG 


CACCACAGCT 


GGGTTCAGAA 


CGTCGTGAGA 


60 


CAGTTCGGTC 


CCTATCCGTC 


GCGGGCGTAG 


GAAATTTGAG 


AGGATCTGCT 


CCTAGTACGA 


120 


GAGGACCAGA 


GTGGACTTAC 


CGCTGGTGTA 


CCAGTTGTCT 


TGCCAAAGGC 


ATCGCTGGGT 


180 


AGCTATGTAG 


GGAAGGGATA 


AACGCTGAAA 


GCATCTAAGT 


GTGAAACCCA 


CCTCAAGATG 


240 


AGATTTCCCA 


TGATTATATA 


TCAGTAAGAG 


CCCTGAGAGA 


TGATCAGGTA 


GATAGGTTAG 


300 


AAGTGGAAGT 


GTGGCGACAC 


ATGTAGCGGA 


CTAATACTAA 


TAGCTCGAGG 


ACTTATCCAA 


360 


AGTAACTGAG 


AATATGAAAG 


CGAACGGTTT 


TCTTAAATTG 


AATAGATATT 


CAATTTTGAG 


420 


TAGGTATTAC 


TCAGAGTTAA 


GTGACGATAG 


CCTAGGAGAT 


ACACCTGTAC 


CCATGCCGAA 


480 


CACAGAAGTT 


AAGCCCTAGA 


ACGCCGGAAG 


TAGTTGGGGG 


TTGCCCCCTG 


TGAGATAGGG 


S40 


AAGTCGCTTA 


GCTCTAGGGA 


GTTTAGCTCA 


GCTGGGAGAG 


CATCTGCCTT 


ACAAGCAGAG 


600 


GGTCAGCGGT 


TCGATCCCGT 


TAACTCCCAT 


TTTAGCGGGT 


GTAGTTTAGT 


GGTAAAACTA 


660 


CAGCCTTCCA 


AGCTGTTGTC 


GCGAGTTCGA 


TTCTCGTCAC 


CCGCTTTGAA 


CTTTGTTCTT 


720 


TGTACCAAGT 


TTTTGACTTG 


GGCGCGTAGC 


TCAGGTGGTT 


AGAGCGCACG 


CCTGATAAGC 


780 


GTGAGGTCGG 


TGGTTCGAGT 


CCACTCGTGC 


CCATAGTGTT 


TAGTCCATTA 


CTAGGGGATT 


840 


GGAATATTAT 


CTGTTCACTA 


AGAGGACACG 


GGCTTGTTCC 


CGTATAAACT 


ATTTTGGAGG 


900 


ATTACCCAAG 


TCCGGCTGAA 


GGGAACGGTC 


TTGAAAACCG 


TCAGGCGTGT 


AAAAGCGTGC 


960 


GTGGGTTCGA 


ATCCCACATC 


CTCCTTTTAT 


ATTAACGCGG 


GATGGAGCAG 


CTCGGTAGCT 


1020 


CGTCGGGCTC 


ATAACCCGAA 


GCTCGTAGGT 


TCAAATCCTG 


CTCCCGCAAT 


AAGGCTCGGT 


1080 


AGCTCAGTTG 


GTAGAGCAAT 


GGATTGAAGC 


TCCATGTGTC 


GGCGGTTCGA 


TTCCGTCTCG 


1140 


CGCCATTTAT 


ATATTTTGGA 


AGGGTAGCGA 


AGAGGCTAAA 


CGCGGCGGAC 


TGTAAATCCG 


1200 


CTCCTTCGGG 


TTCGGGGGTT 


CGAATCCCTC 


CCCTTCCATT 


TTACGGGCAT 


AGTTTAAAGG 


1260 


TAGAACTAAG 


GTCTCCAAAA 


CCTTCAGTGT 


GGGTTCAATT 


CCTACTGCCC 


GTGTTAATAG 


1320 


AATTATGGCG 


GGTGTGGTGA 


AGTGGTTAAC 


ACACCAGATT 


GTGGCTCTGG 


CATGCGTGGG 


1380 


TTCGATCCCC 


ATCACTCGCC 


TATTTTATAT 


TGGGCTATAG 


CCAAGCGGTA 


AGGCAAGGGA 


1440 


CTTTGACTCC 


CTCATGCGTT 


GGTTCGAATC 


CAGCTACCCC 


AGTTACTATT 


TGCCGGCGTG 


1500 


GCGGAATTGG 


CAGACGCGCT 


GGACTCAAAA 


TCCAGTGTCC 


GCAAGGACGT 


GCCGGTTCGA 


1S60 


CCCCGGCCGC 


CGGTATAGTA 


TAGTGTTAGG 


AACGTTGTTA 


TTCTTCGTTC 


CTTTTTTATA 


1620 


TTATTTTTGG 


TATAATTATA 


GTTATTCAAA 


TTTTATTTAG 


ATTAAGAAAG 


TGTAGGGGAG 


1680 


TATGTCTTGT 


TCTATCGATT TATTAAAACA TCGGTATTTG 


AAAAATATTA 


AAGAAAATCC 


1740 


TGAATTGTTT 


GTCGGAATTG 


AGTTGGAGTA 


> TCCTGTTGCA 


AGTTTAGAAG 


GGGATGCTAC 


1800 
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AGATGTTGAA 


GTTATGAAGG 


ATCTATTTCA 


TTATTTAGTT 


TCTACTTTGG 


ATCTCACCGT 


1860 


AGCAAAGGTA 


GATGATTTTG 


GCAATCTGAT 


CCAGTTAGTA 


GATCCGATAA 


GTCAGGATGC 


1920 


TATTTTATTT 


GAAGTTTCCT 


ATACAACGAT 


TGAGTTTGCA 


TTTGGTAAGG 


CTGAAACGAT 


1980 


TCAAGAGGTC 


GAAAATCGTT 


TCAATAATTA 


TATGAATGTA 


ATTCAGAGAA 


AGTTAGCTCA 


2040 


ATCAAATCAT 


GCTATTGTTG 


GCTGTGGTAT 


CCATCCCAAC 


TGGGATAAAA 


ATGAGAATTG 


2100 


TCCAGTGGCT 


TATCCACGCT 


ATCAGATGTT 


GATGGATTAT 


TTGAATTTGA 


GTAGAAATAT 


2160 


TATTAAATCA 


GATTTACATC 


ATTTCCCTGA 


ATATGGTACT 


TTTATCTGTG 


GGAGCCAGGT 


2220 


TCAGCTGGAT 


ATTTCAAAAA 


CCAACTACTT 


ACGGGTGATT 


AATGCTTTTA 


CTCAAATTGA 


2280 


AGCGGCTAAG 


GCTTATTTAT 


TTGCAAACTC 


TGAATTTTCG 


GGTGCGGATT 


GGGATACGAA 


2340 


AATTTCAAGG 


GATATTTTCT 


GGGAAGAATC 


TATGCATGGT 


ATCTATCCAG 


AGAATGTTGG 


2400 


GGTCAATGCT 


AGACTCCTTA 


ATGATGAAAC 


TGATTTTTTT 


GACTATCTAA 


ATCATTCTGC 


2460 


GATTTTTACT 


GCGGAACGTG 


ATGGGCAGAC 


CTATTATTTT 


TATCCTATTC 


AGGCTGGGGA 


2520 


CTATTTGGCT 


ACGTCCGAAA 


TCCAAGCATT 


TGCTCTGAAT 


GGGGATGAGG 


TTATTATTTA 


2S80 


CCCCCAAGAG 


AAGGATTTTG 


AAACTCATCG 


TAGTTACCAG 


TACCAAGATT 


TAACGACTCG 


2640 


AGGAACAGTT 


GAGTTTCGTA 


GTGTGTGTAC 


ACAGCCACTT 


GATAGGACTT 


TTCCTTCTGC 


2700 


AGCTTTTCAC 


TTGGGATTAT 


TGGTTAATTT 


AGACAAGTTA 


GAAGCTTACT 


TAGAAACAGC 


2760 


ACCTTTCTTT 


AAAGTATTTG 


GTTATGATTA 


CAAGTCTTTA 


AGGAGACAAT 


TTTCTAAGAA 


2820 


AAATCTTACA 


GATGAGGAAG 


AAACTACGAT 


TATTGAATTT 


TCCAAAGACT 


TACTCCTACT 


2880 


AGCTGAGGAG 


GGACTAGTGG 


TGAGAAATAA 


GGAAGAAATG 


ACCTATTTAC 


AGCCTTTGAG 


2940 


AGAAGAATTG 


AGCCTATAAT 


TTCTCTTATA 


AAGGGAGAAT 


TTTCTGAAAA 


ATCATGATAT 


3000 


AATGGACGAG 


ACTATAGATA 


AAGGATAGAG 


AGTAATGACA 


TTAGTTTATC 


AATCAACGCG 


3060 


TGATGCCAAC 


AATACAGTAA 


CTGCCAGCCA 


AGCAATTTTG 


CAAGGTTTGG 


CGACGGACGG 


3120 


CGGTTTGTTT 


ACACCGGATA 


CTTATCCAAA 


GGTACATTTG 


AACTTTGACA 


AATTGAAAGA 


3180 


TGCTTCTTAC 


CAGGAAGTTG 


CTAAGCTAGT 


TTTGTCAGCA 


TTTTTAGATG 


ACTTTACAGT 


3240 


TGAGGAGTTG 


GACTACTGTA 


TCAACAATGC 


CTACGATAGC 


AAATTTGATA 


CTCCAGCTAT 


3300 


TGCACCATTA 


GTGAAATTAG 


ATGGGCAATA 


CAATTTGGAA 


CTTTTCCATG 


GTTCAACGAT 


3360 


TGCCTTTAAG 


GATATGGCCT 


TGTCTATTTT 


GCCATACTTT 


ATGACGACTG 


CTGCTAAGAA 


3420 


ACATGGTTTG 


GAGAACAAGA 


TTGTTATCTT 


GACAGCGACA 


TCTGGTGACA 


CGGGGAAAGC 


3480 


TGCTATGGCG 


GGGTTTGCGA 


ATGTGCCTGG 


TACTGAGATT 


ATCGTCTTTT 


ATCCAAAGGA 


3540 
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TGGTGTCAGC AAGATTCAAG AGTTACAAAT GACCACTCAG ACTGGCGACA ATACTCATGT 3600 

TATTGCTATT GATGGTAACT TTGACGATGC GCAAACAAAT GTGAAGCACA TGTTTAACGA 3660 

CGTGGCTCTT CGTGAAAAAT TGACTACCAA CAAGTTGCAA TTTTCATCAG CTAACTCTAT 3720 

GAACATTGGT CGTCTGGTGC CACAAATTGT TTATTATGTT TATGCTTACG CTCAATTGGT 3780 

TAAGACTGGT GAAATTGTAG CTGGTGAAAA GGTTAACTTC ACAGTACCAA CAGGAAACTT 3840 

TGGAAATATC TTGGCTGCCT TTTATGCCAA ACAAATTGGT TTGCCAGTTG GTAAATTAAT 3900 

CTGTGCTTCA AATGACAACA ATGTTTTGAC AGACTTCTTT AAAACACGTG TCTATGACAA 3960 

AAAACGTGAG TTTAAGGTAA CAACCAGCCC ATCTATCGAT ATCTTGGTAT CTTCAAACTT 4020 

GGAGCGCTTG ATTTTCCATC TTTTGCGAAA TAATGCTGAA AAGACAACTG AACTTATGAA 4080 

TGCCTTGAAC ACGCAAGGAC AATATAAGTT GACAGACTTT GATGCAGAGA TTTTGGACCT 4140 

CTTTGCAGCT GAATATGCGA CTGAGGAAGA AACGGCAGCA GAGATCAAGC GTGTTTGTGA 4200 

GTTAGATTCT TATATCGAGG ACCCTCATAC AGCTGTTGCT TCAGCAGTTT ATAAAAAATA 4260 

CCAATCGGCC ACTGGAGATG TAACTAAGAC AGTGATTGCT TCAACAGCTA GTCCATACAA 4320 

GTTCCCAGTA GTTGCAGTAG AAGCTGTAAC TGGAAAAGCA CCTTTAACAG ACTTTGAAGC 4380 

CTTGGCTCAA TTACATGAAA TCTCAGGCGT TGCAGTGCCA CCAGCAGTTG ATGGGCTTGA 4440 

AATAGCTCCA ATTCGTCACA AGACAACAGT GGCAGCTGCT GACATGCAAG CAGCGGTTGA 4500 

GGCTTATTTA GGACTTTAAG ACAGAGGGAG CAAACTCGCT TGGGAAACCA ACTGAGTTTC 4560 

TTTTCATCAG GAGGAGAGAT TGTTTAAGAA AAATAAAGAC ATTCTTAATA TTGCATTGCC 4620 

AGCTATGGGT GAAAACTTTT TGCAGATGCT AATGGGAATG GTGGACAGTT ATTTGGTTGC 4680 

TCATTTAGGA TTGATAGCTA TTTCAGGGGT TTCAGTAGCT GGTAATATTA TCACCATTTA 4740 

TCAGGCGATT TTCATCGCTC TGGGAGCTGC TATTTCCAGT GTTATTTCAA AAAGCATAGG 4600 

GCAGAAAGAC CAGTCGAAGT TGGCCTATCA TGTGACTGAG GCGTTGAAGA TTACCTTACT 4860 

ATTAAGTTTC CTTTTAGGAT TTTTGTCCAT CTTCGCTGGG AAAGAGATGA TACGACTT'iT 4920 

GGGGACGGAG AGGGATGTAG CTGAGAGTGG TGGACTGTAT CTATCTTTGG TAGGCGGATC 4980 

GATTGTTCTC TTAGGTTTAA TGACTAGTCT AGGAGCCTTG ATTCGTGCAA CGCATAATCC 5040 

ACGTCTGCCT CTCTATGTTA GTTTTTTATC CAATGCCTTG AATATTCTTT TTTCAAGTCT 5100 

AGCTATTTTT GTTCTGGATA TGGGGATAGC TGGTGTTGCT TGGGGGACAA TTGTGTCTCG 5160 

TTTGGTTGGT CTTGTGATTT TGTGGTCACA ATTAAAACTG CCTTATGGGA AGCCAACTTT 5220 

TGGTTTAGAT AAGGAACTGT TGACCTTGGC TTTACCAGCA GCTGGAGAGC GACTTATGAT 5280 

GAGGGCTGGA GATGTAGTGA TCATTGCCTT GGTCGTTTCT TTTGGGACGG AGGCAGTTGC 5340 
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TGGGAATGCA ATCGGAGAAG TCTTGACCCA GTTTAACTAT ATGCCTGCCT TTGGCGTCGC 54 00 

TACGGCAACG GTCATGCTGT TGGCCCGAGC AGTTGGAGAG GATCATTGCA AAAGAGTTGC 5460 

TAGTTTGAGT AAACAAACCT TTTGGCTTTC TCTGTTCCTC ATGTTGCCCC TGTCCTTTAG 5520 

TATATATGTC TTGGGTGTAC CATTAACTCA TCTCTATACG ACTGATTCTC TAGCGGTCGA 5580 

GGCTAGTGTT CTAGTGACAC TGTTTTCACT ACTTCGGACC CCTATGACGA CAGGAACAGT 5640 

CATCTATACG GCAGTCTGGC AGGGATTAGG AAATGCACGC CTCCCTTTTT ATGCGACAAG 5700 

TATAGGAATG TGGTGTATCC GCATTGGGAC AGGATATCTG ATGGGGATTG TGCTTGGTTG 5760 

GGGCTTGCCT GGTATTTGGG CAGGGTCTCT CTTGGATAAT GGTTTTCGCT GGTTATTTCT 5820 

ACGCTATCGT TACCAGCGCT ATATGAGCTT GAAAGGATAG GAAATGCAAA AAACAGCTTT 5880 

TATTTGGGAT TTAGACGGGA CTTTATTGGA CTCTTACGAA GCGATTTTAT CAGGGATTGA 5940 

GGAGACTTTT GCTCAGTTTT CTATTCCTTA TGATAAGGAG AAGGTGAGAG AGTTTATCTT 6000 

CAAGTATTCG GTGCAAGATT TGCTTGTGCG GGTGGCAGAA GATAGAAATC TGGATCTTGA 6060 

GGTGCTAAAT CAGGTGCGTG CCCAGAGTCT GGCTGAGAAG AATGCTCAGG TAGTTTTGAT 6120 

GCCAGGTGCG CGTGAGGTGC TAGCTTGGGC AGACGAATCA GGAATTCAGC AGTTTATATA 6180 

TACTCATAAG GGGAACAACG CTTTTACCAT TCTCAAGGAC TTGGGGGTGG AATCCTATTT 624 0 

TACAGAGATT TTAACCAGTC AGAGTGGCTT TGTGCGGAAG CCAAGTCCAG AAGCGGCTAC 6300 

CTATCTGCTA GATAAGTATC AGTTGAATTC TGATAATACT TATTATATAG GGGATCGGAC 6360 

TCTGGATGTG GAATTTGCCC AGAATACTGG GATTCAAAGC ATCAACTTTT TAGAGTCTAC 6420 

TTATGAAGGG AATCACAGGA TTCAAGCGTT AGCAGATATT TCCCGTATTT TTGAGACTAA 6480 

GTGATAAAAA GATTGTGTCA GTTTTGTGAC AGAGACCTAA CAAACTATTT CAAGTAACCT 6540 

AGTTTGTTAC AAGGAATAGA CAGTTCTGTT AAATAGGCCC GAGAGGGCTT TTTTTCTACA 6600 

TTTTTTGTGT TATGATAGAC AGGTACTCAT TTGAAAGGAA TTTGAAAGAA TGAAGAAAAG 6660 

AATGTTATTA GCGTCAACAG TAGCCTTGTC ATTTGCCCCA GTATTGGCAA CTCAAGCAGA 6720 

AGAAGTTCTT TGGACTGCAC GTAGTGTTGA GCAAATCCAA AACGATTTGA CTAAAACGGA 6780 

CAACAAAACA AGTTATACCG TACAGTATGG TGATACTTTG AGCACCATTG CAGAAGCCTT 684 0 

GGGTGTAGAT GTCACAGTGC TTGCGAATCT GAACAAAATC ACTAATATGG ACTTGATTTT 6900 

CCCACAAACT GTTTTGACAA CGACTGTCAA TGAAGCAGAA GAAGTAACAG AAGTTGAAAT 6960 

CCAAACACCT CAAGCAGACT CTAGTGAAGA AGTGACAACT GCGACAGCAG ATTTGACCAC 7020 

TAATCAAGTG ACCGTTGATG ATCAAACTGT TCAGGTTCCA GACCTTTCTC AACCAATTGC 7080 
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AGAAGTTACA AAGACAGTGA TTGCTTCTGA AGAAGTGGCA CC ATCTACGG GCACTTCTGT 7140 

CCCAGAGGAG CAAACGACCG AAACAACTCG CCCAGTTGAA GAAGCAACTC CTCAGGAAAC 7200 

GACTCCAGCT GAGAAGCAGG AAACACAAGC AAGCCCTCAA GCTGCATCAG CAGTGGAAGT 7260 

AACTACAACA AGTTCAGAAG CAAAAGAAGT AGCATCATCA AATGGAGCTA CAGCAGCAGT 7 320 

TTCTACTTAT CAACCAGAAG AGACGAAAAT AATTTCAACA ACTTACGAGG CTCCAGCTGC 7 380 

GCCCGATTAT GCTGGACTTG CAGTAGCAAA ATCTGAAAAT GCAGGTCTTC AACCACAAAC 7440 

AGCTGCCTTT AAAGAAGAAA TTGCTAACTT GTTTGGCATT ACATCCTTTA GTGGTTATCG 7500 

TCCAGGAGAC AGTGGAGATC ACGGAAAAGG TTTGGCTATC GACTTTATGG TACCAGAACG 7560 

TTCAGAATTA GGGGATAAGA TTGCGGAATA TGCTATTCAA AATATGGCCA GCCGTGGCAT 7620 

TAGTTACATC ATCTGGAAAC AACGTTTCTA TGCTCCATTC GATAGCAAAT ATGGGCCAGC 7680 

TAACACTTGG AACCCAATGC CAGACCGTGG TAGTGTGACA GAAAATCACT ATGATCACGT 7740 

TCACGTTTCA ATGAATGGAT AAACCCGACT TGATAACATC ATTTTGACGA ATGAGATCTA 7800 

GCTTTCGTGA TGGAAAGCGA TTCTCGTTCG TTTTTTCTTT GTCATACTCT TCGAAAATCT 7860 

CTTCAAACCA CGTCAGTTTT ATCTGAAACT TCAAAGCTGT GCTTTGAGCA ACCTGCGACT 7920 

AGCTTCCTAG TTTGCTTTTT GATTTTCATT GAGTATCAAT TTGAATGGAA AATGGAAAGT 7980 

TATCATCTTG TAATGAGTTA AGCAACATTC TTGCAATCTA TTTTACTTTA TATCACAATT 8040 

AATTAGTCAA ATATTGATAA ATCAATAAAA AGAGAGGGGA AGAAATGCTA GAGATTCAAG 8100 

ATTTACTGTA TCAACTCCGC TTGTCTGAGC AAGCGAGTAC GCAATTGTTT GAAAAAAGCC 8160 

TTGGGATTAG TTTGACACGG TATCAGATTT TACTGTTTTT GCTGGAGCAT TCTCCTTGTA 8220 

ACCAAATGGC GGTTCAGGAG CGTTTGAAAA TTGATCAGGC TGCTTTGACA CGGCATTTCA 8280 

AAATTTTGGA AACGGAACGT TTGGTGGAGC GTCATCGTAA TCCTGAAAAT CAGCGGGAAG 8340 

TGTTGGTAGA GGCTGCGAAG TATGCCAAGG AGCAGTTAGT GGTGAATCCC CCTCTGCAAC 8400 

ATATCAGGGT TAAGGAAGAG ATAGAAAGTA TCTTAACAGA GTTTGAGAGA ACAGAACTCA 8460 

GCCGTTTATT AAATAAATTG GTTTTGGGTA TTGAAAATAT AGAAATTTAA GGAGAAATAG 8520 

ATGTCAATTA TTTTAACAAC GATCGTTGCT TTGGAGCATT TTTACATTTT TTATTTGGAA 8580 

AGTATTGCCA CGCAATCAGA TGCGACTAGT CGTGTATTTA ATATGGAAAA GGAAGAATTG 8640 

GCTCATCCGT CAGTAAGTTC ATTGTTCAAA AATCAAGGAA TTTATAAGGC TCTGCTAGGA 8700 

GTCTTTCTCT TGTATGTCAT TTATTTCTCA CAGAATTTAG AAATTGTGAC TATTTTTGTC 8760 

TTATTTGTGA TTGGTGCTGC GACTTACGGC TCTTTAACAG CGGATAAAAA AATTATTTTG 8820 

AAACAAGGTG GATCAGCTAT TTTGGCCTTG ATTAGTATTT TACTCTTTAA ATACACTTGA 8880 
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AGGTCGATTC 


TAATCTCGCT 


AATCCTTTTT 


AATCCAGAAT 


AAGGGAAATA 


TGTTATACTT 


8940 


GTTTTTAAGA 


AAAAAGTCTC 


ATTGAATTGG 


TTTTGAGGAG 


TTAGAAATGA 


AAGTATTACT 


9000 


GACAGGTTTT 


GAGCCCTTTG 


GAGGGGAAAA 


GGGCAATCCA 


GCTTTGGAGG 


CCATTAAAGG 


9060 


TTTACCAGCT 


GAAATCCATG 


GTGCTGAGGT 


CCGTTGGCTA 


GAGGTGCCGA 


CAGTTTTTCA 


9120 


CAAATCTGCT 


CAAGTATTGG 


AAGAAGAGAT 


GAATCGTTAT 


CAACCTGACT 


TTGTCCTTTG 


9180 


TATTGGGCAA 


GCTGGTGGAA 


GAACTAGTTT 


GACACCTGAA 


CGAGTGACCA 


TTAATCAAGA 


9240 


CGATGCATCC 


ATTTCTGATA 


ACGAAGATAA 


TCAACCGATT 


GACCGTCCCA 


TTCGCCCAGA 


9300 


TGGTGCTTCG 


GCCTACT1TA 


GTAGTTTGCC 


GATTAAAGCG 


ATGGTTCAAG 


CTATAAAAAA 


9360 


AGAGGGCTTA 


CCGGCCTCTG 


TTTCCAATAC 


GGCAGGGACT 


TTTGTCTGCA 


GCCATTTGAT 


9420 


GTATCAGGCT 


CTCTATTTGG 


TAGAAAAGAA 


ATCTCCATAT 


GTTAAGGCAG 


GTTTTATGCA 


9480 


TATTCCTTAT 


ATGATGGAAC 


AGGTGGTGAA 


CAGACCGACT 


ACTCCAGCTA 


TGAGTTTAGT 


9540 


GGATATTCGG 


CGAGGGATAG 


AAGCAGCAAT 


CGGCGCTATA 


ATAGAACATG 


GAGATCAGGA 


9600 


ACTCAAGTTG 


GTAGGCGGAG 


AAACTCATTG 


ATAGAAAAAA 


GCTTGAGGGG 


AAAAACCTTC 


9660 


AAGCTTTTGG 


ACGTTTTCGG 


GCCAATACTG 


CTCGGTAAAA 


CATAATTTTA 


GTGCATTGGA 


9720 


TATAAGGTAG 


GAGTGAAAAA 


CTACCAATGC 


CAAAGGTAAT 


CCAATTGAGG 


AAGTACCAAG 


9780 


GAAGAAGCTG 


TAAATCTAGG 


ACAAAGTGCT 


GGAACTTGTA 


GCCCTTCATA 


AAGGAACGGC 


9840 


TAGTTTTTAG 


GATTCGTCTT 


GGTGGGACCT 


GTCCTAGGTC 


TAGACTATAA 


CAGAGAAGAA 


9900 


ATTCCACCTG 


TGAATAGGCA 


TAATACTGTG 


GAATATAGAG 


GATATTTCCT 


ACAATGATCA 


9960 


AGATGAGACT 


TGCAAGAAAC 


TAGAGTCCAA 


AGACCATGAG 


GAAACGCTCG 


GTTTCAACTG 


10020 


ATGAGAGATC 


TAGATTTGGA 


AACTCAGGAT 


GTAGGGTGAC 


GAATTTTTTG 


CCTAAAAAGC 


10080 


TACTATAAAA 


GAGGAGGTAA 


ATCCCAAGTA 


AATTAGGCAT 


ACTCCATAAA 


AAGAGATAGA 


10140 


AACGTTTGAG 


AAGTAGGGTC 


AAAAAGGTTT 


GAGAAAAGCG 


CTCCTCATCA 


AAGAGAGCTA 


10200 


GGCTGTTTTT 


TACAGATGGC 


TCCGTTTTAG 


AATCTTTCAT 


GAGTGTCAGT 


GTTGCATAGA 


10260 


CGGAACTGGT 


CAAAAGAATA 


GTCCCGATAA 


AGGAGACTAC 


TAGAGGAAAG 


AGGTAGGTTT 


10320 


GAAGTATTTG 


GCCAAGTATG 


CTGAAAAATG 


GCTGTTCTAA 


AACAGTCCCG 


TGGATCCGAG 


10380 


ATAAGGGATT 


AAGAAAACCA 


GATAAGATGA 


CCAGCATACT 


GGGAAGGATA 


TAGAGGAGAA 


10440 


AGAGACGGGG 


GGTGTCAGCC 


TGAAAATGTT 


TTGACTCCTC 


ACGAATTGTT 


TTTAAATCAA 


10500 


TTTTTGGATA 


GTTCATTCTC 


TTATTATACC 


ATAGTTCTTA 


TACATAGTTC 


GTGACAGTTC 


10560 


CTACTTTTTT 


TGATAAAATC 


ATACACTGTG 


TCCTTGGGCA 


CACTGTATGA 


ACTGGGACTG 


10620 
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TCTTTCCCAG CTTCGGACCT AAAAAATGTC AGATTCACCA ATCAAATATC GTTTGATTAA 10680 

GAAAGAAAAA CACACAGGAG CTCGTCTGGG AGAAATCATC ACTCCCCACG GTACCTTTCC 10740 

GACACCTATG TTTATGCCAG TTGGGACACA AGCCACTGTC AAAACTCAGT CACCTGAAGA 10800 

ATTGAAGGAG ATGGGTTCGG GAATTATCCT ATCAAACACC TATCATCTCT GGCTTCGCCC 10860 

TGGAGATGAA CTCATTGCAC GCGCTGGTGG TCTCCACAAG TTCATGAATT GGGACCAGCC 10920 

TATCTTGACA GATAGTGGTG GTTTTCAGGT TTATTCTTTA GCAGATAGCC GTAATATCAC 10980 

AGAAGAAGGA GTAACCTTTA AAAATCATCT AAATGGTTCT AAGATGTTCC TATCCCCAGA 11040 

AAAAGCCATC TCTATTCAGA ATAATCTGGG TTCAGACATC ATGATGTCCT TTGATGAATG 11100 

TCCTCAGTTT TATCAACCTT ATGACTACGT TAAGAAATCG ATCGAGCGTA CCAGCCGTTG 11160 

GGCTGAGCGT GGTTTGAAGG CTCACCGTCG TCCACATGAC CAAGGTTTGT TTGGAATTGT 11220 

GCAAGGTGCA GGATTTGAAG ACCTTCGCCG CCAATCAGCT CATGATCTTG TCAGCATGGA 11280 

TTTCTCAGGC TACTCTATCG GTGGTTTGGC AGTGGGAGAA ACCCATGAAG AGATGAATGC 11340 

GGTCTTGGAC TTTACAACTC AACTGCTGCC TGAAAATAAA CCTCGTTATC TGATGGGTGT 11400 

GGGAGCGCCA GATACCTTGA TCGATGGGGT CATTCGTGGG GTGGATATGT TTGACTGTGT 11460 

CTTACCGACT CGAATTGCTC GTAACGGGAC TTGTATGACC AGTCAAGGAC GTTTGGTTGT 11520 

GAAAAATGCC CAGTTTGCTG AGGACTTTAC GCCACTGGAT CCTGAGTGTG ATTGCTACAC 11580 

ATGTAATAAC TATACACGCG CTTACCTTCG TCACCTGCTC AAGGCTGATG AAACCTTTGG 11640 

TATCCGCTTG ACTAGCTACC ACAATCTTTA CTT C TTCCTT AACCTGATGA AGCAAGTGCG 11700 

ACAAGCCATC ATGGATGACA ATCTCTTGGA ATTCCGTGAG TATTTTGTGG AAAAATATGG 11760 

CTATAATAAG TCAGGACGTA ATTTCTAAAA TGGAATTGAT ATAAAAAAAT CCTAAGTTTT 11820 

CTCTTAGGAT TTTTCTTCTT TTTTTGATAG AATAAAGTGT ACAATGAAAG GAAGAATAAA 11880 

CTCGTATGCG CATTAAATGG TTTTCCTCGA TTAGG 11915 

(2) INFORHAT I ON FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS : double 

(D) TOPOLOGY: linear 

(xi I SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

GAGAGGGCAA CAGTTCTATC GCTTCAAATT TTTTCTTGGT TTGCAGATAT TCAAGAATCG 60 

GGAGTTTTTC TATAGTATTC GGCAGATTTA TTACAGCCAA GCATCTCAAA AATACGGACA 120 
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GCATCCTCCA TCTTTTTCTG GCCTTCCTTG ACTCTACCTT GCTTGCTATC AAGGAGACCT 180 

TCTGCCCACA GATAAACAAT TCGGAAATAG GTCTCATTTT CCTTGTAGAA ATGCTCTTCG 240 

ATAACACGTT TAAAATAATA GGCATTGGTA AATTCTTCAC ACTCAATACT AGCTAAAAAG 300 

CCATTCAATA GTATAGTATG AAAAAGGTTT CGATTGCCAG ACATTTCCAT TAGAAAATCA 360 

GATTTACGTA CCATTTCTCG TACATATCTA GTAAAAAGAG AAACAGATAA AAATGGAGAA 420 

CTGACTGAAA ATAAATTGAG TTCATAGATT CCCCAGATCT CGGTAGAAAA CAAATAATCA 4 80 

TGAAGGACTT TTCCTTCCTC TGCTGTTAAG TCTACCCTTT CATCTATGCT CTTCATATAA 540 

GACTTGATAA TAATGGCATT TAGAATATGT TTCTGTTTGT TGTGAGAATG GGCATGCTTT 600 

TATACTCCCT GCGATATAAG TCCTCAAGAG GTGCTATATT CTTTGGTTCC AAGACATCTG 660 

TAATTTCTTT TCTCAACTCA GAATCTGTAT CATACTGGAA ACCTCTTGCC AGAAAGAGGA 720 

TCTCCTCCAC ACTGGCAGAT ATATTTTCCA GAGCAAATAG AAACTTTTCC ACCGAAAGCT 780 

CACTCTGACC TGTTTCAAAA CGGGACAACA TAGACGGCGA AAATTGTCCT CCGGTTGCTT 840 

GTCTCAGTGA GATATTTCTT GACTCTCGTA ATTGTCTAAA GACTTTTCCA ATCTGCTCCA 900 

TAGACTTCCC CTTGATTCCG TATTTTCTTC ATTTTATCAT ATTTTTCAGA AAATTCATCA 960 

AAAACTTGCC AAATTGTCAG AATTATGAGA AAATAGAGGA TATTTATCAC GTGGAGGGAC 1020 

TGCTATGAGA GACGATATCA AAATCAATGA CCGTGCTTTG GCCTTCCAAG ACCAAATTAT 1080 

CGAAAAACTA GAGAAAGTTT TTGATACAGA TGTGGAATTG GATGTTTACA ATCTAGGTCT 1140 

GATTTATGAA ATCAATCTGG ATGAAACGGG GCTCTGCAAG ATTGTCATGA CCTTCACCGA 1200 

TACTGCCTGT GATTGCGCCG AAAGCCTGCC TATTGAAATC GTGGCAGCTC TGAAACAAAT 1260 

CGAGGGTATC AAAGATATCA AGGTTGAAGT TACCTGGTCG CCTGCTTGGA AAATCACACG 1320 

AATCAGTCGC TATGGCCGTA TTGCCCTTGG ACTACCACCT CGTTAAGCAG ACCAATCACT 1380 

TTTAAAGATG AAAATCAAAG GGCAAACTAG AAAACTAGCC GCAGGTTGCT CAAAACACTG 1440 

TTTTGAAGTT ATGGATAGAA CTGACGAAGT CAGCTCAAAA CACTGTTTTG AGGTTGTGGA 1500 

TAGAACTGAC GAAGTCAgCT CAAAACACTG TTTTGAGGTT GTGGATAGAA CTGACGAAGT 1560 

CAGCCCAAAA CACTGTTTTG AGGTTGTGGA TAGAACTGAC GAAGTCAGTA ACCATACCTA 1620 

CGGCAAGGCG ACGTTGACGT GATTTGAAGA GATTTTCGAG TATGAGTTTA TTTTTTACCT 1680 

GACTTGTCCA TATTCCAGAA GTCTGTCACG GCTCCGCGTG AAGCAGATGA TACGATGTGG 1740 

GCATATTTAC CGAGGACACC ACGGCTGTAA AGTGGTGGCA AGGTTGTTTC TGCCTTGCGT 1800 

TTTTCAAGTT CTTCTTCCGA TACGGCCATA GAAATTTCTT TGGTATCTTG GTCAACCGTA 1860 
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ACGATATCGC < 


CGGTACGGAG 


ATAGGCAATT 


GGTCCACCAT 


CCTGAGCTTC 


AGGAGCGATA 


1920 


TGTCCAACAA 


CCAGACCATA 


AGTACCACCA 


GAGAAACGTC 


CCTCCGTCAA 


GAGGGCCACC 


1980 


TTATCTCCCT 


GACCTTTACC 


AACAATCATT 


GAAGAAAGTG 


ATAGCATCTC 


AGGCATACCA 


2040 


GGACCACCTT 


TAGGTCCAAC 


AAAACGAACA 


ACGACTACAT 


CGCCATCAAC 


GATTTCATCT 


2100 


GTCAGAACGG 


CCTGAATCGC 


ATCTTCTTCT 


GAGTCAAAGA 


CCTTAGCTGG 


CCCAACGTGA 


2160 


CGACGCACTT 


TAACACCTGA 


TACCTTGGCA 


ACTGCACCGT 


CAGGAGCAAG 


GTTCCCGTTC 


2220 


AAGATGATAA 


GCGGACCATC 


CGCACGTTTT 


GGATTTTCAA 


GTGGCATGAT 


AACTTTTTGG 


2280 


CCTGGAGTCA 


AGTCTGCAAA 


GTCAGCCAAG 


TTTTCAGCTA 


CAGTCTTACC 


AGTACATGTG 


2340 


ATGCGATCTC 


CGTGAAGGAA 


ACCATTTGCC 


AACAAATACT 


TCATAACCGC 


AGGGACACCA 


2400 


CCGACTTCGT 


AGAGGTCTTG 


GAAGACATAC 


TGACCAGATG 


GTTTCAAGTC 


GGCCAAGTGA 


2460 


GGCACACGTT 


CTTGAATCGT 


ATTGAAGTCC 


TCAAGTGACA 


AGTCAACATT 


TGCGGCATGG 


2S20 


GCAATGGCGA 


GCAAGTGAAG 


AGTGGCGTTT GTAGAACCAC 


CGAGAGCCAT 


CGTTACAGTG 


2580 


ATAGCATCTT 


CAAAGGCTTC 


ACGAGTCAAG 


ATATCTGATG 


GTTTGAGACC 


AAGTTCCAAC 


2640 


ATCTTAACAA 


CAGCACGTCC 


TGCTGCTTCG 


ATATCTTCTT 


TCTTATCAGC 


TGATTCAGCT 


2700 


GGGTGAGAGG 


ATGACCCTGG 


CAAACTCATC 


CCTAGAACTT 


CGATAGCAGT 


TGCCATGGTA 


2760 


TTAGCAGTAT 


ACATACCACC ACAACCACCA GGGCCAGGGC 


AGGCATTACA 


TTCAAGACGT 


2820 


TTCACGTCCT 


CAGCTGTCAT 


GTCACCGTGG 


TTCCATTTTC 


CGATACCTTC 


AAAGACAGAA 


2880 


ACCAAGTCGA 


TATCTTTACC 


ATCAAGATTT 


CCCGGTGCAA 


TAGTTCCACC 


ATAGGCGAAA 


2940 


ATAGCTGGGA 


TATCCATATT 


AGCAATACCA 


ATCATAGATC 


CAGGCATGTT 


CTTGTCACAG 


3000 


CCACCGATAG 


CGACGAAGGC 


ATCCACGTTG 


TGACCACTCA 


TAGCCGCCTC 


GATGGACTCC 


3060 


GCGATGATGT 


CACGAGATCT 


TAGAGAGAAA 


CGCATACCAG 


GCGTTCCCAT 


AGCGATCCCG ¥ 


3120 


TCCGCTACGG 


TAATGGTTCC 


AAACTGTACA 


GGCCAAGCGC 


CTGCAGATTT 


GACACCTTCT 


3180 


TTAGCCAGTT 


TCCCGAAATC 


ATGCAAGTGA 


ATGTTACATG 


GTGTATTTTC 


CGCCCAAGTC 


3240 


GAAATCACTC 


CCACAATCGA 


TGTTTCAAAG 


TCCTTATCTG 


TCATACCAGT 


CGCACGAAGC 


3300 


ATAGCACGGT 


TAGGTGATTT 


AACCATGCTG 


TCATAAATGC 


TACTGCGGTG 


ACGTTTATCT 


3360 


AATTCAGTCA 


TCTTATCCCT 


CCCATTTCAG 


TTTTTACTAT 


TATAGCACAA 


TTTTCGCATG 


3420 


AAGAACAGAA 


TAAAATTCTT 


GAATTTTCAG 


AAAATTCTAT 


ACACATGTGA 


AATATTTAAA 


3480 


ATTAAAAACA 


ACAAAGCGGA 


TTAGTGCACT 


TTCTGATGAC 


CAGAATATGC 


TTTTTAATCC 


3540 


GCTTTCTTTA 


AATAACGTAC 


TGTAATTTTT 


ACAGAAATTC 


TTTCAAATAA 


GTGTATTTAA 


3600 


CATCTATCTT 


GCATTATAAA 


TTTCTAGAAC 


: CTTCTCTTTT 


ATATTCGATT 


> CACTCAAACC 


3660 
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ATACTCATTA AGAAGATAAT CCATTTTCCC TACTTGACCG AATCTTTCTT GAACACCCAT 3720 

CCGATGAATT TTTGTTATTC CATCATCAGA GAATAATTCA CATAAAGCAC TGCCAATTCC 3780 

ACCTATCTGA TTGTGGTTTT CTACAGTAAA TATAGTTTTT CCACTTAACA TTGTTTTTAT 3840 

CTGTTCTGGT ATCGGTTTGA TTCTAAATAA ATCTATCACA CCTACTGAAT AACCTAATTT 3900 

AGACAGTTCA TCTGCAACTC GAATACTTGG AGCAACCATT ATGCCAGAAG CAACGATTAC 3960 

AAGATCTTCA CCATGCCTTA ACTCAATGTA GCCTTTAGAA AAATCTTCTC CACCTTGATA 4020 

CACAGGAACT GGAGCTTTTC TAATTGTTCG AATATATTTT AGTCCTTTTA AGTCTAATGT 4080 

CTGGTTCAAT ATTTCACGAA ATTGGATATC ATCACTTGCT TCGAAAATGA TTGATTTAGG 4140 

AATTAAACGT AACAATCCAA TTTCTTCAAA TGGCATATGT GTTCCACCAT TCATCTCTGC 4200 

CGTTACTCCT GCATCTGATC CAATCACAGT GGCATCCAAT TGTGCGTATC CAAGAGAAAT 4260 

AAATAATTGA TCAAATACTC TTCGTGAAGC AAAAGGACCA AATGTATGAA GATAAGGTCT 4320 

AAACCCCTGA ATAGACAAGC CTGCTGCAAG GCCGACCATT TCTGCTTCCA TAATCCCAAC 4 380 

ATTCACATAA CGCTCTCCAA AGTCCTTTTC AAGATTATTA GTAGCCATCG AACTTGACAA 4440 

ATCGGCTTCT AAGACTACTA TATCAGAATC ACTTTGATTA GCCTCTAAAA GGAAGTCTCT 4 500 

ATATACATGC CGTAATTCTT TCGTACTTCT CATCATTCTG TTTCCTCCAA TTCCTGACTT 4560 

AATCTTTCTA CAACTGAAGT TAACATTTGT TTCTCCTCTA CAGTAGGGCG AAGATGATGA 4620 

TTGGATTTCA TTTCTTCCAG CTCTTGAACC CCTTGACCTT TAATAGTATC TAATACAATG 4680 

CACTTAGGTG ATGAATTATT TGACTGTTTT AATTGGACAA TCCCTTCATA AATTTCTCTA 4740 

ATATCTGAAC CCTTGACCCT AATGGATTCA AATCCAAATG CTGAAAATTT TTCTACGAAA 4800 

TCACCTGGAT TACAAATATC CTTTGTAAAA CCATCTAATT GTTTTTTGTT ATCATCAACA 4860 

AATACAATTA AGTTGGATAA CTGTTGATGA GAAGCAAACT GTATAGCCTC CCAACATTGT 4 920 

CCCTCATTTA ACTCACCATC TCCAACAATA GCGTAAGTAT AAAAGGGACT CTTTCTTATT 4 980 

CTCTGACCAT ATGCAAGTCC AGTTGCAACA CTAATTCCTT CTCCTAAAGA GCCCGTTGTC 5040 

ATATCTATGC CTGGCGTTAG ATTTCTATCA GGATGAGACG GTAATTTGGT TCCATTTGTA 5100 

TTTAAAGAAT ATAAGAATTC TTTGTCAAAG AAACCATTCA AATAGAGTGT ACTGTATAGA 5160 

GCTGGTCCTC CGTGACCTTT TGATAATATG AAATAATCTC TATCTCGTGC TGCAAATATT 5220 

TCTGGAGTCA TTGGCATTAT TTCACCATAA AGCACCGCTA AAACTTCTAC GATAGACAGA 5280 

CTTCCTCCGT AATGTCCGAA TCCAAGATGA TTCAATGTTC TAAGAGTATT TAATCGGATG 5340 

TTAGTCGCAA ATTTTCTTAA CCCATCTTCT CTATTTTTAC TTAAAATCAT CCCTTATTCC 5400 
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TCCGTTGCAG ATGGCTTTTT AATAAAGGAT ACTCCAAACA TAACTGCTAG AATAAGAACA 54 60 

AGACCAATCA CAATGCCTGC TTGTGAGCCA AATTGATTTA ACATTCCTAA AATAATTCCT 5520 

GATAGACCAA AATCTGCATC TGAGAAAGTT GATCCTTGGA AACCAAGTCC TCCCAAAACT 5560 

GGCATTAAAA AGACTGGAAG AAAACTGATT AAAATACCTT GTAAAAATGC TCCAATAGTG 5640 

GCTCCACGAA CACCACCAGA TGCATTCCCA ATGACACCTG CAGTCGCTCC ACAGAAGAAA 5700 

TGAGGCACAA CACCTGCTAA GATAACAACC GTTCCTGAAG CAATCATAAT TACCATACTT 5760 

ACTAAACCAC CAACAAAACT AGAGATAAAT CCAATTAGAA CTGCATTGGG TGCATAAGTA 5820 

TAAACAATCG GACAATCCAA AGCAGGTTTT GAATTAGGTA CAAGACGCTC TGAAATACCT 5880 

TTAAAGGCTG GAACAATTTC GCCCAAAATA AGGCGAACAC CTGCTAAAAT AACAAATACC 5940 

CCTGCTGCAA ATTGACCTGC TAATTGTAAA GCATAAACTA GACCACTTGT ACCACTACTG 6000 

ATTTCTTTTT CTATATATTC TGACCCTGCA AAGATAGCTA CAATAATGTA AATAACTGCC 6060 

ATGGATAAAG TAATACTAAC AGTACTATCA CGTAAAAAAG CTAAACTCTT TGGAAATTTA 6120 

ATGTCCTCTG TTGATTTTGA TTTGTCACCG ATAAGGCTAC CAGTAAAACC ACTCAACCAA 6X80 

TATCCCAAAG AACTGAAATG ACCTAAAGCT ACCTTGTCAT TTCCAGTTAA TTGAACCATA 6240 

TATTTTTCCA CAAATGCTGG GGAAATACTC ATAATAATAC CGACTCCTAA TCCTCCTAGT 6300 

AAGATGAGAG GCAAGCTAGT AAAGCCAGCA ACTGATAAAA TGACCGCAAT CATACATGCC 6260 

ATATATAGAG TGTGCTGCCC TGTTAAAAAA ATATATTTAA ATCCAGTAAA ACGAGCGATT 6420 

AAGATATTGA ACACCATGCC TGCAAACATA ATCATTGCAG TAGCTGAGCC ATATGTTGTT 6480 

AAAGCTACAG CTACAATTGC TTCATTATTC GGCACAACGC CAGATAAATG AAAAGCATGC 6540 

TCAAACATGG TACCAAATGG ATTTAAAGAA TTTTGTACAA TTCCTGCACC ACCAGATACA 6600 

ACTAAGAAAC CAACAAAGGT CTTAATTCCA CCTTTAATAA TATCAGGTAA TTTCTTCTTC 6660 

TGAAGAACTA ATCCTAAGAT TGCAATTAAA GCTACTAAAA TAGCTGGTGT ACTAACAATA 6720 

TCCAATATGA ACTTCATCAT GACGCTAGCC TCCTATATAA GTCCTTTTTC TTCACAAAirT 6780 

TTAGTAATTA ATTCTCGTAG TTCATCCATA TCAATAATAC TATTTAAGAT ACGAACATCT 6840 

CCAAGATGAC TAGCTGAATC AGCTAGATCA CGACCAACAA TCCAAATATC AGCTGCATTT 6900 

GGATCTGCTC CACCTAAATC ATAATGTTCA ACTTCTACAT CCGAAACATT CAAATCACTC 6960 

AATACAGATT CAATATTCAT CTGTACCATA AAACTTGAAC CTAATCCTGA ACCACAAGCT 7020 

GTACCAATTT TTAACATTAT CTAATCCTCC TGTTTAATTA TCATTTTAAT GTCATCATAG 7080 

TTTTTTGATG ATATTAAAGT TTGAACATGA TTTTTATCTC TTAAAATTGT TGTTAAATGT 7140 

GACAAAGCCT TTAAATGACT CTCATTATCA ATGGCTGCAA TACAAATCAA CAATCTTACC 7200 
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TCTTGTTCTG 


GATTATCCAA 


TAAATAAATC 


GGTTCTTCCA 


AAACTAACAT 


TCACATTCCT 


7260 


ATTTCATTCA 


CACCTTCATC 


TGGCCGAGCG 


TGAGGAATTG 


CTACTCCCTT 


CCCTAAATTA 


7320 


ATAAAACGTC 


CAAACTCTTC 


TACTTTTTGA 


ATCATTGCCT 


CAGGGTAGTT 


CTCAGTTATC 


7380 


TTATCTTGAT 


CCAAAAGCGG 


TTTAGCTGCT 


AAACGAATCG 


CCTCCTTCCA 


TCCTAATTTT 


7440 


TGCGAACTAA 


CCTGATAGGT 


TTCTTTGGTA 


ATAAGTTGTT 


CTAGCACTGG 


TACAATTTCC 


7500 


TTTCTATCAT 


TTTTTTGGTA 


AAGATAATTC 


TTTAACGCCA 


ATCTTAATTC 


CAATTCTTGT 


7560 


GTAATAATTC 


CATATCTTTT 


GACAATATTC 


AGGATTTGTT 


CAATCTCAAA 


ATCTCCATAC 


7620 


TCTAAATTCG 


GAAAATCTTT 


TAACACTAGT 


TCTACTAGTT 


GTATTGCTTG 


CTCTTCAGTC 


7680 


ATCATAACCG 


AAACTAGATA 


ATTTGGCTTT 


TCTGTCTCCA 


CCTTTATGGT 


AGAAAAAACC 


7740 


ATATCATAGT 


CACTACTAGC 


TTTCACCTGT 


AAATCATCAA 


TCTTTGAGGT 


TCCTATAAAC 


7800 


TCAATTTGAG 


GAAATAATGC 


TAATAGATTC 


TCTTTTAACA 


TCAATGAAGA 


ACTAACACCA 


7860 


TTAGGACAAA 


TGATTGCTGC 


TTTATACCAT 


TTTTGAGGCA 


AAGTATCTGC 


TTTCTTTAAA 


7920 


TAACCTCCGA 


AATGGATAAC 


AAAATATGCT 


GTTTCACTAT 


CAGGTATGGG 


ATTGTCAATA 


7980 


GCGTCCATCA 


AGGGCATCAA 


AGAATCTTTG 


ACTAATTCAA 


ATAAATCAGG 


ATAATGTTCT 


8040 


TTAACATGCA 


ATACATATTC 


ATTTGAACTA 


GGTAGGCCGA 


ACTTTAATCT 


ATAGTAAGCC 


8100 


GGTATAAGGT 


GGCGGCGAAG 


ATTTTCTCTC 


AATCCTTCCC 


tttgtttaaa 


ATGTAACAAA 


8160 


GAAATATCTT 


CCATTCTACT 


TATAATAGCC 


TCTGTTAATT 


GATTAAAGTA 


AACCGGAGCA 


8220 


ACATCTACTT 


CACCTTCAAA 


GCAACTTGAT 


AATAAAACGG 


TGATATAGCG 


ATAATCATCC 


8280 


TCAGAAAACA 


CCGTATCTAT 


AATTCCCAAA 


TCAACCACTG 


TATCCAATAA 


AATAGTGGTT 


8340 


ATATCTTGAA 


TAACAGGAGA 


TACTAATGTC 


TCTGAAAGAC 


ATACTCTTTC 


AACATCCCTT 


8400 


TGATACCTAC 


ACAGAATGAA 


TACTAAACCG 


AAAAGGTAAA 


CTTTTAATTG 


ATTAACAATA 


8460 


GGTACTAGCT 


GTAGCTTCTC 


ATAATAATCT 


TTAACTACCT 


GATCAATCAA 


ATCATAAGTT 


8520 


AATGAATACC 


CCCAACTGGA 


TAAAACATAA 


TCCAAACCCC 


AAATCCCTAT 


GGAGGATTCC 


8580 


AGCAACTCAC 


TAACCATTTG 


AAAAGCTAAG 


CGGTGCTTAT 


TCCACTCTGA 


ACCGTGTAAA 


8640 


GTATAACCTT 


TTGCTCTACT 


GTACCCTAGC 


TCCAAATCAT 


TATCTAACAT 


AATCTTTCTT 


8700 


AATGATTGAA 


TATCAGATAA 


GGTTGTATTC 


TTACTTACTT 


TCAAAAAGTC 


TTGGTAATGA 


87 60 


CTATTCGATA 


TAAAATCTAA 


TCGGCAAAAA 


GTGTAAAGAT 


AGATTAAAGC 


TAAGCGAGTC 


8820 


GACTTTGGTA 


AAACCAATTC 


ATCCGACTTA 


ATAATATCTG 


TCAAAGACTG 


CTTCGTACGA 


8880 


TTTGATAAAC 


TATAGCGACC TTGCTTTTTA TCCAGCACTA TCCCTTTATT 


AGCTAGATAA 


8940 
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GGCACTAAAT AATCTATTCC TTCTTTGACT TCCTTTATAG GTAAGCTCAC CTTAACAGAT 9000 

AATTCATATA ACGATAGCTC ACAATGATCC ATCAAAGTCA TCAAAATAAC TAGTGCTCTA 9060 

TAATCAAAC 9069 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8654 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

CGAGACAACA AGATGAAGAA AAATTTGCCC TATCGTTTGT GGCGCTTGCA AGTGTAGCAC 60 

TTCTTGCAGC CTGTGGAGAA GTGAAGTCTG GAGCAGTCAA CACTGCTGGT AACTCAGTAG 120 

AGGAAAAGAC AATTAAAATC GGGTTTAACT TTGAAGAATC AGGTTCTTTA GCTGCATACG 180 

GAACAGCTGA ACAAAAAGGT GCCCAATTGG CTGTTGATGA AATCAATGCC GCAGTGGTAT 240 

CGATGGAAAA CAAATCGAAG TAGTCGATAA AGATAATAAG TCTGAAACAG CTGAGGCTGC 300 

TTCAGTTACA ACTAACCTTG TAACCCAATC TAAAGTATCA GCAGTCGTAG GACCTGCGAC 360 

ATCTGGTGCG ACTGCAGCTG CGGTAGCGAA CGCTACAAAA GCAGGTGTTC CATTGATCTC 420 

ACCAAGTGCG ACTCAAGATG GATTGACTAA AGGTCAAGAT TACCTCTTTA TTGGAACTTT 480 

CCAAGATAGC TTCCAAGCAA AAATTATCTC AAACTATGTT TCTGAAAAAT TAAATGCTAA 540 

GAAAGTTGTT CTTTACACTG ACAATGCCAG TGACTATCCT AAAGGGATTG CAAAATCTTT 600 

CCGCGAGTCA TACAAGGGTG AAATCGTTGC AGATGAAACT TTCGTAGCAG GTGACACAGA 660 

CTTCCAAGCA GCCCTTACAA AAATGAAAGG GAAAGACTTT GATGCTATCG TTGTTCCTGG 720 

TTACTATAAT GAGGCTGGTA AAATTGTAAA CCAACCGCCT GGCATGGGAA TTGACAAACC 780 

AATCGTTGGT GGTGATGGAT TCAACGGTGA GGAGTTTGTA CAACAAGCAA CTGCTGAAAA 840 

AGCATCAAAC ATCTACTTTA TCTCAGGCTT CTCAACTACT GTAGAAGTTT CAGCTAAAGC 900 

TAAAGCCTTC CTTGACGCTT ACCGTGCTAA GTACAATGAA GAGCCTTCAA CATTTGCAGC 960 

CTTGGCTTAT GATTCAGTTC ACCTTGTAGC AAACGCAGCA AAAGGTGCTA AAAATTCAGG 1020 

TGAAATCAAG AATAACCTTG CTAAAACAAA AGATTTTGAA GGTGTAACTG GTCAAACAAG 1080 

CTTCGATGCA GACCACAACA CAGTCAAAAC TGCTTACATG ATGACCATGA ACAATGGTAA 1140 

AGTTGAAGCA GCAGAAGTTG TAAAACCATA ATAGAAAAAT GTTGAAATAG GGAATGAGCC 1200 

TTTGACTCAC TCCCTGTTTC GATATTTAAT ACTCTTCGAA AATCTCTTCA AACTGCGTCA 1260 
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ACGTCGCCTT GGATTATATA TGTCACTGAC TTCGTCAGTC TTATCTACAA CCTCAAAGCA 1320 
GTGCTTTGAG CAACCTGCGG CTAGTTTCCT AGTTTGCTCT TTGATTTTCA TTGAGTATAA 1380 
GAACCTATCA AAAAGTGAGG GAAAACCCTC GGAATTATAA ATAGAAAGAG TGAATCTTAT 1440 
GCTCCAACAA CTCGTAAATG GTTTGATTCT AGGTAGTGTT TACGCGCTGT TAGCCCTAGG 1500 
ATATACCATG GTTTACGGAA TTATCAAGCT CATCAACTTC GCCCATGGTG ATATTTATAT 1560 
GATGGGAGCC TTTATCGGTT ATTTCTTGAT CAATTCTTTC CAAATGAATT TCTTTGTAGC 1620 
GCTTATTGTA GCTATGCTAG CGACAGCTAT TCTTGGTGTC GTGATTGAGT TTCTTGCTTA 1680 
CCGACCTTTG CGCCACTCTA CTCGTATTGC TGTTTTGATT ACGGCTATTG GGGTTTCTTT 1740 
CCTATTGGAG TATGGAATGG TCTATCTGGT TGGTGCCAAT ACCCGTGCCT TCCCTCAAGC 1800 
GATTCAAACA GTTCGATATG ATTTGGGACC AATTACCTTA ACAAATGTGC AGTTAATGAT I860 
TTTGGCCATT TCCTTGATTT TGATGATTTT GTTACAAGTC ATTGTCCAAA AGACTAAGAT 1920 

GGGGAAAGCC ATGCGTGCAG TATCAGTAGA TAGCGACCCG GCGCAATTGA TGGGGATCAA 1980 

TGTAAACCGT ACGATTAGCT TTACCTTCGC TTTGGGTTCT GCTCTTGCGG GTGCGGCTGG 2040 

TGTTCTGATT GCTCTTTATT ATAACTCTCT TGAGCCTTTG ATGGGGGTTA CTCCAGGTCT 2100 

TAAATCTTTC GTTGCCGCAG TACTTGGTGG TATCCGAATT ATTCCTGGTG CGGCTCTTGG 2160 

TGGCTTTGTG ATTGGTCTAT TGGAAACCTT TGCGACTGCC TTTGGGATGT CAGATTTCCG 2220 

TGATGCCATT GTTTATGGAA TCTTGTTGTT GATCTTCATT GTCCGCCCAG CTGGTATCCT 2280 

TGGTAAGAAT GTGAAAGAGA ACGTCTAAAC GATGAAGGAA AATTTAAAAG TTAATATTCT 2340 

ATGGTTACTC CTTTTGTTAG CTGGCTATAG CTTGATTAGT GTACTGGTTT CACTCGGAGT 2400 

ACTTAATCTA TTCTATGTAC AGATTTTACA ACAAATTCGA ATTAATATTA TTTTGGCTGT 24 60 

TGGTCTCAAC TTAATCGTTG GTTTTTCAGG ACAATTTTCA CTTGGTCATG CTGGTTTCAT 2520 

GCCGATTGGT GCCTATGCAG CAGCTATTAT TGGTTCTAAA TCACCAACCT ACGGTGCCTT 2580 

CTTTGGAGCT ATGCTTGTAG GGGCTTTGCT TTCAGGAGCA GTTGCCTTAC TTGTCGGCAT 2640 

TCCAACCTTG CGCTTGAAGG GGGACTATCT TGCGGTAGCA ACTCTGGGTG TTTCTGAAAT 2700 

TATCCGTATC TTTATCATCA ATGGTGGAAG CCTTACAAAT GGTGCGGCAG GTATCTTAGG 27 60 

GATTCCTAAC TTTACAACTT GGCAAATGGT TTACTTCTTT CTCGTGATTA CAACCATTGC 2820 

AACCTTGAAC TTCTTGCGTA GCCCAATTGG TCGTTCAACC CTCTCTGTTC GTGAAGATGA 2880 

AATCGCTGCT GAGTCAGTTG GGGTTAATAC GACTAAAATT AAAATCATCG CTTTTGTCTT 2940 

TGGTGCCATT ACTGCAAGTA TTGCTGGGTC ACTTCAGGCA GGATTTATCG GGTCTGTTGT 3000 



WO 98/18931 



PCT/US97/19588 



748 



ACCGAAAGAT 


TACACCTTCA 


TCAACTCAAT 


CAACGTTTTG 


ATTATTGTTG 


TATTTGGTGG 


. 3060 


ACTCGGTTCC 


ATTACAGGTG 


CGATTGTTTC 


GGCTATTGTT 


CTGGGAATTT 


TGAATATGCT 


3120 


TCTCCAAGAT 


GTTGCTAGTG 


TGCGTATGAT 


TATTTACGCT 


TTGGCCTTGG 


TATTGGTAAT 


3180 


GATTTTCAGA 


CCAGGTGGAC 


TCCTTGGAAC 


ATGGGAACTG 


AGCCTATCAC 


GTTTCTTTAA 


3240 


AAAATCTAAG 


AAGGAGGAAC 


AAAACTAATG 


GCATTACTTG 


AAGTAAAACA 


GTTAACCAAA 


3300 


CATTTTGGTG 


GTCTAACAGC 


TGTTGGAGAT 


GTGACTCTTG 


AATTGAACGA 


AGGGGAACTG 


3360 


GTTGGATTAA 


TCGGTCCAAA 


CGGAGCTGGG 


AAAACCACCC 


TTTTCAACCT 


TTTGACCGGT 


3420 


GTTTATGAAC 


CAAGCGAGGG 


AACAGTAACC 


CTAGATGGTC 


ACCTTTTGAA 


TGGGAAATCA 


3460 


CCTTATAAGA 


TTGCCTCTTT 


GGGACTTGGA 


CGTACTTTCC 


AAAATATCCG 


TCTCTTTAAA 


3540 


GATTTAACAG 


TTTTAGATAA 


TGTTTTGATT 


GCTTTTGGAA 


ACCATCACAA 


ACAGCATGTT 


3600 


TTTACTAGTT 


TCTTACGCTT 


ACCAGCTTTT TACAAGAGTG AAAAAGAATT AAAGGCTAAA 


3660 


GCTTTGGAAT 


TGTTGAAAAT 


CTTTGATTTA 


GATGGTGATG 


CAGAGACTCT 


TGCTAAAAAT 


3720 


CTTTCCTACG 


GACAACAACG 


TCGTTTGGAA 


ATTGTTCGTG 


CCCTTGCTAC 


GGAACCTAAA 


3780 


ATTCTCTTCT 


TAGATGAACC 


AGCAGCAGGT 


ATGAACCCAC 


AGGAAACAGC 


CGAATTGACT 


3840 


GAGTTAATTC 


GTCGTATCAA 


AGATGAGTTT 


AAGATTACAA 


TCATGTTGAT 


TGAACACGAT 


3900 


ATGAATCTGG 


TCATGGAAGT 


AACAGAACGT 


ATCTACGTAC 


TTGAATATGG 


CCGTTTAATC 


3960 


GCTCAAGGAA 


CTCCAGACGA 


AATTAAGACC 


AATAAACGCG 


TTATCGAAGC 


TTATCTAGGA 


4020 


GGTGAAGCCT 


AATGTCTATG 


TTAAAAGTTG 


AAAATCTTTC 


TGTGCATTAC 


GGTATGATCC 


4080 


AAGCAGTTCG 


TGATGTAAGC 


TTTGAAGTTA 


ATGAAGGAGA 


AGTTGTTTCC 


CTTATCGGTG 


4140 


CCAACGGTGC 


AGGTAAGACA 


ACTATTCTTC 


GCACCTTGTC 


AGGTTTGGTT 


CGACCAAGTT 


4200 


CAGGAAAGAT 


TGAATTTTTA 


GGTCAAGAAA 


TCCAAAAAAT 


GCCAGCTCAG 


AAAATCCTGG 


4260 


CAAGTGGTCT 


TTCACAAGTT 


CCAGAAGGAC 


GCCACGTCTT 


TCCTGGCTTG 


ACTGT7ATGG 


4320 




nJ\ 1 uuunuv. i 


TTCTTAAAGA 


AAAATCGTGA 


AGAAAATCAA 


GCTAACTTGA 


4 J BO 


AGAAGGTTTT 


CTCACGCTTT 


CCTCGTCTTG 


AAGAACGGAA 


GAACCAAGAT 


GCAGCCACTC 


4440 


TTTCAGGGGG 


GGAACAACAA 


ATGCTTGCCA 


TGGGACGCGC 


CCTCATGTCA 


ACACCAAAAC 


4500 


TTCTTCTTTT 


AGATGAACCA 


TCAATGGGAC 


TTGCCCCAAT 


CTTTATCCAA 


GAAATTTTTG 


4560 


ATATCATTCA 


AGATATTCAG 


AAGCAAGGAA 


CAACGGTCCT 


CTTGATTGAA 


CAAAATGCCA 


4620 


ATAAAGCACT 


TGCAATCTCT 


GACCGAGGAT 


ATGTACTGGA 


AACAGGGAGA 


ATCGTCCTAT 


4680 


CAGGAACAGG 


AAAAGAACTC 


GCTTCATCAG 


AAGAAGTCAG 


AAAAGCATAT 


CTAGGTGGCT 


4740 


AAAACAATCC 


AGTGGATTGT 


TTTAGTCGGC 


AGATGGAGAT 


TACGAAGTAA 


TCATCAATAT 


4800 
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AGTCCGGGGG ACCTTTTTAG TCGGTAGATT GAGATTGCAA ACAAATCTGC ATCTACATTG 
AAAGCTTAAT TTCTAATAAT TGAAAAAATC GAATGAAAAA TTTCTTACCT TCATTCACAG 
AGCTCGATTT CAGAGCTCTT TTTGCTAGCT TATTCATACT TTTCTGAATT TCGAAAAAGA 
AATGTAAGCG TTTGATAGAT TTACAAAAAG ATTGTATAAT AGGGATAAGA ATAGAAAAGG 
AGAAGTCTCA TGGCAGTTAA AGATTTTATG ACCCGCAAGG TAGTTTATAT TAGTCCAGAT 



ATAACAGTAT CTCATGCAGC 
ATCGAAAATG ATCAATTAGT 
TCTAAAGCAA CAAGTCTTTC 
AAAGATGTCA TGATTCGCGA 
ACTTATCTGA TGTTGAAAAA 
TACGGAGTTA TTACTGACCG 



AGATTTGATG AGAGAGCAAG GTTTGCACCG 
TGGTTTGGTG ACTGAGGGAA CCATTGCACA 
TATCTATGAG ATGAATTATC TTCTGAATAA 



TGTTGTCACT GTCTCAGGCT 
TAAGATTAGT ATTCTCCCTG 
TGACGTTTTC CAAGCCTTTC 



GAAGAAGGGA TTCGTGTACG 



CTTTGTTACA 
GTTTCTTTGA TTGTAGAAGA AAATTTGAAT 
GATGGTAAGG TGATTATCGA AGTGCAAATC 
GAAAAATTTG AAGCAAATGG TATTCAAGTG 

AGGCTTCTTT 
GCTATAATGA 



TTGTAAGAAG GGAAGCCCAA 
GGAAAGAAAT GATAAAATAT 



TTTCAGTAAT TCGTCAGAAA ATTGACGCAA 
CTCTTTGACC TCGAAGGGCT AGAGGAAGAG 
CCTGATTTTT GGAACGATAA TATTCCGGCC 
AAAAACACTT ACAATACCTT CCATAAGATG 

CGAGTCAGTG 
CTACGAGATG 



TTGGATTTTT TGGCTGAAGA 



CTTGATAAGA TAATGACCAG 



GAAGATGAAG 
ATCTCCCATA 
GATGGATCAA 
GAAGAAATCG 
TTTCATGAAA 
AATAATGTAA 
ATCGTGAAAA 
ATTGCCATCT 
CAAAAAACGT 
GAAGAGTTGC 
CATGATGAAC 
ACTCTACTCT 



ATGCTAGTCT 

TCGTAGATAA 

TTGAAATTGC 

TTGGTGTTCT 

CAGTCAATAT 

TTGATTTACC 

CTCGCACTTC 

AGGGGATTAG 

AAAAGGAGTA 



TCTGCCTGTT 

AGCAAGTCCA 

GACAAAAGTA 

AGAAGATGCA 

CCATCAAGTA 

AGGTTATGGC 

TGGAAAAATT 

TCCGCGTAAG 

AGCCTTGAAA 

AGCAAAAGTC 

AGCAAAAGAT 

TTTATGGACA 



ATTAGCTTCT TTCAGGGGGT 
TGGAAAACAA GATGACAGAA 
CCCAAGAATT AAATGAATTA 
AGGATGAACT CGAAATTTTA 
TGGTAGCGCA GTTAGCCGAA 
TGTCAGAACC TTATGACCAC 



AACAATGCCA 
GATATGTTGC 
TTGGATTACC 
CCTAATGCCT 
CCATTTGACT 
TTGGATGATA 



TTCGTATGTA TACTCGTTAT 
AAGCAGGTGA TGAGGCTGGT 



CTGAGGCGCA GGACTGGGGT 
AAGGCTTTAA AGTGGAAGTG 



TCTTGGAAAT CCATCCAGGT TCTGGTGGTA 

GGTAATGCTA 

ATTAAGTCGG TAACTTTATC ATTTGAAGGG 

ATGGTCTCCT CAAGTCAGAA ATGGGTGTTC 

TCTTTCACAT 
GAAGATGATA 



CTGCCAAACG TCGCCATACC 
CTATTGAAGT GGAAATCCGT 



ACCGCTTAGT GCGAATCTCA 
CTGTAGAAGT GATGCCAGAA 
TCAAGATGGA TACCTTCCGT 



4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 
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TCAGGTCGTC CCGGTGGACA AAACGTCAAT AAGGTTTCAA CAGGTGTACG TTTAACCCAC 6600 

ATTCCAACTG GAATTGTTGT CCAATCAACA GTAGATCGTA CCCAGTATGG AAATAGAGAT 6660 

CGTGCCATGA AGATGTTGCA GGCTAAGCTC TATCAAATGG AGCAAGATAA GAAGGCTGCG 6720 

GAGGTAGATT CTCTCAAAGG TGAGAAAAAG GAGATCACTT GGGGAAGCCA AATCCGTTCT 6780 

TATGTCTTCA CGCCTTATAC TATGGTAAAA GATCACCGAA CTAGCTTTGA GGTTGCTCAG 6840 

GTAGATAAGG TTATGGATGG GGACCTAGAT GGTTTTATCG ATGCTTATCT CAAGTGGCGA 6900 

ATTAGCTAAG ATAGAAAGGA ACTCACATGT CAATTATTGA AATGAGAGAT GTCGTTAAAA 6960 

AATACGACAA CGGAACAACT GCTCTACGCG GTCTTTCGGT TAGCGTTCAA CCGGGGGAAT 7020 

TTGCTTACAT CGTAGGACCT TCAGGAGCAG GGAAGTCAAC TTTTATTCGT TCTCTGTATC 7080 

CTGAAGTAAA AATCGATAAA GGAAGCCTAT CAGTTGCTGG TTTTAATCTG GTTAAGATCA 7140 

AAAAGAAAGA TCTCCCGCTT CTACGTCGTA GTGTTGGGGT TGTCTTCCAG GATTATAAAT 7200 

TGTTACCAAA GAAAACTGTC TATGAAAATA TTGCTTACGC TATGGAAGTA ATCGGGGAAA 7260 

ATCGCCGTAA TATCAAAAGA CGAGTGATGG AAGTTTTGGA CTTGGTTGGA TTGAAGCATA 7320 

AGGTTCGTTC TTTCCCAAAT GAACTCTCAG GTGGGGAGCA ACAGCGGATT GCGATTGCGC 7380 

GTGCAATTGT AAATAATCCC AAAGTATTGA TAGCTGATGA GCCAACAGGA AATCTGGATC 7440 

CGGATAATTC ATGGGAAATT ATGAATCTCT TGGAACGGAT TAACyTACAA GGAACAACTA 7500 

TTTTGATGGC GACTCATAAT AGCCAGATTG TAAATACCTT CCGCCACCGT GTCATTGCCA 7560 

TTGAAAATGG CCGTGTCGTT CGTGACGAAT CAAAAGGAGA GTATGGATAC GATGATTAGT 7620 

AGATTTTTTC GCCATTTATT TGAAGCCTTA AAAAGTTTGA AACGAAATGG TTGGATGACA 7680 

GTAGCTGCTG TCAGTTCACT CATGATTACT TTGACCTTGG TGGCAATATT TGCATCTGTT 7740 

ATTTTCAATA CAGCGAAACT AGCTACAGAT ATTGAAAATA ATGTCCGTGT AGTAGTTTAT 7800 

ATCCGAAAGG ATGTGGAAGA TAATAGTCAG ACAATTGAAA AAGAAGGTCA AACTCTTACA 7860 

AATAATGACT ACCACAAGGT ATATGATTCT TTGAAGAACA TGTCTACGGT TAAAAGTGTT 7920 

ACCTTTTCAA GTAAAGAAGA ACAATATGAA AAATTAACCG AGATAATGGG AGATAACTGG 7980 

AAAATCTTTG AAGGAGATGC CAATCCTCTC TATGATGCCT ATATTGTAGA GGCAAACACT 8040 

CCAAATGATG TAAAAACTAT AGCCGAAGAT GCTAAAAAAA TTGAAGGTGT CTCTGAGGTT 8100 

CAAGATGGCG GTGCCAATAC AGAAAGACTC TTCAAGTTAG CTTCATTTAT CCGTGTTTGG 8160 

GGACTAGGGA TTGCTGCTTT GTTAATTTTT ATCGCAGTTT TCTTGATTTC AAATACCATT 8220 

CGTATTACCA TTATTTCCCG CAGTCGCGAA ATTCAAATCA TGCGCTTGGT CGGAGCTAAA 8280 

AACAGTTATA TCCGTGGACC GTTCTTGTTA GAAGGAGCCT TTATCGGTTT ATTGGGAGCT 8340 
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ATCGCACCAT CTGTTTTGGT CTTTATTGTT TATCAAATTG TTTACCAATC TGTCAACAAA 8400 

TCGTTGGTAG GGCAAAATCT ATCCATGATT AGTCCAGATT TATTTAGTCC GTTGATGATT B460 

GCCCTACTAT TTGTGATTGG GGTTTTCATT GGTTCATTGG GATCAGGAAT ATCCATGCGC 8520 

CGATTCTTGA AGATTTAGGT AAAATAGCTG CTTTTATGAG GAGATTGTAA AATCTCCTTT 8580 

TTTGCTACAA GAGTTTTTGA AAAGAGATGC GCAGAAGAAA AGAGCTTCCA AAGAAGTCCC 8640 

CCAGAGAAGA CTTC 8654 



(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

TGTCGCGTCA AAATCATTAC TATGCCTATG TATAGCCCTT ACTATGACTT GGCTAAACAC 60 

GTTCGCTTTC AAATTTCTAG GCTCAGGCTG AAACAGTCTC CCAGGCTGTT CACTCCCGAA 120 

TGCTAAAATC GTTCTTGATC GCTTTCACAT TGTACAACAT CTTAGCCGTC CTATGAGTCG 180 

TGTGCATGTC CAAATCATGA ATCAGTTTCA TCGAAAATCC CATGAATACA AGGCTATCAA 240 

GCGCTACTGG AAACTCATTC AACAGGATAG CCGTAAACTG AGTGATAAGC GATTTTATCG 300 

CCCTACTTTT CGCATGCACT TAACAAATAA AGAAATTCTT GACAAGATTT TAAGCTATTC 360 

AGAAGACTTG AAACACCACT ATCAGATCTA TCAACTCTTA CTTTTTCACT TTCAGAACAA 420 

AGACCCTGAG AAATTTTTCG GACTCATTGA GGACAATCTG AAGCAGGTTC ATCCTCTTTT 480 

TCAGACTGTC TTTAAAACCT TTCTCAAAGA TAAAGAAAAG ATTATCAACG CCCTTCAACT 540 

ACACTATTCT AATGCCAAAC TGGAAGCGAC CAATAATCTC ATCAAACTTA TCAAGCGCAA 600 

TGCCTTTGGT TTTCGAAACT TTGAAAACTT CAAAAAACGG ATTTTTATCG CTTTGAACAT 660 

CAAAAAAGAA AGGACGAAAT TTGTCCTTTC TCGAGCTTAG CTGACTTCAA CCCACTACAG 720 

TTGACAAAGA GCCTAATTTC CATAAAAATT GACATGGAAA TTATAAAACC ATTACTAGTT 780 

TAGTCCTTTT TGATAACGTG CCAATTCGGC TTGGTTCGCC CAAACATAGT GACCTGGACG 840 

GATTTCTACC ATAGATGGCT TATCAGTCTC ATAGTCGTGT TGACTTGGAT CGTAAACCTT 900 

CAAGACCTTC TTACGTTCCA AGATTGGATC TGGGATTGGT ACCGCTGAAA GCAAGGCTTG 960 

AGTATATGGG TGAATTGGAT TGTTAAACAA TTCTTCTGTT TCTGCAACCT CTACAATAAC 1020 
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ACCCTTGTAA 


ATAACTGCGA 


TACGATCTGA 


AATAAAGCGA 


ACAACCGACA 


AGTCATGGGC 


1080 


GATGAAGAGA 


TAGGTCAGGC 


CGAGCTCTTT 


TTGGAATTTT 


TTGAGCAAGT 


TCAAGACTTG 


1140 


GGCACGTACA 


GAAACGTCCA 


AGGCTGAAAT 


TGGCTCATCT 


GCAATAACAA 


AGTCTGGTTG 


1200 


CATGACCAAG 


GCACGGGCAA 


TACCGATACG 


TTGACGTTGA 


CCGCCTGAGA 


ATTCATGAGG 


1260 


GTAACGAGTC 


AAGTGCTCAG 


CAAGAAGACC 


TACTTCACGG 


ATAATATTTT 


GAACTTTCTC 


1320 


TTTACGTTCT 


TCTTCATCCT 


TAAATAAACG 


GTGATTGTAA 


AGACCTTCAG 


AAATAATATA 


1380 


ATCAACAGTC 


GCACGTTCAT 


TCAAACTTGC 


GGCAGGGTCT 


TGGAAAATCA 


TCTGGATTCG 


1440 


ACGAATCAAT 


TCCGCACCTT 


GTTCACGCGA 


TTTCTTACCA 


TTAATCTTTT 


G AC CATC AAA 


1500 


AATGATATCT 


CCATTACTTG 


TATCATTTAG 


ACCGATGATA 


GCACGACCAA 


TAGTTGTTTT 


1560 


CCCACTACCG 


. GACTCACCTA 


CAAGCGAGAA 


AGTTTCTCCC 


TTGTTGATAA AGAAGTTAGC 


1620 


ATTTTTAACC 


GCGACAAACT 


TCTTACTTCC 


TTCACCGAAG 


GAAATTTCTA 


AATCTTTGAT 


1680 


TTCTACTAAT 


TTTTCAGACA 


TTTCCTTCCT 


CCTAGTCAGC 


CAGATGGGCA 


AATCCCATTT 


1740 


TTTCACGGAT 


CTTATCATGG 


AGATTTGCAA 


TCACAGCTGG 


TTTTTCTACT 


TTCGGAGCAT 


1800 


CCTCATGAAG 


AAGCCAAGTT 


TTAGCCCAAT 


GTGTCTCTGA 


TACTGAGAAT 


TGAGGAGCTT 


1860 


TTTGTTCGAA 


GTCAATCTGC 


ATTGCGTAGT 


CAGAACGCAA 


GGCAAAAGCA 


TCCCCTTTCA 


1920 


GGTCAGTATA 


AAGTGACGGA 


GGTGTTCCTG 


GGATTGACTA 


AAGATCCCCT 


TTATCATCAG 


1980 


CAAGCTGAGG 


CAAGCTAGAC 


AAGAGACTCC 


ATGTATATGG 


ATGGCGAGGG 


TCATAGAAGA 


2040 


CTTCCTCAAC 


CGTTCCATAC 


TCAACGATTT 


CTCCTGCATA 


CATAACCGCT 


ACCTTATCCG 


2100 


CAATACTTGC 


CACCACACCA 


AGGTCGTGGG 


TAATAAAGAT 


TGTTGTGAAA 


TGATACTCGT 


2160 


TTTGTAAAGA 


TTTTAGCAAA 


TCAATAATCT 


GAGCTTGAAT 


AGTTACATCC 


AAGGCAGTTC 


2220 


TTGGCTCATC 


ACAGATCAAG 


ACATCAGGTC 


GGCAGGCAAG 


GGCAATAGCA 


ATAACGATAC 


2280 


GTTGACGCAT 


TCCTCCACAA 


TATTGGAATG 


GGTATTCATT 


AAAACGTCTA 


TCTGCGTCTG 


2340 




L I rt\x ILATU 


TAC»TQ_AATGG 


CCAATTCTTT 


CGCTTCTTTA 


GCTGTTTTTC 


2400 


CTTGGTGTTT 


TACAATAACT 


TCTGTAATCT 


GACTACCAAT 


TGTTTTAATG 


GGGTCCAAAC 


2460 


TAGTCATTGG 


GTCCTGGAAG 


ATAGTCGCAA 


TCTTAGCACC 


ACGAATTTCT 


tCCCAATCCT 


2520 


TGTGAGAAGA 


TAAAGCTGTC 


AAGTCCTGAC 


CACGCTAGTC 


AATACTACCT 


TGGGCAATAC 


2580 


GACCATTTTC 


TTCGAGCATA 


CCTGTGAAGG 


TCTTTGTCAA 


AACAGATTTA 


CCTGATCCTG 


2640 


ACTCACCTAC 


CAAGGCTAAT 


ACTTCTCCTT 


CGACTACTTC 


AAGGGAAACG 


CCGCGAATGG 


2700 


CTGTCAATAC 


TTTGTCACGA 


ACGTCAAATT 


CCACGACAAT 


ATCGCGAGCA 


GTCAAAATTA 


2760 


catttttttc 


TTTTGTCATT 


TCTACTCCTA 


TCTATGTGTA 


CGTGGATCAC 


TAGCATCCGC 


2820 
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TAAGTTTTGA CCAACTACGA AAAGGGACAA GGATACCAAG ACAAGGGTTG TCAATGGAAT 2880 

CCACAACAAG TAAGCATTGG TTGTTACGTT TTGTGAATAA TCCGAAATCA AACGACCCAA 2 940 

ACTTGGCACT GTAATCGGTA ATCCAAGACC GAAGAAAGAC AAGAAGGCTT CGTATGAGAT 3000 

AAAGCTTGGA AGCATTTGAG TCATGGTTGT CACAATAACA GATACCAATT GAGGCATGAT 3060 

ATTTTTGGCA ACAATCTTCA AGGTTGGTGT TCCCAAAGTA CGTGACGCCA AGTTGTATTC 3120 

CAAGTCACGA TAGCGCAAGA TTTGCACACG GATCATGAAG GCAATACCAA TCCATGTTGT 3180 

TACGCTCATG GCAAAAATCA GATTCCAGAA TCCAGCTCCG ATTGAGTAAG TCAAGACAAT 3240 

AACAATCAAA AGAGGTGGGA TGTTTGAGAT GACGTTGTAA ACTTCCATCA TGACACGGTC 3300 

AACTGATTTT GAAATACCCC AAATACCACC GACAAAAACA CCGATAACCA AGTTAATCAC 3360 

TGTCGCAATC ACAGAAATGA GGATGGAGTT ACGACCTCCG AACCAGACAC CGTCAAAGAG 3420 

CGATTTACCC TTACTGTCAG TACCGAACCA ATGCTCCGCA TTTGGCTTGA TATAACGAAC 3480 

ACTAAAGTCG TTTACCTTGC TGACATCATT GAAATCAAAC TTAGAAAACA TTGGGTAGAT 3540 

GAAACTTATC AAAATGATGG CTACCAAGAT TCCCAACATG ACTACAGTTG ATTTTTTCTT 3600 

CATAAATTGT TTAAACACTG ATTTCCAGTA AGAATATGCT GGCGCATCAA TAGTTTCAGA 3660 

GGCAAAATCG TCACGTTTTA CAAACTGAAA TTTTTCTTTA TCGATTGTAG ACATTATTTG 3720 

CCTCCTTTCT CAGTCAATTT AATACGTGGG TCAATAATAG TCATCCAAAT ATCTCCCAAA 3780 

AGACGTGAGA AGATAGAAAT ACATGTAAAG ATGAAGACAA GACCAACGAC CATAGAGTTA 3840 

TTAGATGCTT TTACAGAGTC AATCAACATT TTACCCATAC CTGGGAAGGC GAAGACTGTT 3900 

TCAGTAACGG TTGCACCACC GATAACCCCA ATAATGGCAG CAGGAATTCC TGAAACCACC 3960 

GGAACCATGG CATTTTTAAA GATGTGTTTG TTTGAAATTT CTTTTTCAGA CAAACCTTTT 4020 

GCACGAGCGA AACGAACAAA GTCTTGAGAT TGCAAGTCAA TCATGTAACG ACGAATCCAA 4080 

ATGGCTGTAC CAGGAGCACC CAACAAACCA AGGATGACTG CTGGTAAAAC GTAAGAACGC 4140 

CAATCTCCAG CTCCCAAGAT AGGGAATGAA TCTGGAAGGG CAATAGATGA TCCAATCAAT 4200 

CGAACGATGT AAACCAAGGC AATCGTTGGA AGAGCAAGCA AGAAGGTCAA AGCCCCTGTT 4260 

GAGAGGCTAT CAATCCAAGT GTTCTTGAAA CGAGCCATGG CTGAACCAAG TGGCACGGCA 4320 

AGAGCATAGG CAAGAACCAA ACCAATCAAA CCAGTAATAG CAGAGCTGAC AATCATAGAT 4380 

GGATATTGGT AATTACTTTC ACTCGCTGTA TAAGGATCAT CTTTCCCATA GCTACCTACT 4440 

TCACGAGAGT CAGCCTGACT AGGTGACTTG TAGGTTCTTG AGTAAATATT TACAGAAGAC 4500 

GTTTTCTTAC CTGTTGGGAA CTGAACTTGG GCAGTTTTGG TTTGTCCTTG ACCTTGAGTA 4560 
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ATAACCTGAA 


GAACTGGTGT 


ATTAGCATAG 


GTTGGGTAAG 


AGTCACCTAA 


ATTCAACTTC 


4620 


ACAAAGTTTT 


GATGAACAAA 


TGGGAACTGA 


CTGTTAAAGT 


ACAAGAGATA 


TTTATGTTTA 


4680 


GTTCCTGAAC 


CGACCAATGA 


CCATCCGATA 


GCTGGATCAT 


TTTCAAAACG 


AAGGTAGCGT 


4740 


TTCAAGTCTG 


GATTTTCAGG 


GTCTTGGATT 


TTATTTGTAT 


GGTCAATGTC 


AATCAAGTTA 


4800 


GCATAGAAGT 


GAAAAACACG 


TTCAAAAATT 


GGAATTTCAC 


GAGTAGCATA 


GAATTGACCA 


4860 


CTTTCAGTAA 


ATTCTCCCAA 


AGTCCAACCA 


TGACCTAATT 


GATTGATGTA 


CTTTTCATAA 


4920 


ATAGCTTTAT 


TGGTCGCATT 


TGCTTCTACT 


GTTACAGAAG 


AATCCATGCT 


ACTTGCCTTT 


4980 


TCTTGCAACT 


CTTTAGTATC 


GTAATACTCA 


ATGTAGCCCA 


TACGCTCAAA 


CACAGTATTT 


5040 


TCATAGTTAT 


CACGTTTATC 


ACCCGTTGTC 


GCAATTTTAT 


TATACTTAGG 


ATCCTGCTTG 


5100 


AAAATCAATT 


TTCGAGGAAC 


CAAGGTATAG 


ATAATCGTGT 


AGGTCAAAGT 


CGTTACTAAG 


5160 


AAAATCGAAA 


CCAATGACCG 


CAAAACACGC 


ATAAAAATAT 


ATTTTTTCAT 


ATTATTTCCT 


5220 


TTAAAAATCC 


CAAAAGAACC 


TTCTCCTCAT 


GGAGAGAAAG 


TPCTATTAGA 


AATTATTTAC 


5280 


TTCACATGAC 


TTGCCAATTC 


TTTTTGAGCT 


TTCTCATTTG 


ATTCAGCTTT 


TTCTTTCAAC 


5340 


CATTTTTCAC 


GAGCTTTTTC 


ATACTCTTCC 


TTAGTCACCA 


CTTTATCTTG 


TGATTTCAAA 


5400 


TATTTGAAGT 


AAACATCTGA 


CCCCTTAGAG 


CCTGTTTGCG 


CAGAAGCTCC 


AGTAAATGGA 


5460 


ACAATTCGTG 


AAAGCACTGG 


TGCTGCACCA 


GAAGAAGCCA 


TAGCAGGAAT 


AAAGAGTGAA 


5520 


CTATCTGTCA 


ACCATCCTTG 


AGCCGCTGCA 


TATTTTTCAT 


AACGGACATT 


CAAGTCGCTT 


5580 


GTCTCTCTGG 


CAGCTTCATC 


AACTAATTTA 


TCGTATTCTT 


TCAAACCAAC 


TTGAACTACT 


5640 


GAAGGGCTAT 


TTGGATTATC 


AAATCCTAAA 


TATGTTTTTG 


TAGTTTCACT 


GCTAGTTGTT 


5700 


TTTAAAATAT 


CCAGGTAAGT 


AGATGGGTCT 


TGATAGTCTG 


GCCCCCATGA 


AACTCCTCCT 


5760 


GATACATCCC 


AATCCTCAGA 


TGAAGCATTG 


GCAGCATAGT 


AAGTAATATT 


AAGGAATTCA 


5820 


TCACTTGTCA 


TTTGTTGAAT 


ATCAACAACG 


ACATTTTCAA 


CACCAAGAAC 


TGTTTCTACA 


5880 




nuunL 1 unrt i 


ACflAGATATG 


TAGTTTTTTG 


ATGCTTGGTC 


TACTGCAACG 


5940 


TCCAGATGAA 


TAGGAAACTG 


AACGCCGTCT 


GCTTCTAAAG 


CTTTCTTAGC 


TTTCGCAAAC 


6000 


TCTGCCTTGG 


CCTTGTCAGC 


ATTGAATAAA 


CCATCCTGCC 


CATCAGCTAA 


ATTCACACCT 


6060 


TTCCACTCAT 


CACCATAAGC 


AGGAAGTTGA 


GCAGCGACTA 


AATCACCAAA 


GGTCTTCTCA 


6120 


CCAGCTGAAA 


CAAAGTCTGG 


TTTTACAAAT 


AAATTACGAA 


CTGCTAAAGC 


TGCTCCATCT 


6180 


TTACCATTGA 


TTTGAGCTGA 


GTAAGCTGAG 


CGATCAAGAG 


CAAAATTCAA GGCTTGACGG 


6240 


AAATCTTTGT 


TAAGCAATGC 


CTTCTTAGTA 


GCTACTTTCT 


CTGAATCTGT 


AGTTTTAGAA 


6300 


GTATAGTTGT 


AACTTTGGCG 


ATCAATATTC 


ACACCCAGAC 


CAGCAATCCC 


AGAGCCTGAT 


6360 
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TGTGTGTAAT AGATATTGTC CTTGTATTCT TCTGCAACCT TAGAATAGTT GGACCTGGTA 6420 

GGGTAAAGAC GGGCATAACT ATAAGCTCCA CTAGTGAAGT TACGCTCTAG CGACTCCTGA 64 80 

TCTGATCCAT CATAGTAAGC TAGATTGATA GTATCTAGGT GGACATTTTC TTTATCCCAA 6540 

TATTGCTCAT TTTTTACAAA CTCTACAGAA GATTTTGCAG TCAACCCTTT CAACAAGAAT 6600 

GGACCATTAT AAAGCAAGGA TGTCGGATCT GTTGGTTTAG CAAAATCGCT TCCTTTTGAT 6660 

GTTTCGAATT CTTCATTCAG AGGCCAGAAA ATAGAATAGG TCAACTTAGA GTTCCAGAAC 6720 

GGTTCAGGCT GGTTCAAAGT GTATTGTAAC GTATAATCAT CAACCGCCTT GACACCAACT 6780 

GTTGAAAAAT CTGTTGAAGT TCCTGATAGA TAATCTGCCA AGCCTTTAAC CGAATTTTCA 6840 

GCTAAATACA TAGCTTCTGA TTTTTTATCT GCTGCGTGTT TTAAACCGTT CACGAAATCT 6900 

TTAGCCGTCA CCTCTGCATA TTCTTCTCCA TCAGAGGTAA ACCATTTAAC CCCTTTACGA 6960 

ATCTTATAAG TGTAGGTCAA ACCATCCTTA GAGACTTCCC AATCCTCTGC AACTGCAGGA 7020 

GCAAGATTAC CGTAATTATC GTTAGTGAAT AAACCATCAA TCCCATTTGA AGTCACTACT 7080 

GTTGTACTAT TTTTACTTGA AATCAGGTAC TCCAAGCTTT CTGGGTCTGC TGTATAAACA 7140 

TAGCCATAAG CTTTAGGGGC TGATGAATCA GATGATTTTG AAGAACTGCA TGCTGCAAGT 7200 

ACACCTGCTG CTAATAAAAC AAGACCTGCT GTAGCAAATA CACGATTTTT TTTCATTTTC 7260 

TACTCCTCTG TTTATGTGAA TTATAGATTG ACAACCATTA TATCACATTA TCCATTAAAA 7320 

ATCAAACAAA TTTTCAGAAT ATTTAGGCTT GTTGGCACAA ATTTTTCATT TTTTTTGAAT 7380 

ATATGATTCA AATTGTCGTT CGAAGTGTCA AAGACTACAG TGAAAATAGG AAATTTGACG 7440 

CAGAAACTTT GGACTTTAGG AAGACATACA GTAAAATGAA ATACGGACGG AACAATGTGA 7500 

TTTTGGAATT CAAATTAAAT TATAACAATA TTGTAGAAGT ATCATTCTAG TATTCAAGAT 7560 

TCAGTTTACT ATGTCTTTTC ACACCAACCT TATCCCGAAT TCAATTACTT TTGTGATTTA 7620 

CATATATAGA TTAAGACTAT CTTTTATACT TTAAAATTTC TCGCTACCTT ATCCACTATA 7680 

TGCTCCTCGC TATCACGTTT CTATTCATAG CCTACCATTT CACTATTGCT TTCTCTGACA 7740 

ATTCTTATTT CCTGCGTCAG ACTTAAAACG ATCTATCCCC AGACCATTTT AATCCGCTAC 7800 

CTCACGATAG TCAGGCTTGG GGAGCGCTAT TGTATTCACC GGTAGTGGAG CCCTACAGAG 7860 

GACTTACACC TCAGATGCAC GACATGCCCA TCGTATAAAA AATCTCCTAC CCAAGGTAGA 7920 

AGATTTCAAA CTTATAAAAC TTAATCCGTC ATGTCCGATA CCAACATTCG ATGCTCCAAT 7980 

GGAATACTGC ACATAACTAG CAAGAAAATA AAGCCTGACT GAATCCAGAA GAGAGCCAAG 8040 

TCAAAAATTC CGTGCACAGC AACCACTGTA AGGAAAGATA GATAAAGGCC GATAATCGGA 8100 
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CGTTTCCCCG ACTCCTGACT CATATCCATC ATCAAGCGAA CAGGAGCAAC AGAAGACAAA 8160 

ACTAATAAAA TAGTCCCCAC AATTCCGTAA CTCAGAATCG TATCAATATA AAGACTGTGG 8220 

GCATGTTCAT GATAAGGAGC ATGTATCCGA GGATAAGAGT TCATATAGGT CAATGGCCCT 8280 

TCACCCCAAA AAGGATTTTG CTTAAACAAG GCCATCCCAG CATCCCAGAT AGAAATGCGT 8340 

TCTTCCATAG AAGAGTCTAA AGTACCCATT CGAACTCCCA AATCACTAGA AAAGAGGAAA 8400 

CTCAAACCAA TCGCGAAGAC CCCAATACTA AGCCAAAAGG CCTTCCAGTT TTTAATAGTC 8460 

GTAAAGAGAT AGATAATTGC TCCAGCGATA ATAGCAGGAA AGGCAGTTCG ATTTTGAGTA 8520 

AAGTTCAAAC CAAAGAGATT AACAAAGCCT GCAATCACAC AGAATACTTT CAACCAATTC 8580 

AACTTGGTCG TTGTAAACAG ATAGAAAGCA ATCATAATAC AGAAACAACA AATAATTCCA 8640 

TAATAATTAG GATTAAAGAA GGTCACTTCT GCCCGGTTCT GATGCCACAC CTGCATATTG B700 

GGTGAAAGAA AAGCATAGTT AAATTTCTTC ACAATTTGGA AATGTTCTAA ACTGCCAAAA 8760 

GCAGCTGACA AG AC ACT AC C AAACAAGACA AACTGCAAAA TCAATCGAAA GAATTTATGG 8820 

GATAAAATCG ACTGATAGTG CAAAAAGAAA ATACTAAATA GAAACATTCC TACTGAAGCC 8880 

ACAAGACCCA TCCAATTTTG TGCAAGAATG GATATAACAG TACTATAGCT AAGAAAAAGA 8940 

AGCAGCATCG GATGCTCCCC CATTTTCTGA AGAATACTTT TCATGTCTCC TGTAAAAATC 9000 

AAACTGATAA TATATAAACA GAGTACAACT ACAAAAAGAT AAAAGGGTAA AAAGATACTC 9060 

AGGATAATTC CCAATAAAAT CAGCTCTTTA CTAGACAACC CCTTCAGCTT TTCAATAAAG 9120 

CCTATTGATT TCAAAATGAA TCCTTTCTCT CCAAATCAGC TGATTCAGAT AATAGTAAGC 9180 

TATCCTATAT TGTACCACTT TTTTAGCAAT TTGAAAACAA AGGAAACGTT TTCCAAAATA 9240 

AAAACCCTAT TTTATCCACC ATATCAAGGC TTCAAAATGA TACTTCAACT CCATTCTCAA 9300 

TTACCCGATA AGTCTGATTT TGCAAATCAA TTTCTACTAC TGCTGTTACG GACTTATCTT 9360 

TATTTTGACG TTTGATTACA ATGCTGTGAG CTGTTGGTGT CTCTATCTCA GTAGTCCCTT 9420 

CTAGATCAAA GGCTTCTGAA CGCTTACGGA AAGAAAATAG ATTGAGAAGG GCCTTCACAA 9480 

CAGGTCGTTG CACTTCTTTT GCTATTTCCT CGTTGCTATA GTAATGACGA TTAATATTTC 9540 

GACCTTCTTT AGTTTCTTCT AATAATTTCA AGTCATTCTT GCCTGCTAAT AGACCCACAT 9600 

AGTAAATCTG AGGAATACCT GGGGCAAAAG CTTGAATTAG ACGAGCGAGA AAATACTTGA 9660 
CATCATCATC TCCAAGCGCT GAATAGTAGG TTGAATTGAT TTGGTAGATA TCTAAGTTGT 9720 
TATACTCGGC ACTAGAGTAC TTACGTTTGA CATTGGCTCC AACCTTATAG AGTTCATTTG 9780 
AACCATAGTC AATCTCCTCA TCGGTCAGGA TATCCTTGAC ATCTACTACT CCAATCCCAT 9840 
CATGGGTATC TAGCGTCGTA AATTGCTTCA TCGGGCTCAT CTTTAACCAC TTAGCCAAAC 9900 
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GCTCTGTTCT GGAACTGTAA AGAGTATAAA GTGTCACCAT TGCAAGAGCA AAATCATAAA 9960 

CATAGTAATC ATGGTCTGCT ATTTTAAACT GAATCGAATA GTCTTCATGA ATCTCAGGTA 10020 

AAAGCTCTGT CCCATACTCA GCAGCGATAT CTCGAACTTT GTCCAATAAA TCCCAAATAT 10080 

CTGGTTCCAC AAAGAAATCA TTAGTATCCA ATTTCTTCAC TGCATAAGCA AAGGCATCTA 10140 

GACGAATCAA ATCACACCCA TTACTTGCCA AGTGCTGAAT GCTCTTACGG ATAAATTCCA 10200 

TAGTTACTTC TTTGGTCACA TCAAGATCAA TCTGCTCCTC ACCAAAGGTA TTCCACAAAT 10260 

GTTCCACTGA ACCATCTTCA AACACAATCT CTTGCTTTGG TGCACGATCC TTACGCTTGT 10320 

AAATTAAATC TACATCAGAC TGTGTCGGAC GGTTTTCTGG CCAAAACTTA TCCCAGTTTA 10380 

AAAAGAGAGC TTTAAATTCA CTGGCTTCAT GTTTTTCTTG ATAGTCCTTA TAATACTTGG 10440 

ATTGACGAGA AATATGATTA ATCATAAAAT CAAACATAAG ATAATATTTC TCACCTAAAC 10500 

GCTTCACATC CTCCCAATCA CCAAAAGCTG AGTCCACTTC GTCGTAGTCA ACTGGCGCAA 10560 

ATCCACGATC AACTGTTGAT GGGAAAAATG GTAAAAGGTG AACTCCTCCA ATAGCATCTC 10620 

CAAAATGCTC TTCCAAATTA TCATATAAGT CTTTAAGATT ATTTCCAAGG CTATCAGAAT 10680 

AGGTAATCAA CATGGTTTTA TTTTGAATTG GCATCATTAC TCTCCTTTTT CTAATTGAAG 10740 

CCAAGTCTCA TATGATCTGG CTTCATAAAT AAAATTCATT TTAAATCTCT ATTTATCATC 10800 

AAACTCGTAC TAATATAGAC TGTGATAAAC AAAGTACTAC TTTCTTGTTT TCTGCATAGA 10860 

ATTATCAACA AGCTAAACTC TTCCTCTGTG TCAAAGACTA TAGATTCCAT GAGCTCTTCT 10920 

TATACTCTTC GAAAATCTCT TCAAACCACG TCAGCTTCAC CTTGCCGTAC GTATGGTTAC 10980 

TGACTTCGTC ACTTTCATCC ACAACCTCAA AACAGTGTTT TGAGCAACCT GCGGCTAGCT 11040 

TCCTAGTTTG CTCTTTGATT TTCATTGAGT ATTACTTCAC TGCCCCGTTG CTCATTCCTG 11100 

AAATGATATG GCGTTGGAAG AAGAGATAGA CAATGGTGAT ACTGATAATG CCGACCACGT 11160 

AAGAGGCAAA GCTTGGTCCG TAGTCGTTGA AATATTGGCC TGCGTAGTTG TATTGGAACA 11220 

AAGGCAGAGT CCACATTTTG GAATCCCGGT TCAAGACAAG GAGTGGCAAC ATGAAGTCAT 11280 

TCCAGAACCA AAGGGCATTG ATGATCATGG TTGTCGCATG CATCGGTTTC ATCATTGGGA 11340 

AGATGATGCG GAAATAGGTT GTAAATTGAT TAGCCCCATC GATCTCTGCT GCTTCATCCA 11400 

GACTTTCTGG AATCGAGATT TTGATATAGC CAACATAGAG AAAGAGGGTC TGTGGAATCG 11460 

CATAGGTCAA GTAGAGCAAG ATCAAACCAA AGGTATTAGC CAAACCGAGT TTACTCATCA 11520 

TAACCGTAAT CGGAATCATG ATGACTTGGA AAGGTACGAA GATTCCGAGG ATTAAGAGGG 11580 

TATACATGAT GGTAAAGGCT TTTCTTTTAC TCATATTGCG AGCGATGGAG TAGGCTGCCA 11640 
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TAGGGATAAA 


GATCATTACT 


GC AAGTAAAG 


ACAAGACAGT 


GATGACGACA 


GAGTTCCAAT 


11700 


AATAGCCTCC 


AATCCCATCA 


GCTAAGAGAC 


GGCTAAAGTT 


GTCCCATGTG 


AAGTTGGTTG 


11760 


GAAAGCCAAA 


GAAATTATCT 


ACAATATCCT 


TAGTGGGTTT 


GAAGGAACTA 


AAGAGGGTAG 


11820 


CAAGGAGCGG 


CACTAAAATC 


AGAACCGATC 


CTAGAATCAA 


TAGAATGTAT 


TTGCCAATCA 


11880 


GGGCTTTTCT 


TTCATCTTGT 


TTCATCATGC 


TTCTCCTCTT 


AAATTTCAAA 


TTTCTTAGAT 


11940 


ACTCTCAATT 


GGATGATCGA 


AATCACTACA 


ATTAAGAAGA 


ACAAGATTAC 


GGCAATGGCA 


12000 


TTGGCATAAC 


CGAATTGGTT 


GTTTTTAAAG 


GCATAGTTAT 


AAACCAAGAG 


CCCAAGTGAG 


12060 


GTTGTGGCAT 


TGTCTGGACC 


ACCACCGGTC 


ATGGCAAAGA 


CTTGGTCAAA 


GGCAGTCAGC 


12120 


CCACCTTTTA 


GGGCTAGGAT 


AAAGACCATA 


GAGACACTTG 


GTAGCAAGTA 


AGGCAATTCA 


12180 


ATGTTCCAGA 


AAACTTGCTT 


GCTAGTCGCA 


CCATCAATCC 


TTGCTGCCTC 


TGTAATCTCA 


12240 


GTTGGAATAG 


ATTGCAAACC 


AGCTAGGAAG 


ATGATGATGG 


GCATAGCCAC 


CCCTTGCCAA 


12300 


AGAAGGACAA 


AGACAGCCGC 


AAAGATTGCT 


CCCCACTTAG 


TCCCTAAAAG 


ACTGGTTTGG 


12360 


AAAAATTCAA 


TATGAAGGGC 


ATTTCCAATC 


GCTGGAAGAC 


CGTAGTTGAA 


GACTTGCTTG 


12420 


AAGATCAAAG 


CCACTGTCAA 


ACCAGATAAA 


ACAGCTGGGA 


AGAAGAACCA 


AGCACGGAAG 


12480 


AAGGTTTGGC 


CTTTGATTTT 


AGAATTCAAG 


ACACGCGCAA 


TGAAGATCCC 


GAGTGCAATC 


12540 


TCACCAACCA 


CCATGGCAAT 


CGCAATGATT 


GCGGTAAAGC 


CAATCGCATT 


CATGAATTTT 


12600 


GGATCCATGA 


AGAGGAGCTT 


AAAGTTGTTT 


AAGCCAACAA 


ATTTGTAGTT 


ATAAGTCAAT 


12660 


CCTGTCCAGT 


TGGTAAAACT 


GTAAAAGGCT 


CCTTGAAACA 


TCCCCACATA 


GAAGAAAATT 


12720 


GCTTGTAACA 


AGAGGGGGAT 


GACCACAAAA 


GCCCATGCCC 


AATATTTTTG 


TAATACTTTT 


12780 


TTCATAGTCT 


CTCTACTCCT 


AATCCACATC 


CGCTTTCATC 


GGGTTAAAGA 


AGGCATTCAA 


12840 


ATCATTGACC 


ATGCCTTGTT 


TATCACCGGT 


CAAGACATAG 


TTCATGGTCA 


AGGTATGGAA 


12900 


GTCTGCTTCA 


CTGGTCCAGT 


ATTGTTGCAA 


CCAGACCAAG 


TGACGATCCG 


TAAAGGCATA 


12960 




CCAGCAAGCG 


GTGAATCTTC 


TCCTGCTTGT 


TTGACCCCTT 


CGATCGCTGT 


13020 


TGGAGATCCG 


TCCACATCGT 


AGTATTTTTG 


CATGACTTCT 


GGACGGGTCA 


TATATTCCAC 


13080 


AAAGGCATTG 


GCTTCTTTTG 


GATGTTTGGT 


GGTGGCTGAG 


ATAGACCATG 


CCAAGTCTCC 


13140 


CGCACCAACG 


GTTAAGCTTT 


GTCCTTTTTC 


TTTTCCTGGA ATCATGAAGG 


TCCCAATCTT 


13200 


AAAGTTCGGT 


ttttgttcat 


TAATCGCTGT GATCGCCCAA 


GACCCATTTG 


GTGTCATGAG 


13260 


GACATCCCCA 


CGTGCGAAGG 


CTCCGATAAC ATCGGTATAG 


CCAGCACCTT 


CCCAGTTCTT 


13320 


TTGCTTAGAT 


CCATTGATGC 


GAAGGATGTC 


CATGACCTTG 


ATATCATCTT 


TCATAATCGG 


13380 


ATCCGACAAT 


TTAATGGCAT 


TTGGTTGAGA ATAACGAAGG 


TATTGATTTG 


CTTCTTTTCC 


13440 
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TCCACCTGTT GCTGTCGCAA AGGCTAATTG ATTGTAACCA TTGAGTGTCC AAGCATCTGC 13500 

ACCTCCAATT CCAAATGGTG TTTGTCCTTT AGCAACGATA TCTTTGACTA ACTGTTCAAA 13560 

TTCATCCCAG GTTTCAGGAA CCTTCAAGCC CAGTTCTTCG AATTTATCTT TGTTGTAGTA 13620 

AATTCCATAA GCATTAGCTG TAAAAGGAAC GTTGTAAACT TTTTCGTTTA CAGCATATTT 13680 

TTCAGCGTAG CCATTTTTCA CGCGTTTCAG GTAGTCTTTG TTGCTCAAAT CTTCAAAAAC 13740 

ACCTGCTTTT GCCCATTCTT GCAGTTCGAT GGACTGTGGG TAAATATTGA CCACATCAGG 13800 

CACATCTCCT GCGAGAACGC GTGTCTTCAA TACTTCACCA GCATTTGGTA CATTGACGAC 13860 

TTTGACCTTG ATCTTAGGGT TTTCCTTCTC AAAATCACGA GTGATTTCTT CCAAGGTTTT 13920 

GGTCATTTCT TTTTTCTGGT TGAAATACTC GATGGTCACT GTGCCATCCG CAGATTTACC 13980 

ATAGTTGGAG CAAGCGCCGA GCCCAAACAA AGCTAAACCT GTAGTTGCAA GAAGTCCGAT 14040 

TTTTTTATAC CATTCCATTA GAAAGCCTCC TTTATAAATT TATACACCCT TATTGAACTG 14100 

CACCCCAAAA GTTAGACAGA ATAAATCTAA CTTTTGGGGT CAGTACATAT CATAGTTTTC 14160 

TAAAAATATA CTGTCTACTC AAAAAATCTC CTTGGGATAA GATAACAGTT AAGCCCGCAT 14220 

ACATTAGTTC TGCACCTGAG TAAACTTCGC CATTTTCCTG TAATTTATAT AGTCCCTCTT 14280 

CATCCAAATC TTTTAATTTT AAAGTTGTTT CCATGGTCTC TACAACAGAT AAAACGCGAA 14340 

CGTAGGTTAC AATCGTTTGA TTTCCGTAAT TAAATTGTAC AGCTGCTTCA TTGGATACAG 14400 

TATCAGGATT AATTAGTCTA TACTGCTGTC CTAACTGAAC TACTGGTCGT AATTCTTTAT 14460 

ACAAGTTCAC CTGATTAGCA ATCCTAGCTT TCTCTTCATC TGATAAATTT GTCAAATCAA 14520 

GTTCATAGCC CAAATTTCCC ATCATTGCTA CAAGGCCACG TGTTTCTAAT CGTGTCATTC 14580 

GTCCCATCTG ATGATTCGGT ACTGCTGACA CATGAGCCCC CATAGAAATG GTTGGATAGA 14640 

GATAGGATGA ACCGTATTGA ATTGGTAAAC GTGCAATGGC ATCAGTATTA TCACTAGCCC 14700 

AGACTTGTGG GAAATAGCGC ATCATACCAA GATCATTTCG TCCACCACCA CCAGAGCAGG 14760 

ACTCAAAGAG AATATGGCTG TGCTTCTCTG TCAGATAAGA AACGAGTTCA TAAAGCCCCA 14820 

GCATGTACTG ATGAGATTGC ATCTGTGTCT CTAGATAAGT TAATCCATTC CCTAGCTTAG 14880 

TGATATTGCG GTTCATATCC CATTTAATGT AATCAATATC ATGATAAAAT AGGAGTTGAT 14940 

CTAAGACACT TTTCAAGTAT TCTACTACCT GAGGATTGGC AAGATTAAGT ACTAATTGAT 15000 

TCCGAGAATA AGTATGCTCA TAGCCAGGAA CCTGAATAGC CCAGTCAGGA TGTTGACGAT 15060 

ACAAATCACT ATCTACAGAA ATCATTTCGG GTTCTAACCA AAGTCCAAAC TGCAAACCTC 15120 

TTTCATGGAT AGCTGAAATC AGACTTTCTA GACTTCCACC CAGTTTTTCC TCATTAACAA 15180 
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CCCAATCACC 


TAAAGCACGA 


TTATCATCAA 


AACGATTGCC AAACCAACCA 


TCATCTAATA 


15240 


CAAAAAGTTC 


AATGCCAACT 


TTCTTAGCTT 


CATCTGCTAA 


CTCTAACAGT 


TTTTCTCTCT 


15300 


CAAAGTCAAA 


GTAAGTAGCT 


TCCCAGTTAT 


TGATTAGAAT 


TGGACGTTCT 


TTTTTAGAAA 


15360 


ATTCACTTAG 


CATAATGTGC 


TTCAGTACAA 


AATTCTGACT 


TTCATGACTA 


ATACCAGTTA 


15420 


ATCCCTGATC 


TGAATGAGTC 


ACTAAAGCTA 


CCGGTGTTTC 


AAAGTATTCC 


TCAGGAGCTA 


15480 


ACTTCCAAGA 


AAAGTTTTCT 


GGATTAATGC 


CAATAGCCAC 


CCGAACTTCA 


TTCAATTGAT 


15540 


TTTTTTGAAC 


AAAAGCTTCA 


AAGTTGCCAC 


TATACATTAG 


TTGAATAGCA 


AACACATTCC 


15600 


CAGCATCCTC 


TGTGACTCCT 


TGTTCGCATA 


GTAGAAGAGC 


TGGTGTTTGA 


GCATGACCAG 


15660 


AAGCACCTCG 


GTTTGAACTA 


ATCGAAAAGA 


TTCCTTGTTC 


TACCTGTTGA 


CGTCTAACAG 


15720 


TCTTTTCACG 


AGCATAAGCA 


CCCTGCAGAG 


TTACTATTTC 


GTAATCTGCA 


GCTGGAAAAT 


15780 


CAGCCATAAA 


AGAAAAATCT 


TTATGGATGA 


CAACTTCCTG 


ATTACTATTA 


TTATCTAATT 


15840 


TACTGTAGCT 


AGCAATAGTC 


GCATCATTAT 


TAAAAGTAGT 


ATAATACAAA 


GTCAGACTAA 


15900 


GTTGAGCCTT 


AGAATCTTCT 


AACATTAAGA 


CAAGAGTCTC 


TGTATCGTCC 


ATGCTATGTG 


15960 


GAGAAGGTAA GCCCTGTGGA 


CCATTCTGAC 


CTTTTAAAAT 


CTTTGCTTCT 


ACAAATCGAA 


16020 


AGTCTGTTAC TTCAGTTACA CTATGCTGAA CCTGTATGGT 


TGGTTTCCTA 


AAATCTCCTA 


16080 


AGCCATGTTG 


TCCAAAAATC 


TGTCGCTGAG 


TATCTAAACT 


AAAGGTTCGA 


TTAGTAGCCG 


16140 


TTGGATTTCC 


TGAAAAGGCA 


TGGTCTCGTT 


CATAAACACT 


ATTGGAACCT 


TTATAGTTCT 


16200 


TAATAGTCTT 


TCCTAAATGT 


TTCAAAAGTA 


AGTAGCCATT 


TCGATTTTCA 


ATAATCAAAC 


16260 


TTAGATTTTT 


ACTCTCAACA 


TAAAATAGAT 


TATTCTCTAT 


CCTAACTCCC 


ATTTACTTCA 


16320 


CCTCATCACT 


TTATTGATTA 


TATTTTATCA CCTGAAATCG 


CTTTCCAAAA 


TAGAAAAATG 


16380 


TCTCAAGAAT 


ATGGTAAAAT 


GTTAGGTAGG 


AGGTAGCACA 


TGTTAGTTTT 


TTCAGAATAC 


16440 


CAGACTGGAA 


CAATCGACCT 


TGCCCTAAGC 


TTTTATGGAT 


ATGAGGAATG 


CACACCTAAT 


16500 


TACTCTTTTG 


GTCCAGCCAT 


TCGTGATACA 


TACGTTCTAC 


ATTACATTAC 


TAAAGGACKA 


16560 


GGAAAATTTC 


ATTACAAGGG 


TAAAATTGTT 


GATTTAAAAG 


AAGGAGATTT 


CTTTCTATTA 


i6620 


AAACCAGAGG 


AACTAACCTT 


TTATCAAGCA 


GATAGTAAAG 


AACCTTGGGC 


CTACTACTGG 


16680 


TTAGGAATCA 


CTGGAGGGAA 


AGCCCCTGAT 


TATTTTGCTC 


TTTCCCAAAT 


TTCTGATCAA 


16740 


TCCTATCTCA 


TCCAATCTGA 


AACTTGTCAT 


ACCCAGACTA 


CTGCAAAACT 


CATCTCAGAC 


16800 


ATTGTCCGCT TCGCTCAGAT 


TACAAAATCA 


AGTGAATTAG 


CTCAACTCCA 


TATCATGGGA 


16860 


CAACTTCATG 


AACTGATGTT 


TCATCTGGGA ACTATTGCTC 


CCAATCAGAA 


AAAAAAGAAT 


16920 


ATTTCATCAA 


CCCACCAACT 


CTATCTTGAA 


TGCAAACGAT 


TAATTGATAG 


CCACTATCCT 


16980 
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CAATCACTTA CAATTCAAGA TTTAGCAAAA GAACTATCCG TTCACAGAAG CTACTTATCA 17040 

AGCCTATTCA AAGAATTTAA TACCTTATCA CCCAAAGAAT ACCTACTCTA CGTTCGAATG 17100 

CACCGAGCTA GACAACTTCT CGAAAATACC CAAGAGTCCA TCAAGGTAAT TGCATACTCG 17160 

GTAGGTTTTT CAGATCCACT CCATTTTTCG AAAGCTTATA AACAATACTT TAATCAGACT 17220 

CCAAGTCATA CAAGAAAAGA ATACTCTCAA TACCAACTAG TAAGAAAGGC AACATTATGA 17280 

AATCCTACCA AGCTGTCTAC CAAATCCTAT CTAAAGAAAC CGACTATATC AGCGGAGAAA 17340 

AAATCGCAGA AAAACTATCC CTAAGCCGAA CAGCAATTTG GAAAGCCATC AAGCGACTAG 17400 

AACAAGAAGG CATTGAAATT CATAGTATCA AAAATAGAGG ATATAAACTG ATGAATGGTC 17460 

ACCTTATTCT TCCAGAGATT CTAGAAGAAA ATCTTCCAAT TAAAGTCAGC TTTAAACCCG 17S20 

AAACAAAATC AACACAACTA GATGCAAAAG AAGCAATTGA TTTAGGCCAT GAAGCAAATA 17580 

CCCTCTATCT AGCTTCCTAT CAAACAGCAG GCCGAGGCCG TTTTCAACGT TCCTTCTACT 17640 

CACCACAAGG TGGTATTTAT ATGACACTCC ATCTTAAACC AAATCTCCCC TATGACAAAT 17700 

TACCATCCTA CACACTACTT GTAGCTGGAG CTGTCTACAA AGCCATTAAG AACCTAACTT 17760 

TAATAGATCT CGACATAAAA TGGGTCAATG ATATCTATCT AAACAATCAT AAAATTGGAG 17820 

GAATCCTTAC TGAAGCAATG ACCTCTGTAG AAACTGGCTT AGTCACAGAT ATCATTATTG 17880 

GAGTAGGTAT CAATTTCACT ATTAAAGACT TCCCTCAGGA ATTAAAAGAA AAACCTGCCA 17940 

GCTTATTTAA AGCTACAGCT CCTATAACAA GGAATGAATT GATCATAGAA ATCTGGCGTG 18000 

CTTTCTTCGA AACACCAGCA GAAGAGCTAT TATACCTATA CAAAAAACAG TCATTCATTC 18060 

TAGGAAAAGA AGTCACTTTC ACACTAGAGC AAAAAGACTA CAAGGGACTT GCTAAAGACA 18120 

TCTCAGAAAA TGGAAAACTT TTAGTTCAAT GTGATAACGG AAAAGAAATC TGGCTAAATA 18180 

GTGGCGAAAT TTCTCTCAAT AGTTGGAAGT AAAATAACAC AATTATAATA TAAACGATAT 18240 

AAAAATAACT TCAGATTAGT AATTCAATTA AGTTTTACGG ATCTGAAGTT TTATTGGCTC 18300 

TAAAAATAAA AAAGAGAGTT ACAGACTCTC ATTAAAACGG AGAATAAGGG ATTCGAACCC 18360 

TTGCGCCAGT TACCCGACCT AACGATTTAG CAAACCCTCC TCTTCAGCCT CTTGAGTAAT 18420 

TCTCCAATTA ATGGGCACGA GTGGACTCGA ACCACCGACC TCACGCTTAT CAGGCGTGCG 18480 

CTCTAACCAC CTGAGCTACG CGCCCAAGTT AAAAAACTTG GTAATTTGAA CAAAGTTCAA 18540 

AGCGGGTGAC GAGAATCGAA CTCGCGACAA CAGCTTGGAA GGCTGTAGTT TTACCACTAA 18600 

ACTACACCCG CATAAATACT ATCAATAAAA TGGCGCGAGA CGGAATCGAA CCGCCGACAC 18660 

ATGGAGCTTC AATCCATTGC TCTACCAACT GAGCTACCGA GCCTTATTGC GGGAGCAGGA 18720 
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TTTGAACCTA CGACCTTCGG GTTATGAGCC CGACGAGCTA CCGAGCTGCT CCATCCCGCG 
TTAATAATAT AAAAGGAGGA TGTGGGATTC GAACCCACGC ACGCTTTTAC ACGCCTGACG 
GTTTTCAAGA CCGTTCCCTT CAGCCGGACT TGGGTAATCC TCCAATATTC AAATGGACCT 
TGTAGGACTT GAACCTACGA CCACTCGGTT ATGAGCCGAG AGCTCTAACC AGCTGAGCTA 
AAGGTCCGAC AAGATCATTA TAGCGGCGAA GGGGATCGAA CCCCCGACCT CCCGGGTATG 
AACCGGACGC TCTAGCCAGC TGAGCTACAC CGCCATGAAT CGGGAAGACA GGATTCGAAC 
CTGCGACACC TTGGTCCCAA ACCAAGTACT CTACCAAGCT GAGCTACTTC CCGAGTTAAA 
TAGAAAAATG CACCCTAGAG GAGTCGAACC TCTAACCGCC TGATTCGTAG TCAGGTACTC 
TATCCAGTTG ACCTAAGGGT GCTCCATATT ATGCCGAGGA CCGGAATCGA ACCGGTACGA 
TCGTTACCAA TCGCAGGATT TTAAGTCCTG TGCGTCTGCC AGTTCCGCCA CCCCGGCCTC 
TCTAAGCGAA CGACGGGATT CGAACCCGCG ACCCCCACCT TGGCAAGGTG GTGTTCTACC 
ACTGAACTAC GTTCGCACTG TTTTCTTCTA TCTAAAAATG CCGGCTACAT GACTTGAACA 
CGCGACCCTC TGATTACAAA TCAGATGCTC TACCAACTGA GCTAAGCCGG CTCATTTGTT 
ATATCTTAAT GCGGGTTAAG GGACTTGAAC CCCCACGCCG TTAAGCGCCA GATCCTAAAT 
CTGGTGCGTC TGCCAATTCC GCCAAACCCG CATATATGAC CCGTACTGGG CTCGAACCAG 
TGACCCATTG ATTAAAAGTC AATTGCTCTA CCAACTGAGC TAACGAGTCT AAAATAACTT 
GCGTTACCTT AAACGGTCCG ACGGAATCGA CCCGGTAC 
(2) INFORMATION FOR SEQ ID NO: 100; 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
19680 
19718 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
CCCTGGAAAA GTCTGGATAG TGAATGGTCT TCACACAATG ACCTGAAAGA 
TAATTATGGA GAGTAGCATT CTGAGAGGTG TTACCAGAAC CATATGACAG 
AGAGGGAATA TTGAGGAGAA AAATCCTGAG CCTACCAGTT GGAGTTGGAA 
GTTAGATCAT GGTTTATTAT CCACAACCTG TGGATAACTT TGTGAATAAG 
AAAGAAGGAG ATATATAACG ATGAAGAAAA TCAAACCGCA TGGACCGTTA 
CTCAGCTAGC TTATCTGGGA GATGAACTAG CAGCTTTTAT CCACTTCGGT 
TTTATGACCA AGAATGGGGG ACTGGACAGG AGGATCCTGA GCGCTTTAAC 



AGCCTGAGAA 
AGCTGTTTGA 
AGAGCTGACT 
AGAAGTTGCT 
CCAAGTCAGA 
CCTAATACCT 
CCGAGTCAGT 



60 
120 
180 
240 
300 
360 
420 
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TGGATGCGCG 


TGAGTGGGTT 


CGTGTGCTCA 


AGGAAACGGG 


CTTCAAAAAG 


TTGATTTTGG 


480 


TGGTCAAGCA 


CCACGATGGC 


TTTGTCCTTT 


ATCCGACAGC 


TCACACACAT 


TATTCGGTTA 


540 


AGGTCAGTCC 


TTGGAGGAGA 


GGAAAGGGCG 


ACTTGCTCCT 


TGAAGTATCC 


CAAGCTGCCA 


600 


CAGAGTTTGA 


TATGGATATG 


GGGGTCTACC 


TGTCACCGTG 


GGATGCCCAT 


AGTCCCCTCT 


660 


ATCATGTGGA 


CCGAGAAGCG 


GACTACAATG 


CCTATTATCT 


GGCTCAGTTG 


AAGGAAATCT 


720 


TATCAAATCC 


TAACTATGGG 


AATGCTGGTA 


AGTTCGCTGA 


GGTTTGGATG 


GATGGTGCCA 


780 


GAGGAGAGGG 


CGCGCAAAAG 


GTTAATTATG 


AATTTGAAAA 


ATGGTTTGAA 


ACCATTCCTG 


840 


ACCTGCAGGG 


CGATTGCTTG 


ATTTTTTCAA 


CAGAAGGCAC 


CAGTATCCGC 


TGGATTGGCA 


900 


ATGAACGAGG 


GTATGCAGGT 


GATCCACTGT 


GGCAAAAGGT 


GAATCCTGAT 


AAACTAGGAA 


960 


CAGAAGCAGA 


GCTGAACTAT 


CTTCAGCACG 


GGGATCCCTC 


GGGCACGATT 


TTTTCAATCG 


1020 


GAGAGGCAGA 


TGTTTCCATC 


CGTCCAGGCT 


GGTTCTACCA 


TGAGGATCAG 


GATCCTAAGT 


1080 


CTCTCGAGGA 


GTTGGTCGAA 


ATCTACTTTC 


ACTCAGTAGG 


GCGAGGAACT 


CCACTCTTCC 


1140 


TTAATATTCC 


GCCGAATCAA 


GCTGGGCTCT 


TTGATGCAAA 


GGATATTGAA 


CGACTTTATG 


1200 


AATTTGCGAC 


CTATCGCAAT 


GAGCTCTATA 


AAGAAGATTT 


GGCTCTGGGA 


GCTGAGGTAT 


1260 


CTGGTCCAGC 


TCTTTCCGCA 


GACTTTGCTT 


GTCGCCATTT 


GACAGACGGC 


CTTGAGACCA 


1320 


GCTCTTGGGC 


AAGCGATGCA 


GACTTGCCCA 


TCCAGTTAGA 


ACTCGACTTA 


GGTTCTCCTA 


1380 


AAACTTTTGA 


TGTAATTGAG 


TTAAGAGAAG 


ATTTGAAGCT 


AGGGCAACGA 


ATCGCTGCTT 


1440 


TTCATGTGCA 


AGTAGAGGTG 


GATGGTGTCT 


GGCAGGAGTT 


TGGTTCGGGT 


CATACTGTTG 


1500 


GTTACAAACG 


TCTCTTACGA 


GGAGCAGTTG 


TTGAGGCACA 


GAAGATACGT 


GTAGTCATTA 


1560 


CAGAATCACA 


GGCTTTGCCT 


TTGTTGACCA 


AGATTTCCCT 


TTATAAAACT 


CCTGGATTAT 


1620 


CAAAAAAAGA 


AGTTGTTCAG 


GAACTAGCAT 


TTGCAGAAAA 


AAGCCTAGCT 


GTGGCAAAGG 


1680 


GAGAAAATGC 


CTATTTTACA 


GTTAAGCGCA 


GAGAATGTAG 


TGGTCCTTTA 


GAACCTAAGA 


1740 


TTTCGATTCA 


ACCGGGGACA 


GGTGTCCATG 


GTGTCGCCTA 


TCAGGATGAG 


ATTCAAGTCC 


1800 


TTGCGTTTCA 


AACTGGTGAG 


ACTGAAAAAA 


GTCTGACGCT 


ACCAACCTTC 


TATTTCGCAG 


1860 


GAGATAAAAC 


CTTGGATTTC 


TATCTGAACC 


TAACGGTGGA 


TGGTCAGCTT 


GTGGATCAAC 


1920 


TTCAAGTCCA 


AGTTTCATAA 


AAGAAGAACC 


TTTGCCCGAT 


GCAAAGGTTC 


TTTTGGTTAT 


1980 


TAGTGACTTG 


GTAACCAGCT 


GAGGGTGAAA 


GTTAGTTGTT 


CAGCTTTTAA 


GAGGTCTTGG 


2040 


TGTTGAATAG 


TTGATACGAG 


TGTTTTGTCC 


AGTCGGCATT 


CTTTGACAAA 


GTTAAAATGG 


2100 


TTGTGGTTTT 


GTTTAGTATG 


GATATCCAGC 


CATTTATCTT 


CTTTAGCGAG 


GTAGACTCGT 


2160 



WO 98/18931 



PCT/US97/t9588 



764 

AGATGGTCAA AGAGAGGGAT TCCGAGGTCA TAGCTTGGTT TTCCTGGACA GGTTGGATAA 2 220 

AATCCGAGAG CTGACCAGAT GTACCAAGCA GAGAGACTAC CATTGTCTTC ATCTCCAGGA 2290 

TAGGCTTCCC AACTTGGGTG AAAAGCTTTC TGACGGACCG TCTTGATAAG AAGGGCAGTG 2340 

TAGTCAGGGT AATCGCTGTA ACGGAAGAGA TAAGGAATGT GGAAACTAGG CTGGTTGGAA 2400 

ATGGCTATTT GTCCAAAAGG AGCAGTAGCC ATCTCGCTCA TTTCGTGAAT TTCGTAACCA 2460 

TAGCCTGTTG TTTCAAAGAG GGGAGCATCT TGACAGGCTT TCAAAAGATA GTTGCTAAAG 2520 

GTTTCTTTTC CACCCATCAG TTGGATTAAG CCAGGGATGT CGTGGAGAAC GCCTAAAGTA 2580 

GCTTGAATGG CAGAGCATTC AGCGTAGTCT CGCCCCCAAC TATAAGGAGA GAAGTCAGGG 2640 

TGAAAGTTTC CTTGATTGTC TCGTGCTCGC ATGTAACCTG TCTCAGCGTC AAATAGCTGG 2700 

CGGTAATTTT GTGAAGCAGC CTTGTAGGTT TCAGCGATTT CTATGTTCTC TAGTTTTTTG 2760 

GCACAGCTGG CGATACAAAA GTCACTATAG GCATAGTCTA GAGTATGGCT AACACTTTCG 2820 

TGGTGGTCGG TAGAGAGGTA ACCTACTTCT TGGTATTGGG CTAGTCCGTG GCGGCCATTG 2880 

ATGCCGAGAG GGTCGGCTTT GCTGGCTGTT TCGAGCATGG CTTGGAAGAG TTCTCCTTCT 2940 

AGGTCGGGGG TCATGTCCTT GCAGGCGCTA TCTGCGATAA TACCGTCTAA AAGTGTACCT 3000 

GGCATCATAC CCCGTTCATC TGGAGCCAGC CATTTTGGAA GGAAACCAGT ATCGCGGTAG 3060 

CTATTGAGGA AACCTTCTAA AAAGCGTTGA TAGTGCTCCG GTATGATAAG GGCAAAGAGG 3120 

GGGAAGGTGG TGCGGAAGGT ATCCCAGAAA CCATTGTTGC TAAAGAGGAC ACCAGGCTTG 3180 

ACAGTACCAG TAGCCAGATC CATGTGGATG GCTTCCCCTG ATTCATTAAT CTCATAAAAA 3240 

GTCTGTGGGA AGAGGAAGAG TCTGTAGAGG CAGTGGTCAA AGAAGGTTCG GTCAGCCTCT 3300 

CCTGTCTCTA TAATGTCAAA ACGATGGAGG AGATTTTCCC AATCCACTTG GGCACTTCAT 3360 

TTACAGCTAT CAAAATCTTC TTGAGGTAGA TTGATTAGAG CTTGAGAAGG AGAGATGAAA 3420 

GAAGTGGCTA GTTGCATCTC GGTTTGACTA CTTGCTAAGT CAATTCGCCA GTCTCCAGCT 34 80 

TCTTGGCTGA TAGCAAGAAT ATCCGTGTTC ATTTGCAGGG CAGTGAACAT CGTTAGCGAA 3 540 

TTTTTGTTAG TTTCAGTTTT ACCTTCTTGT CGCAGGGCAA GAGTCCGCTT ATCTACTTGC 3600 

TCTACTGTCA GTTCATCTGC TGCGTGAAGA TAGAGGGAGA GGGCTTTGCC TTGCTTTTGA 3660 

TTCAAACGAA TAGAAGCACC ATAGCAAGTC GGTGTGAGCT GGGTTTCAAT CTGATAACGC 3720 

AGAGAAAAGA GCTTCAAATA GTGAGGCTGG AAGCAAGCTT TATCTATATC ATAAGAAGAC 3780 

TGGCGGTGAA AGAGGCTGTC TCCCCCCACT TGACTGGTGA CAGGTGTCAG AAGGAGCCAA 3840 

GAGTAGTCCC CAATCCAAGG ACTGGGCTGG TGAGTTAATC GAATCCCCTG AAAGATAGGC 3900 

AGATGTGGAT CAAAAAACCA AGATCCATCC TGGTCACTGG TCTGGGGCAC AAAGTAATTC 3960 
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ATCCCAAAAG GCACGCCTGT GTATCGCAGG GTATTTCCCC GAGAAAAGGC ATGCTTCTTG 4020 

GTAGTTCCAA AACGGGTATC GATGGTATCA ACTAGTCCTT TCATACTCTT TCCTTTAGCT 4080 

GTTTTTCTAC ATTATATCAG TAATAGAGGG CCTTTAG 4117 



(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2727 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

CTGGTTCAAT TATTATTCAC TCTAAGTAGT CATATGTTCT TTATTTATGT GAGTTTTTAC 60 

CTTTTAAAGG ATCTTGTTAG ATGGGAGAAG GTTTTAAAAG TGACAGATGA TAATACAAGA 120 

AAAGTTCGTT TATTAGTAGC CTTTTTTAGC ATTCTCATAG GCTACATCCT GACTTCTTTC 180 

TTTATTAGCC TGTATCATTT GTGGCAAGAA GCGCTTAGAG GATTATTATG AAATCAAGAG 240 

TAAAGGAAAC GACTATGCAT AAAATTGTGG TTCAAGGTGG CCATAATCGT CTGGTAGGAA 300 

GCGTGACGAT CGAGGGAGCA AAAAATGCAG TCTTACCCTT GTTGGCAGCG ACTATTCTAG 360 

CAAGTGAAGG AAAGACCGTC TTGCAGAATG TTCCGATTTT GTCGGATGTC TTTATTATGA 420 

ATCAGGTAGT TGGTGGTTTG AATGCCAAGG TTGACTTTGA TGAGGAAGCT CATCTTGTCA 480 

AGGTGGATGC TACTGGCCAC ATCACTGAGG AAGCCCCTTA CAAGTATGTC AGCAAGATGC 540 

GCGCCTCCAT CGTTGTATTA GGGCCAATCC TTGCCCGTGT GGGTCATGCC AAGGTATCCA 600 

TGCCAGGTGG TTGTACGATT GGTAGCCGTC CTATTGATCT TCATTTGAAA CGTCTGGAAG 660 

CTATGGGGGT TAAGATTAGT CAGACAGCTG GTTACATCGA AGCCAAGGCA GAACGCTTGC 720 

ATGGTGCTCA TATCTATATG GACTTTCCAA GTGTTGGTGC AACGCAGAAC TTGATGATGG 780 

CAGCGACTCT GGCTGATGGG GTGACAGTGA TTGAGAATGC TCCGCGTGAG CCTGAGATTG 840 

TTGACTTAGC CATTCTCCTT AATGAAATGG GAGCCAAGGT CAAAGGTGCT GGTACAGAGA 900 

CTATAACCAT TACTGGTGTT GAGAAACTTC ATGGTACGAC TCACAATGTA GTCCAAGACC 960 

GTATCGAAGC AGGAACCTTT ATGGTAGCTG CTGCCATGAC TGGTGGTGAT GTCTTGATTC 1020 

GAGACGCTGT CTGGGAGCAC AACCGTCCCT TGATTGCCAA GTTACTTGAA ATGGGTGTTG 1080 

AAGTAATTGA AGAAGACGAA GGAATTCGTG TTCGTTCTCA ACTAGAAAAT CTAAAAGCTG 1140 

TTCATCTGAA AACCTTGCCC CACCCAGGAT TTCCAACAGA TATCCAGGCT CAATTTACAG 1200 
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CCTTGATGAC AGTTGCAAAA GGCGAATCAA CCATGGTCCA 
TCCAACACCT AGAAGAGATG CGCCCCATGG GCTTCCATTC 
CTCGTATTGT TGGTGGACAG CCTTTGCAGG GAGCAGAAGT 
CCAGTCCGGC CTTGATTTTG ACAGGTTTCG TAGCACAGGG 
TGGTTCACTT GGATAGAGGT TACTACGGTT TCCATGAGAA 
AGATTCAGCG GATTGAGCCA AGTGATGAAG ATGAATAAGA 
CGTTTACTTT TAGTCATCAT AGTACTGATT TTAGGTACTC 
ATGGTAGGTT ATGGAATCTT GGGCAAGGGT CAAGATCCAT 
AAATGGCAGG AATTGATTCA TAAATTTACA GGAAATTAGG 
TAAAGATAAG GAGAAATATG AACAAAAAAA CAAGACAGAC 
TATTGCTTTT GTCTACAGGG. AGCTATTATA TCAAGCAGAT 
CCAAAACCAA TCTTACTCAG AAAAAACAAG CGTCTGAAGC 
AGAGTGTCTT AACAGACGCA GTCAAGAGTC AAATAAAGGG 
CAGGTGCTTT TATCGTCAAT GGTAATAAAA CAAATCTAGA 
CCTACGCTGA CAATAAAACA AAGACAGTGG GCAAGGAAAC 
CCCTCTTGTC TAACGCCACT CGTCAGTACA AGAATCGTAA 
CTTCTTGGAC TCCTCCAGGT TGGCATCAGG TCAAGAATCT 
CAGTCGATAG AGGTCATTTG TTAGGCTATG CCTTAATCGG 
CCTCAACAAG CAATCCTAAA AACATTGCTG TTCAGACAGC 
CCGAGTATTC GACTGGTCAA AACTACTATG AAAGCAAGGT 
ACAAGCCTGT CCGTTACCGT GTAACCCTTT ACTACGCTTC 
CAGCTTCACA GATTGAAGCC AAGTCTTCGG ATGGAGAATT 
CCAATGTTCA AAAGGGACTT CAACTGGATT ACCGAACTGG 
AAAAGATACG CCTACACTCC TATGTCACTT ATGGATGTAG 
AGCAGGACTA AGACAGGTAC TAAGACAAAA TAGCAACTTC 
GGGAGAGAGA TGGAAGTTAC TTTGAGA 
(2) INFORMATION FOR SEQ ID NO: 102: 

ti> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



GACAGTTTTC 
TGAGATTATC 
TCTTTCAACT 
AGAAACTGTG 
GTTGGCGCAG 
AATCAAGCTA 
TGGCTCTACG 
GGGCTATCCT 
CTGGAGAACC 
ACTAATCGGA 
GCCGTCGGCA 
TCCTAGTCAA 
GACTCTGGAG 
TGCCAAGGTT 
TCTTCCAACC 
AGAAACTGGG 
AAAGGGCTCT 
TGGTTTGGAT 
CTGGGCAAAT 
GCGTAAACCC 
AAACGAGGAT 
GGAATTCAAT 
AGAAGTAACT 
GAGTTCTTTT 
TAAAACTAAC 



GAAAATCGTT 
CGTGATACAG 
GACCTTCCTG 
GTCGGTAAAT 
CTAGGTGCTA 
CGTAGTCAAG 
AATCGGTTTA 
GTCTCCAGCA 
AGCCTTTTTC 
CTGCTAGTGT 
CCTAATAGTC 
GCATTGGCAG 
TGGAATGGCT 
TCAAGTAAGC 
GTACCTAATG 
AATGGTTCAA 
TATACCCATG 
GGTTTTGATG 
CAGGCACAAG 
TTGGACCAAA 
TTAGTTCCCT 
GTTCTAGTTC 
GTAACTCAGT 
TACTAGTTTA 
TTCCAGTTTT 



1260 

1320 

1380 

1440 

1500. 

1560 

1620 

1680 

1740 

1800 

I860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2727 
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(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


102: 






TTTTTTGTAG 


ATTTAAGTGG 


GGTGCAATTC 


CTAAAAAATA 


. AAAAACAATT 


TTTGAAAATT 


60 


ATGTTAGCAG 


GAATTGCTTC 


AAATTCGATT 


TTATCACTTA 


CAGGTTTACT 


TGTTTTATTG 


120 


TTCACATCGT 


ATAAATTGCT 


TGGACTCTTA 


TTTTTTATCA 


TTAACTTAGG 


TATGATTTTT 


180 


ATTAATTCAA 


TTCCTTTTTT 


TCAGTATGAT 


AGTGGTATTA 


TTTTAAGATA 


CTTGAATTCT 


240 


AACAATAATA 


ACTTGAATTT 


TCAATATATA 


GTTCAACTTT 


TAATAGCATT 


TGTTATTATT 


300 


TATTTTCCTT 


TGAGTCAACT 


ATTACAGTTT 


TTGACACCCA 


ATATTATTGT 


TCGTAGTATA 


360 


GGAGGGGTGG 


TTGTTTCTAT 


ACTGCTTTCT 


ATATTATATA 


TGATAGGAAG 


GACGAAATAT 


420 


GTTCTACGTA 


AATAGTTATG 


TTTTTGCTTA 


TAAAAAAGAA 


GGTATAATGT 


ATTTACGTGG 


460 


TCGGAGTATG 


CGGGAAATAG 


CTATAGAACC 


TCAAATTTCG 


CAAGAATTTA 


TCAACGATCT 


540 


ATTTAATAGT 


TGTAAGGAAC 


TATTAGAGAT 


AGAAGAAGTA 


TTAGGCAGTA 


AACTAACATT 


600 


TGAACTATAA 


ATGAACAAAT 


TTTAATTTCC 


GATGAGATAG 


ATATTCATAG 


TAGATATTCT 


660 


AGAACTAAAG 


GTTACTATTC 


GTTATTTTAT 


AATGAAGAGT 


ATAATAAAAT 


ACAGAATAAA 


720 


ACAGTATTAG 


TATTAGGAGC 


AGGAGTCTTA 


GGATGTTATA 


TATCTCTAAG 


TCTAAGTATG 


780 


TATGGAGTGA 


GGAAACTTAT 


TGTCGCTGAT 


TACGATATAA 


TAGAACCATC 


AAATTTAAAT 


840 


AGGCAAATTC 


TTTATACAGA 


GTCGGATGTT 


GGTAAGGAGA 


AGATTAATGT 


TCTTTCTGAA 


900 


AAAATACACA 


AGTATAATTC 


AGATGTTCAG 


GTAGTACCTA 


TTTCTATTAA 


AGTTTCrrCA 


960 


GTAGAAGAAT 


TAGAAAAAAT 


TGTTGCCGAA 


TATGGGAGTA 


TAGATTTTAT 


CGTTAAAGCA 


1020 


ATTGATACGC 


CCATTGATAT 


TATAAAAATT 


GTCAATCAAT 


TTGCTGTATC 


GCATAAGATA 


1080 


TCCTACATAT 


CAGGAGGGTT 


TAATGGATGC 


TATCTTATTA 


TTGATAATAT 


ATATATCCCT 


1140 


ACCATCGGTT 


CTTGCTTTGG 


TTGTCGGAAT 


ATAAACAAAG 


ATATAAATAA 


GTACACTTTA 


1200 


TCTGATAAGA 


CAAAGTGGCC 


GACTACACCA 


GAGATGCCTG 


CTATTTTGGG 


AGGGATAATG 


1260 


ACTAATTTAA 


TAATTAAAAT 


ATTTCTGGGA 


TUTTATAATC 


AAATCCTAAT 


AGATAACGCT 


1320 


TACGTTTATA 


ATATGAGAAA 


TCATGCTCTA 


AGTCAAGAAA 


AATATGTTCT 


GGAAAACGGA 


1380 


GAATGTCCAA TTTGTAAAAA AATAATAAAG TCAAAGATAA 


CAATATTAGA 


GCGAAAACAT 


1440 


TTATTCGTTC 


AGTTTGTTTT 


TGCTTATTAT 


CAGGAGGAGT 


AGCTTTTTTA 


TCTGCTATTG 


1500 


GGCAGTTCAC 


TGTTATAGAA 


ACACAATTAA 


TAGTATTGTT 


CTTGGGTATT 


ATTTTTGCTA 


1560 


TATATTATGC 


TTACTACAAT 


AAAAATATTC 


AAACATCATT 


GGAAAATATA 


GTATGGCTTT 


1620 
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TTTCATCGTT TGAGATTTTA TTTTTGCTTG TTAATTTTAG AACATTTATT CAGTTACCAG 1680 

TGGATATTTT TATTGGTATG ATAATATTTT TAATGCTGTG GATATTTATT ATGTTAGGTA 1740 

TAGTGTGTCT TAGTTATTAT ATAACTTTAT TATTTAGCAA GGAGGCTTAG TATGTTTAAA 1800 

AAAATAGGTA TAATGAGCAT TTGCATATAT ATAATTATTT TATACTGCTT GAGAATGTAT I860 

CGTATTATCA ATAATATTGA AACAATCTTG CTAACGGTTA TATGCTTAAT GTTATTGTTT 1920 

TTTTTAAGAC GTTTATTTGA TAAAGATAAG TAAATAGATG TTAAGTAAAA ATGTAGAATA 1980 

TAAAGGAGGT GCAATGAGTA TGATTGAAGT TAGCCATTTA TCAAAAAGTT TTGGTGATAA 2040 

AATAGCTTTA AATAATATAA GCTTCACTGT TAAAGAAGGT TAGATTTTTG GATTTTTAGA 2100 

ACCATCTGGT TCTGGAAAGA CCACAACGAT TAATATTCTG ACTGGGCAGT TCCTTGCCGA 2160 

TAAAGGACAA TCTATTATTT TGGGACAAAA ATCTCAAAAT TTAACAACCG GTGAATTAAA 2220 

GAGAATTGGA TTGGTTAGCG ATACAAGTGG ATTTTATGAG AAAATGTCTC TGTATAACAA 2280 

TCTTCTTTTT TATAGTAAAT TTTATAATAT TAGTAAATCA CGTGTTGATA ATTTGTTAAA 2340 

GCGAGTAGGA TTATATGATA GTCGCAAGAT GGTAGCAGGA AAATTATCCA CTGGAATGAG 2400 

GCAACGAATG CTTTTAGCAC GAGCTCTTAT CAACAACCCC GCTCTACTCT TTCTGGATGA 24 60 

ACCGACCTCA GGTCTAGATC CCACAACTTC TCGAACAATT CATGAGTTAA TTTTAGAATT 2520 

GAAAACAGCA GGGACAACGA TTTTTCTAAC GACTCATGAT ATGAATGAAG CAACTCTTTT 2580 

ATGTGATTAT GTTGCCTTAT TAAATAAAGG GAAATTAGTT GAGCAAGGAG CTCCTTCTGA 2640 

ACTCATTCAA AGATATAATA AAGATAAAAA GATTAAGGTT ACAGATTATA ATGGGAATCA 2700 

GATAACTTTT GATTTTACAT CACTAGAACA GGTATCTCAG ACTGATCTGG AAAATATTTT 2760 

TTCAATTCAT TCATGTGAGC CTACTTTAGA AGATATTTTT ATCACATTAA CAGGAGGAAA 2820 

GCTAAATGCT TAAACGGTTT CTGGCTTTGG TATGGTTGCG TTGTCAAATC ATCCTTTCCA 2880 

ATAAGAGTAT TTTATTGCAA GTTTTAGTGC CTTTTCCTTT CACATATTTT TATAAATATC 2940 

TTATGGAAAC ACAGGGGAAG GTCAACGATC AACAGGCATT AGTTCTTTTG ATGATGTGTT 3000 

TACCTTTTTC TTTTTCTTTG GCTGTTGGAA GTCCTATAAC TATTATCTTG TCTGAAGAAA 3060 

AAGAAAAGTA CAATTTACAA ACTCTTCTGT TGAGTGGTGT TAAAGGCTCC GAATACATTT 3120 

TATCAACTAT GTTTCTTCCT TTTTTGCTAA CTTTTGTGAT TATGGGAACT ACTCCTCTTA 3180 

TTTTAGGAGT TACAATTGTA CATACTTTTA ATTATATTAC AATCGTTCTT CTAACCTCTT 3240 

TATCCATCAT TTTATTCTAT TTATTGATAG GTTTAACCGC GAAGAGCCAA GTAGTAGCTC 3300 

AGGTTATCAG TCTTCCTGCT ATGATTTTAG TT QC T T TC T T ACCGATGCTA TCTGGTTTGG 3360 

ATAAGACAGT TGCGAAGATA ACAGATTATA GTTTTATGGG ACTATTTACT AAGTTTTTCA 3420 



WO 98/18931 



PCT/US97/19588 



CAAAATGGGA GCAATTTTCA TGGAATAAAA CTCTAATTCC TAATCTAACA CTACTTATTT 3480 

GGATTGTTCT TCTATTAACT TTAATTACGA TAACTATTAG GAAAAAGAAA ATTTCTTAAT 3540 

TGAGTTATTT TAATGATTAT AAACACAAGT GGGAAGGAAA AAATGAACTG ATCTTTTTGA 3600 

CAGCAATTCT ACAGAATAGT CTTATTGCTA TATTTTGATT TGAGTGTACG AAAAAAGAAA 3660 

AATAACAATA GTGCTCATAC TAATTGCAGA AGTTTTGGGT GATAAGATAA CTGATAAATT 3720 

GCAATAAAAA ATGCAACATT TTTAAATCTC CTCTATAAGT GCTTCAAAAA GTGCTTCAAA 3780 

ACCTGTCTTG TAATCCAAGT ATTTTTGGGG ACGGTGATTA ATAAGCTAGC AAAGCATCAT 384 0 

TAAGGATTTT TTCGGTAATT GTTGCCAAAT CGGTTTAAGA AAATACTCAC GAAGAAGTCC 3900 

ATTCGCATTC TCATTACTTC CCCTTTGCCA AGATGAATAG GCATCCGCAA AATAAAACAG 3960 

AATTCCCATT TGTTCAATTA AAGGGTAACA AGCAAACTCT TTTTCTCTGT CCGAAGTGAA 4020 

AGTCTTTAAC TATTCTTTTG GAAAGAGTCT TGTGAGGTGT TCAATAGCAG TCAACATGGA 4080 

TTTAGCTGTT TTTACTTGAC AAGTGCTAGT AGAAATAATA GAATAGTAAA AAACCTTTAA 4140 

AGCAGTCCAG AGAGGCAGCT AAGGTTAGAC GGTGAAAGGG TGGAGACTAC CCATTTTTCG 4200 

TGGAACCTTG CTGTTGGCAG GTTCCTTTTT TCGTGGCTTC TGTTGGCCAG ACTCTCTCAC 4260 

TAGTAAAGGT AAAAGGAGAA ACCTATGCGA GAACATCGTC CAATCATTGC TCTTGATTTT 4320 

CCTAGTTTTG AGGCGGTCAA GGAATTTTTA GCTCTTTTCC CAGCAGAAGA AAGCCTTTAT 4380 

CTCAAGGTAG GGATGGAGCT TTATTACGCA GCGGGGCCTG AGATTCTGTC CTACTTAAAA 4440 

GGTTTGGGTC ATAGTGTCTT TTTCCATCTC AAACTTCATG ACATTCCTAA TACAGTCAAG 4500 

TCAGCCATGA AGATCTTGTC TCAGCTTGGT GTCGATATGA CTAATGTCCA TGCGGCTGGT 4S60 

GGTGTAGAGA TGATGAAGGC GGCGCGTGAA GGTCTTGCGA GTCAAGCCAA ATTGATCGCT 4620 

GTAACTCAGC TCACATCAAC GTCAGAAGCT CACATGCAGG AGTTTCAAAA TATCCAAACC 4680 

AGTCTGCAAG AGTCTGTGAT TCACTATCCC AAGAAGACAG CTCAAGCTGG CTTGGATGGT 4740 

GTTGTTTGCT CGGCTCAGGA AGTACAAGTC ATCAAGCAGG CTACCAATCC AGATTTTATC 4800 

TGTCTGACAC CAGGGATTCG TCCAGCTGGT GTTGCAGTTG GAGATCAAAA ACGAGTCATG 4860 

ACACCTGCTG ATGCCTATCA AATCGGCAGT CACTATATCG TAGTGGGACG TCCCATTACC 4920 

CAAGCTGAGG ATCCTGTTGC AGCTTATCAT GCCATCAAGG ATGAATGGAC ACAGGACTGG 4980 

AATTAAAGAA CTAGATTAGA AAAATAAAAG GAGAATACCA TGACACTTCC TAAAGATATC 5040 

GCTAGCCACC TCTTGAAAAT CCAAGCCGTT TACCTCAAAC CAGAGGAACC CTTCACTTGG 5100 

GCATCTGGTA TCAAGTCACC GATTTACACT GATAATCGTG TGACACTAGC CTATCCAGAA 5! 60 
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ACTCGTACCC TAATTGAAAA TGGTTTTGTG GAAGCTATCA AAGAAGCCTT TCCTGAAGTA 5220 

GAAGTGATTG CAGGAACTGC AACAGCAGGG ATTCCACACG GAGCCATTAT TGCTGATAAG 5280 

ATGGACTTGC CTTTTGCCTA CATCCGTAGT AAACCAAAAG ACCACGGAGC TGGTAATCAA 5340 

ATCGAAGGTC GCGTAGCTCA AGGTCAAAAA ATGGTAGTGG TTGAAGACCT TATTTCAACG 5400 

GGTGGTTCAG TTCTTGAAGC TGTAGCAGCA GCCAAGCGAG AAGGAGCAGA TGTACTTGGA 5460 

GTTGTAGCGA TTTTCAGCTA CCAATTGCCA AAAGCAGATA AGAACTTTGC AGATGCTGGT 5520 

GTTAAACTTG TGACGCTTTC AAACTATAGC GAGCTTATCC ATCTAGCCCA AGAAGAAGGT 5580 

TACATCACGC CAGAGGCCCT TGATCTTCTA AAACGCTTTA AAGAAGACCA AGAAAATTGG 5640 

CAAGAAGGTT AGGTCAGTAA GATAAAGAGA GACGACGCTA CCGAGTCTCT TTTACCATTT 5700 

TATTTAAAAT ATGACAG 57 17 



(2) INFORMATION FOR SEQ ID NO: 103: 

{i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5558 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

CCTGGACTTT CTAAAATGAA ATCTTGCGAC CTGGATCAAG CCCTTCATGA GCATTTTTCA 60 

GAAGAAGAAT TAGCTGGTCA CTTTCATGTC CTTCTATGGA CTTTTTTTAC AATGGCATTG 120 

CTATCACACC CAATACCTAT CTAAGCGCCT GGTTCGTAAA CTTTATTGCA GCTCTTCCTC 180 

TAAATTTCCT AATTGTTGAA CCAATTGCCC GTTTTATACT AAGTTCTTTT CAGAAACCAT 240 

TTACTGGGGA AGAAGTTGAA GATTTTCAAG ATGATGATGA AATCCCAACT ATTATCTAAG 300 

CCAGTTCTGT AAACTACTAA TATTTGAAAT CCACTTCCTT TTAGGGTGCA ATGGTTATAA 360 

ATGAATTTTT GAGAGGATCA GAATGAAAAA ACTACCAACC CTTCTTTTAC TGTCTACTv.T 420 

AGCCCTAGCT GGGTGTAGCA GCGTCCAACG CAGTCTGCGT GCTGATGATT ATGTTGATTC 480 

CAGTCTTGCT GCTGAAGAAA GTTCCAAAGT AGCTGCCCAA TCTGCCAAGG AGTTAAACGA 540 

TGCTTTAACA AACGAAAACG CCAATTTCCC ACAACTATCT AAGGAAGTTG CTGAAGATGA 600 

AGCCGAAGTG ATTTTCCACA CAAGCCAAGG TGATATTCGC ATTAAACTCT TCCCTAAACT 660 

CGCTCCTCTA GCGGTTGAAA ATTTCCTCAC TCACGCCAAA GAAGGCTACT ATAACGGTAT 720 

TACCTTCCAC CGTGTCATCG ATGGCTTTAT GGTCCAAACT GGAGATCCAA AAGGGGACGG 780 

TACAGGTGGT CAGTCCATCT GG CATGACAA GGATAAGACT AAAGACAAAG GAACTGGTTT 840 
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CAAGAACGAG ATTACTCCTT ATTTGTATAA CATCCGTGGT GCTCTTGCTA TGGCTAATAC 900 

TGGTCAACCA AACACCAATG GCAGCCAGTT CTTCATCAAC CAAAACTCTA CAGATACCTC 960 

TTCTAAACTC CCTACAAGCA AGTATCCACA GAAAATTATT GAAGCCTACA AAGAAGGTGG 1020 

AAACCCTAGT CTAGATGGCA AACACCCAGT CTTTGGTCAA GTGATTGACG GTATGGATGT 1080 

TGTGGATAAG ATTGCTAAGG CCGAAAAAGA TGAAAAAGAC AAGCCAACTA CTGCTATCAC 1140 

AATCGACAGC ATCGAAGTGG TGAAAGACTA CGATTTTAAA TCTTAAAAAC CAAAAAAATA 1200 

CAGTATCCAC ATTCGGTACT GTATTTCTTT TACTCTCATT CTTAAGTTAA ATTATTAAAA 1260 

TCCCATATTT GGTCTATCCA GCCTTCATAA AAGTCTGGCT CGTGGCAGAC CATAAGGATA 1320 

GATCCCCTAT ATTCTTTGAG AGCGCGTTTG AGCTCATCCT TTGCATCCAC ATCCAAATGG 1380 

TTGGTCGGCT CGTCCAGCAC TAAAACGTTG TTTTCACGAT TCATCAAGAG ACAGAAACGA 1440 

ACCTTGGCTT GCTCTCCCCC TGATAATACT TGAATCTGGC TTTCAATATG TTTGGTTGTC 1500 

AAACCACAAC GGGCAAGGGC TGCACGGACT TCTGCTTGAT TAAGGGCAGG AAAGGCATTC 1560 

CAGACAGCTT CAAGAGGAGT TTGGCGATTA CCGCCTTCTA CTTCCTGCTC AAAATAACCA 1620 

AGTTCTAAAT AATCTCCACG CTCCACTTCC CCAGCGATTG GCGAGATAAT GCCCAAGAGA 1680 

CTCTTCAAGA GAGTTGTTTT TCCAATACCA TTAGCACCAA TAATCGCAAC CTTTTGATTG 1740 

CGTTCGAAGG TAAGATTTAA AGGCTTAGTA AGAGGACGGT CGTAACCAAT TTGCAAGTTC 1800 

TTGGCTTGGA AGATAAAGCG CCCTGGTGTA CGAGCTGGTT TGAAATCAAA GGATGGTTTT I860 

GGTTTCTCAC TTTGGAGTTC GATAATATCC ATCTTATCCA ATTTCTTTTG ACGAGACATA 1920 

GCCATATTAC GAGTTGCAAC ACGGGCTTTA TTACGAGCCA CAAAGTCCTT GAGGTCTGCA 1980 

ATCTCTTTCT GCTGGCGTTC GTAGGCTGCC TCTAGCTGAG ATTTCTTCAT AGCATAAACT 2040 

TCTTGGAACT GGTAGTAGTC ACCAGAGTAA CGCGTCAGCT GTTGATTTTC CACATGATAG 2100 

ACAATATTAA TAACGTCATT GAGGAATGGA ATATCGTGCG AAATGAGAAC AAAGGCATTC 2160 

TCATAGTTTT GGAGATAGCG CTTGAGCCAA TCAATATGCT CAGCATCCAA GTACTTGGTC 2220 

GGCTCGTCCA ACAGCAAGAT ATCAGGCTTT TCAAGGAGAA GTTTTGCCAA AAGCACCTTG 2280 

GTTCTTTGCC CACCTGACAA AGAAGTTACA TCCGTATCCA TGCCAAAGTC CATAACACCA 2340 

AGAGCACGCG CTACTTCGTC AATCTTAGCA TCCAAGGTAT AGAAATCACG ACTCTCCAGA 2400 

CGCTCTTGAA GTTCTCCTAC TTCTTCCATG AGAGCATCAA CATCCGCGCC GTCTTCAGCC 2460 

ATTTTCATAT AGAGGTCATT GATACGAGCT TCAGCTTTGA AAAGCTCATC AAAAGCCGTA 2520 

CCGAGAACAT CACGCACCGA CTGTCTTTCA GCAAGGACAG AGTGCTGATC CAAGTAACCA 2580 
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GCCGTCACAT ATTTGGACCA CTCAACCTTT 
ATACTCATAA AGGTTGATTT TCCTTCACCA 
TTGAGGAGAC GGAAGGACAC ATCTTCAAAA 
TTTTTAACTT CTAAAATACT CATTTTAATT 



772 
CCTTCATCTG 



GCAGCATTTT ACCAGTCACG 



CCAGGCCGAT ATGTTCTCCC 



AAGGAGCCAA 
GCGATTGAAA 
AAGAAAACTA 
TTGGAAAATC 
TAATTGTCCA 
AAAAACCAGG 
TTGAGGAGTA 
AATATGATTA 
TCACGTTTCA 
TCTTGACGAA 
CACTCTTTGG 
TTTACATAAG 
ACACCGTCGT 
TCAATGTAGT 
ATGATACTTG 
ACCCCTTCAT 
TGACGAGCAC 
TTGTTGTAAT 
TCCTGACCAC 
TTATTGAGGT 
CAGCCGACCT 
TCAATTAACA 
TGCACAATAC 
TCCTTGGAAA 
CAGATTTGAT 
TGATGTACCA 



GCCAGATAGC CACCCAAAGT 
TCAAAGAAAA AGTCCAAGAT 
AAAAGAAGGA CCTTTTTTGT 
AGAGGAAAAA ACAAGAAGAC 
ATGTCACTCA CTTCGCCCAG 
CGTCCAAGAT GCTGAATACC 
AAGCAAAAAC AGACAATGCA 



TTGGCACCGA 
ATTGCACGGT CACCAAAACC GTGACTCAGA 
CCTTACCTTG TTTTTATGTA ATCGTTTATA 
GTTGGTCCAC AAATCATCAA TCTCAAAGAC 



TAATTGCGTA 
TTTCCGCAAA 
ATTGAGGATA 

TTTCCAGAGA GAATTGAAAG GAGTCAAAAG 
TGGAGTTCCC 



CACTCGATTC CAAGACTCAC 
TTTGGAAATA GATAAAGGAG 
TTTTGTAAAA AAATCCAACA 



TAAGTCTTCT 
TTGTGTTAGA 
CCACACAGTT 
GACTACCGCT 
ACAATTTCTT 
TGACTTCATT 



TCATCATTAA 
GCGCAATTGT 
GACCCCTTT T 
ATATTGGTCA 
GATGTTCTTG 
AAGCATGATA 



ATTCAATAGC 
AACGAAGTTC 
CAGCAAAGTC 
CGATAGCCAT 
TATCAAAGGG 
GCTCATCAAA 
CACGTTGCTT 
GAGTGGTCAC 
TACCGTCGGG 
GTTGTTTCAA 



AGCAAAGAGT 
ATTGTTAGGT 
ACGAATTTTA 
TCCTTTATCA 
CTCACCGATT 
GTATTTCTGA 
CTTAATCAAA 
ACAGACAGAT 
CAATTCAAAG 
GGGATTGACC 



GGTTGGTCAT TTCTTCAAAT 



ACTCCCACGG TAGATTGTTC 
AATGCTATAG AAAATGACTC CCCAGACCAA 
GGATTTACCG CTGCGACTGC CTTCTGGCGG 
CCACAAGCTG CGTCAATATC TGTACCATGC 
TTCTTAAGCG TATCATAGAA AGCCAACACG 
TGCTCACTAA CTGGGTTATA AGGAATCAAG 
ACCAATTCAG TCAATTCCAA GGCTTGTTCT 
TATTCAAAGG TTACACGACG GTTTGTTGTC 
TTTTCAATCG GAAAGGCACG GTTAATCTTC 
GCGTGAAGAG ACACGGCAAG ATTGACCTGA 
TGAGCCAAAC CTGAGGTTGA AACCCTGATG 
TCATTGATAG TACGAAAGAA ATTCAAGACA 
CCCATGACAA CGATATGGCT GATGCGTTCA 
ACCAGCATGA TTTGCGCTAC GATTTCACCG 
CCAGAGGCAC AGAAGGTACA ACCGATATTA 
AAACCATAGT GTTGACCCAT GAGTACAGTC 
AGATATTTGA CTGTACCATC AGCAGACTCT 
ACAAACTGGT CATTGAGCTT AGCAATCAAA 
GACTGCACAC GTTTACGGTA GAGCCATTCC 



CTGCACGGAA TTTCTTTTCT CCCTGCTCCA ATACCCATTC CTGCATGGTT 
AACTATGAAT TGAGGGTTTC ATTTCTTCTC CTTATTCTCT ACTCACTTCT 



2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 
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GACCAATGAC AAAATGACGT TGTCCCTTGT CGTCTTTCTG ACGACGTCTA TTTTTCTTAT 4440 

CTCCATTCGA CTTTCGTTTA GTTTGAGTCG GTTTCTTTCC TTTTCTAGAA GGTGTTTCTT 4 500 

CTTCCGTCTT ACGCATTTTC TTGTCAAATG ATGCTCGCTT AGGGGCTTCA TTTTCTAAGA 4560 

CAAAATAGGC ACAACCATAA CTACAATACT CTAAAAGGTA GTCTTGTAAA CGACTGATTT 4620 

TTTCAAGTTT TTCTTCTGTT CGGTCATCCT TGTAAAAACC TCGTAGGCGA AGCTGTTCGT 4 6 SO 

TGCTCCAGTC CCCCACGATA TAATCAAACT TGGTTAATAC TTCTGAAAAA CGCTGATTAA 4740 

AAGTCGTCAC ATCAAAGGCA TCCTTGATAT TTTCAACCAA GGAAAAAGCT ATCCCTTCCG 4 800 

TTTCGACCTT GTCCCCGTGT AAATGGAACT CCGGACCAGG AAACTTGTTA TAGTTGTATA 4 860 

ATTCAGGTGC AATTTCTTTT CGCATAGATA TCCTTTTTTC ACGATTACTT AATACTTTAT 4 920 

TCTACCATAA TTTCTAGCAG TTAGCACGTT TCTCATAAAA ATGAAAAAAG TCTGACGATT 4980 

TTGTCAGACC AGAATCTTAT AACCTAAAAA GAGAAGAACA ATTCTTCCCT CCAACTATCA 5040 

TTATTTAGCA GCTGCGTACA ATTCATCTAC TTTATTCCAG TTGATTACTG AAAAGAAAGC 5100 

TTTGATGTAG TCAGGACGCA CGTTGCGGTA TTTCACGTAC TAAGCATGTT CCCAAACGTC 5160 

CAAGCCCAAG ATTGGTTTTT TACCTTCTGA GATTGGTGTG TCTTGGTTTG CTCTTGAAGT 5220 

CACTTCAAGT TTCCCTTCTT TGTTGACAAC CAACCATGCC CAACCTGAAC CAAAACGAGT 5280 

TGTTGCTGCT GCAGTGAAGG CTGCTTGGAA TTCTTCAAAT GAACCAAATG TTGCATCGAT 5340 

TGCTGCTGCC AGTTCTGCTG AAGGAGCTGT TTTCTCGGGA GTCATCAATT CCCAGAAAAG 5400 

AGCGTGGTTC AAGTGTCCGC CACCATTGTT GATAAGTGCT TGACGGATAT CAGCTGGGAT 54 60 

AGATTCTACA TCAGCAAGCA AGGCTTCAAG GTCTTCACCG ATTTCAGGGT GTTTTTCTAA 5520 

AGCTGCATTG GCATTGTTGA CATAAGTTTG ATGGTGTT 5558 

(2) INFORMATION FOR SEQ ID NO: 104: 

li) SEQUENCE CHARACTERISTICS: 

I A) LENGTH: 673 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : double 
<D) TOPOLOGY : linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

GGAATTGTAA ATATCATATT GTTTTTGCAC CCAAATATCG TCGTCAAATC ATTTATGGCA 60 

GATACAAAGC TAGTATCGGA AGAATCATAC GTGACTTATG TGAGCGTAAG GGTGTAATAA 120 

TCCATGAAGC GAATGCTTGT TCAGACCATA TTCACATGCT TATCAGTATT CCTCCGAAAC 180 
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TTAGTGTTTC 


GTCCTTTATG 


GGCTATTTAA 


AGGGCAAGAG 


CAGTTTGATG 


ATTTTTGATA 


240 


AGCATCCGAA 


TTTAAAATAC 


AAATATGGCA 


ATCGCAAGTT 


TTGGTGTAGA 


GGCTATTATG 


300 


TAGATACGGT 


ACGCCGTAAT 


CAGAAAGTGA 


TAGCTGAATA 


TATTCAGAAT 


CAATTACAAG 


360 


AAGACAGAGT 


AGCAGACCAG 


CTCACGTTAT 


TCGAGTCAGT 


AGATCCGTTT 


ACTGGCGAAA 


420 


TAAATAAGAG 


GAAGTAACTA 


AGGTGCTTTA 


CCACCTCCTC 


GGGAAAGTGG 


TGCGCGAGGA 


480 


AGCTATTTCG 


GTGGGCCTTT 


GGCCCTGGCC 


GGTAGAAGCG 


GCTTATAGCC 


GCAGAACAAA 


540 


CCACCAGTTC 


ACACTGGTGG 


TTTTGATTTA 


AAAAACTTGA 


TACATAAAAA 


TAAAAGTCTA 


600 


TATAAAGGAT 


GGTAAAATTC 


CTGTTGTCCG 


ATTTGGACAA 


TATCCTAAAT 


AGTTACAATA 


660 


TATGGTCTAT 


ACTTTTTCTT 


AGGAGAAAGC 


TAGATGTACA 


GACGTTTGAG 


AGATTTGAGG 


720 

> 


GAGGATCATG 


ATCTGCCCCA 


AAAGCAAATA 


GCTACAATAC 


TTTCGTTTAC 


AAATTCAGCT 


780 


TATGCCAAAA 


TTGAACGGGG 


TGAGCATGCG 


TTGACGGCTG 


ATGTATTGGT 


TAAACTCTCA 


840 


GATTTCTATG 


ACGTCAGTAC 


AGACTATTTA 


TTGGGATTAA 


CTGATTTTCC 


TGATAAAATT 


900 


CGCTTTAGAA 


AATAATCTCC 


TCAATTTCAT 


AGAGTTTGAA 


AATCAGTGAG 


ATTTTTTATT 


960 


TGCCCTTTGA 


CAACTGAATA 


GCCTAAAATG 


GTACTTTCCT 


CATTTGTGGA 


GCAAATTTGA 


1020 

M W A V 


ATGGCTCGCC 


ATGATAAGAG 


CGATTTTAAA 


ATCATCAATA 


AAATAGAGCG 


ATACTTTATA 


1080 


TGCCATGATA 


CAAATGATAT 


ACAATGATAC 


TTCTGACCGT 


TCAGCCTCCC 


AACGTAAAAG 


1140 


AGCAGCAAGT 


GAAATTCTTA 


TCATGACTTC 


ATCAGTCATG 


CCACGTTGAA 


TGTGTGAGTT 


1200 


TGTTAGATAA 


ACGCAATTAA 


TCCTCAAAAG 


GTTCCCCGAA 


CCTTTTGAGT 


TCTACAGACG 


1260 


CATCACGTGG 


AGTGTGTAAG 


CTTGTTGCTA 


AAAGCGTAAA 


AACCTTGGAA 


CGAAAGGAAT 


1320 


AATAGACTTT 


CTGCGAAACA 


AAAATATAAT 


ACAATAAAAC 


TATGAATGAT 


GAAGCAAGTA 


1380 


AACAATTGAG 


CGATAGCCGT 


TTCAAGATCC 


TTGTAGGTGT 


TCAGCGCACG 


ACTTTTGAAG 


1440 


AGATGTTAGC 


TGTGTTAAAA 


ACAGCTTATC 


AACGTAAACG 


CGCAAAAGGT 


GGACGAAAAA 


1500 


GCAAATTAAG 


CCTAGACGAT 


CTCCTTATGG 


TAACTATTCA 


ATACATGCGA 


GAATAGAGCA 


1560 


CTTATGAACA 


AATTGCGGCT 


GATTTTGGCA 


TTCACGAAAG 


CAACTTAATC 


CGTCCGAGTC 


1620 


AATGGGTTGA 


AGCAACTCTT 


ATTCAAAATG 


GTTTTACGAT 


TTCAAATTCT 


GCCTTAATTC 


1680 


TGTAAAAACA 


GTAAAATTCG 


AAGGATTGTA 


AGGTAAGAGT 


TTTTTTCTTT 


CTGAAAAAAT 


1740 


GGTATAATAG 


CAATCAAAAC 


TAGAAAATAA 


AACGGAATTT 


GGAACACATT 


TGTCTGTATC 


1800 


CTAGTAGAGT 


GGTGATACTA 


TGAAGATTAG 


TAAGAGGCAC 


TTATTAAATT 


ATTCCATCTT 


1860 


GATTCCCTAC 


TTGCTTTTAT 


CTATTTTGGG 


CTTGATTGTC 


GTCTATTCGA 


CCACCAGTGC 


1920 


TATTTTAATT 


GAAGAAGGCA 


AGAGCGCCTT 


GCAGTTGGTT 


CGAAACCAAG 


GAATCTTTTG 


1980 
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GATTGTTAGT TTGATACTGA TTGCCTTAAT TTATAAATTG AGACTAGATT TTTTGAGAAA 2040 

TGAGCGACTA ATCATTTTAG TTATATTAAT AGAAATGCTT TTATTGTTCT TGGCTCGTTT 2100 

TATTGGTATT TCCGTAAACG GGGCATACGG TTGGATTTCG GTTGCAGGAA TAACTATTCA 2160 

GCCACCTGAG TACTTAAAAA TCATTATTAT TTGGTATTTA GCTCACCGAT TCTCCAAACA 2220 

GCAAGAAGAA ATAGCTACTT ATGATTTTCA AGTTTTGACT CAAAATCAAT GGCTTCCCCG 2280 

TGCTTTTAAT GATTGGCGAT TCGTTCTCCT AGTTCTGATT GGAAGTTTGG GAATTTTCCC 2340 

TGATTTAGGA AATGCGACTA TTTTAGTCTT GGTTTCCTTG ATTATGTATA CAGTTAGTGG 2400 

AATCGCTTAT CGCTGGTTTT CAACCATTCT GGCGCTCGTA TCTGCCGCTT CTGTCTTTGT 2460 

CTTGACCACT ATCAGCCTAA TCGGTGTTGA GACCTTTTCA AAAATTCCAG TATTCGGCTA 2520 

TGTACCCAAG CGCTTTAGTG CCTTTTTTAA TCCTTTTGCC GATCGTGCTG ATGCAGGTCA 2580 

CCAGTTAGCT AATTCTTATT TTGCCATGGT CAATGGCGGT TGGTTTGGTC TAGCTCTTGG 2640 

AAACTCGATT GAAAAACGAG GTTATTTGCC AGAAGCTCAT ACAGACTTTG TCTTTTCTAT 2700 

CGTGATTGAA GAATTTGGCT TTGTTGGTGC CAGTCTTATT TTAGCTCTCT TGTTTTTCAT 2760 

GATTTTGCGG ATTATCTTGG TCGGTATCCG AGCGGAGAAT CCTTTCAATG CCATGGTTGC 2820 

ACTCGGTGTC GGAGGGATGA TGTTGGTTCA GGTATTTGTC AATATCGGAC GGATTTCGGG 2880 

CTTGATTCCA TCTACAGGAG TGACTTTCCC CTTCTTATCC CAGGGTGGAA ATAGTCTTCT 2940 

AGTCTTATCA GTGGCAGTAG CCTTTGTCTT AAATATTGAT GCCAGTGAAA AACGCCCTAA 3000 

ATTGTACCGA GAATTGGAAA ATCAACCAAT GAACCTTCTG TTGAAGTAGG ATAAAGAAAG 3060 

GATAGTTTAT GTCTCTTCAA AAATTAGAAA ATT AT ACT AA TAAAAGTGTT GTGCAAGAAG 3120 

AAGTCTTGAT TCTAACAGAA TTACTGGAAG AT ATT ACT AA AAATATGCTT GCCCCAGAGA 3180 

CCTTTGAAAA AATAATACAG TTGAAAGAAT TATCAACGCA GGAAGATTAT CAAGGTCTAA 3240 

ACCCTCTAGT GACTAGCTTA TCAAATGATG AAATGGTCTA TATTTCACGC TATTTCTCTA 3300 

TCTTGCCTCT TTTGATTAAT ATTTCAGAGG ATGTGGATTT AGCTTATGAA ATCAATCATC 3360 

AAAATAATAT TGATCAGGAC TATTTAGGTA AATTATCTAC AACGATTAAA TTGGTAGCAG 3420 

AAAAGGAAAA TGCCGTTGAG ATCCTAGAAC ACTTGAATGT TGTCCCTGTT TTGACAGCCC 3480 

ATCCAACACA AGTGCAACGC AAAAGTATGT TGGATTTAAC AAATCATATT CATAGTCTTT 3540 

TGCGTAAATA CCGTGATCTT AAGTTGGGGT TGATCAATAA AGATAAATGG TACAATGATT 3600 

TGCGTCGTTA CATCGAAATT ATCATGCAGA CAGACATGAT TCGTGAGAAA AAATTAAAAG 3660 

TGACTAACGA AATCACGAAT GCTATGGAAT ATTATAACAG CTCCTTTTTG AAAGCTGTAC 3720 
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CTCATTTGAC GACGGAGTAT AAGCGCTTAG CGCAAGCGCA TGGTCTGAAT TTAAAACAGG 3780 

CTAAACCAAT CACCATGGGT ATGTGGATAG GTGGTGACCG TGATGGAAAT CCATTTGTTA 3840 

CAGCAAAGAC CTTGAAGCAG TCTGCACTCA CTCAGTGTGA AGTCATCATG AACTACTATG 3900 

ATAAAAAGAT TTACCAACTT TATCGTGAAT TTTCTCTTTC AACTAGCATT GTCAACGTCA 3960 

GCAAGCAAGT CAGAGAAATG GCTCGTCAAT CCAAGGATAA CTCGATTTAC CGCGAAAAAG 4020 

AGCTTTACCG TCGTGCCTTG TTTGATATTC AATCAAAAAT TCAGGCAACT AAAACCTATC 4080 

TGATTGAGGA TGAAGAAGTT GGGACTCGTT ATGAAACCGC CAATGATTTC TACAAGGATT 4140 

TGATTGCCAT TCGAGATTCT CTACTAGAAA ATAAGGGCGA CTCCTTGATT TCAGGTGATT 4200 

TTGTGGAATT ATTGCAGGCA GTAGAGATAT TTGGTTTTTA CTTAGCATCA ATTGATATGC 4260 

GACAAGACTC TAGCGTCTAT GAAGCCTGTG TGGCAGAACT CTTGAAATCA GCAGGAATTC 4320 

ATTCTCGTTA TAGCGAGTTG AGCGAAGAAG AAAAGTGTGA CCTTCTCTTG AAAGAATTAG 4380 

AAGAAGATCC CCGAATTCTT TCTGCGACTC ACGCAGAAAA ATCAGAATTA TTAGCAAAAG 444 0 

AATTAGCTAT TTTTAAGACG GCTCGTGTTT TGAAAGATAA GTTGGGAGAT GATGTCATCC 4500 

GTCAGACCAT CATTTCACAT GCAACCAGCC TTTCTGATAT GCTAGAATTA GCTATTCTGT 4560 

TAAAAGAAGT AGGACTGGTG GATACGGAAA GGGCGCGTGT TCAGATTGTT CCCCTTTTTG 4620 

AAACAATTGA AGACTTGGAT CATTCAGAGG AAACAATGAG AAAATATCTT TCTCTTAGCC 4680 

TTGCCAAAAA ATGGATTGAC TCACGAAATA ACTACCAAGA AATCATGCTT GGCTACTCTG 4740 

ACAGTAATAA AGATGGCGGT TACTTGTCAT CATGTTGGAC CCTCTACAAG GCTCAACAAC 4800 

AATTGACTGC TATTGGAGAT GAATTTGGCG TTAAGGTTAC CTTCTTCCAT GGTCGTGGTG 4860 

GTACTGTCGG TCGTGGTGGT GGGCCAACCT ATGAAGCCAT TACATCTCAA CCGCTCAAGT 4920 

CTATCAAGGA TCGTATCCGC TTGACGGAGC AGGGTGAAGT AATTGGGAAT AAATACGGTA 4980 

ACAAAGACGC CGCTTACTAT AACCTTGAAA TGCTAGTATC GGCAGCTATT AACCGTATGA 5040 

TTACTCAGAA GAAGAGCGAT ACCAATACCC CAAATCGTTA TGAAACCATT ATGGATCAAG 5100 

TAGTGGACCG TAGTTACGAT ATCTACCGTG ATTTGGTCTT TGGTAATGAG CATTTCTATG 5160 

ATTATTTCTT CGAGTCAAGT CCAATCAAGG CTATTTCAAG TTTTAATATT GGTTCTCGTC 5220 

CAGCCGCTCG TAAGACTATT ACTGAAATCG GTGGTTTGCG TGCCATCCCT TGGGTATTCT 5280 

CATGGTCACA GAGTCGTGTT ATGTTCCCTG GATGGTACGG GGTTGGTTCA AGCTTCAAGG 5340 

AATTTATCAA TAAAAATCCA GAGAATATTG CTATCTTACG AGATATGTAC CAAAATTGGC 5400 

CTTTCTTCCA ATCGCTTCTT TCAAATGTTG ATATGGTTTT GTCAAAATCA AATATGAATA 5460 

TTGCTTTTGA ATATGCTAAA CTTTGTGAAG ACGAGCAAGT TAAGGCCATC TATGAGACTA 5520 
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TTTTAAATGA ATGGCAACTT ACTAAGAACG TTATCTTGCC TATTGAACGA CATGACGAAC 
TCTTAGCTGA CAATCCATAT CTAAAAGCTA GTCTGGATTA CCGTATGCCT TACTTTAATA 
TTCTCAACTA TATTCAGTTG GAGTTGATTA AACGCCAACG TCGTGGAGAA TTGTCCAGTG 
ATCAAGAACG ATTGATTCAT ATCACCATCA ACGGAATTGC GACAGGATTG CGTAATTCAG 
GTTGATAATT TTCAAGAGTG AATGCTAAAA GTGAATATCA AAAAAATTCT AATAGACTAT 
TGACAAGTAG TTTAAAAATG ATATAATTTA ACCATTCAGA AAAGTAATCA TACAAACTTT 
TTAGAGAGTC TGTGGTAGCT GAAAACAGAT AAGTGGCAAT GATGAAAATT GGGCTGAATG 
CTATTTAGAA TTTGAAATTA TAAAAATTCG GTAAGCACAC CTTACAGTCC ATCTCGTTAT 
TGCGAGACTG AGCGATAGGG AAATTCCCTA TAATTGAGGT GGTACCGCGC ATCGACGTCC 
TCACACAAGT TTTTTGTGTG AGGATTTTTT TGATGGAGGT TAGTATGGAA AGAAAACGAT 
GGCGTCGCTT GTTTAGATAA CTGAAATATG TTAAAGGAAA TAAAAAGGAG AAACAGAATG 
AAAAATAAAC GTTTAATTGG AATTATTGCT GCATTAGCAG TCTTAGTAGC AGGAAGCTTG 
ATTTATTCTT CAATGAATAA ATCAGAAGCT CAGAATAATA AGGATGAGAA GAAAATAACC 
AAGATTGGTG TGCTTCAATT TGTGACCCAT CCATCCCTTG ATTTGATTTA TAAAGGGATC 
CAAGATGGAC TTGCAGAAGA AGGATATAAA GATGATCAAG TTAAAATTGA TTTTATGAAC 
TCAGAAGGTG ACCAAAGTAA GGTTGCGACA ATGACTAAAC AATTGGTTGC AAATGGGAAT 
GACCTTGTGG TTGGTATCGC AACACCAGCA GCCCAAGGGT TGGCTAGTGC AACAAAAGAC 
CTACCGGTTA TCATGGCCGC TATTACAGAC CCAATTGGTG CTAACTTGGT TAAAGATTTG 
AAAAAACCAG GTGGCAACGT TACAGGGGTA TCTGACCACA ATCCAGCTCA ACAACAAGTT 
GAACTCATCA AGGCTCTCAC ACCGAATGTG AAAACAATCG GAGCTCTTTA CTCAAGTAGC 
GAAGACAATT CAAAA 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6516 base pairs 
{B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6735 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
CTAGAGGATC CCAGCAGGTA AATTGGCTTC AGCTGGCAAA AAAGTTGCCC TCCTTGAACG 60 



CAGCAAGGCT ATGTACGGTG GAACTTGTAT CAACATTGGT TGTATCCCAA CTAAAACCTT 



120 
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GCTAGTTGCT 


GCTGAAAAGG 


ACTTGTCTTT 


TGAAGAAGTC 


ATT GCT ACTA 


AAAACACGAT 


180 


CACTGGTCCC 


CTCAACGGTA 


AAAACTATGC 


GACTGTTGCT 


GGTACAGGCG 


TAGATATCTT 


240 


TGATGCGGAA 


GCTCACTTCC 


TTTCAAATAA 


AGTCATCGAA 


ATCCAAGCTG 


GTGATGAAAA 


300 


GAAAGAACTG 


ACTGCTGAAA 


CAATCGTCAT 


CAACACTGGT 


GCTGTTTCAA 


ACCTCTTGCC 


360 


AATCCCTCGA 


CTTGCTACAA 


GCAAAAACAT 


CTTTGACTCA 


ACAGGTATCC 


AAAGCTTGGA 


420 


CAAATTACCT 


GAAAAACTTG 


GAATCCTTGG 


TGGCGGAAAT 


ATCCGTCTTG 


AATTTGCCGG 


480 


CCTTTACAAC 


AAACTTGGAA 


GCAAGGTCAC 


AGTCCTAGAT 


GCCTTGGATA 


CATTCCTACC 


540 


TCGTGCAGAA 


CCTTCCATCG 


CAGCTCTTGC 


TAAACAATAC 


ATGGAAGAAG 


ATGGCATTGA 


600 


ATTCCTTCAA 


AATATCCATA 


CTACTGAAAT 


CAAAAACGAT 


GGTGACCAAG 


TGCTTGTCGT 


660 


AACTGAAGAC 


GAAACTTACC 


CTTTCGACGC 


CCTTCTCTAC 


GCAACTGGAC 


GCAAACCAAA 


720 


TGTAGAACCA 


CTTCAACTTG 


AAAATACAGA 


TATTGAACTA 


ACTGAACGTG 


GTGCTATTAA 


780 


AGTAGACAAA 


CACTGTCAAA 


CAAACGTTCC 


TGGTGTCTTT 


GCAGTTGCAG 


ATGTCAACGG 


840 


TGGCCTTCAA 


TTTACTTACA 


TTTCACTTGA 


TGACTTCCCT 


GTTGTTTACA 


GCTACCTTGC 


900 


TGGAGATGGC 


AGCTATACAC 


TTGAAGACCG 


TCTCAATGTG 


CCAAATACTA 


TGTTCATCAC 


960 


ACCTGCACTT 


TCACAAGTTG 


GTTTGACTGA 


AAGCCAAGCA 


GCTGATTTGA 


AACTTCCATA 


1020 


CGCTGTTAAG 


GAAATCCCCG 


TTGCAGCAAT 


GCCTCGTGGT 


CACGTAAATG 


GAGACCTTCG 


1080 


CGGTGCCTTC 


AAAGCTGTTG 


TCAATACTGA 


AACAAAAGAA 


ATTCTTGGAG 


CAAGCATCTT 


1140 


CTCAGAAGGT 


TCTCAAGAAA 


TCATCAACAT 


CATCACTGTT 


GCTATGGACA 


ACAAGATTCC 


1200 


TTACACTTAC 


TTCACAAAAC 


AAATCTTCAC 


TCACCCAACC 


TTGGCTGAGA 


ACTTGAATGA 


1260 


CTTOTTTGCG 


ATTTAAGTTG 


AGATTTAATC 


GTATCGAACA 


GCCCTCTTTG 


GGCTGTTTTT 


1320 


ACTTCTGCGG 


AATCTCAAAT 


CTGTCTTTCT 


CCTCTTTTAT 


GATATAATAG 


AAACATGAAC 


1380 


TTAAAAACTA 


CTTTGGGCCT 


TCTTGCTGGG 


CGTTCTTCCC 


ACTTCGTTTT 


AAGCCGTCTT 


1440 


GGACGTGGAA 






GCvCTTCAAT 


TTGATAAAGA 


TATTTTACAA 


1500 


AACCTAGCTA 


AGAACTACGA 


GATTGTCGTT 


GTCACTGGAA 


CAAATGGAAA 


AACCCTGACA 


1560 


ACTGCCCTCA 


CTGTCGGCAT 


TTTAAAAGAG 


GTTTATGGTC 


AAGTTCTAAC 


CAACCCAAGC 


1620 


GGTGCCAACA 


TGATTACAGG 


GATTGCAACA 


ACCTTCCTAA 


CAGCCAAATC 


TTCTAAAACT 


1680 


GGGAAAAATA 


TTGCCGTCCT 


CGAAATTGAC 


GAAGCCAGTC 


TATCTCGTAT 


CTGTGACTAT 


1740 


ATCCAGCCTA 


GTCTTTTTGT 


CATTACTAAT 


ATCTTCCGTG 


ACCAGATGGA 


CCGTTTCGGT 


1800 


GAAATCTATA 


CTACCTATAA 


CATGATATTG 


GATGCCATTC 


GGAAAGTTCC 


AACTGCTACT 


1860 


GTTCTCCTTA 


ACGGAGACAG 


TCCACTTTTC 


TACAAGCCAA 


CTATTCCAAA 


CCCTATAGAG 


1920 
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TATTTTGCTT TTGACTTGGA AAAGGGACCA GCCCAACTGG CTCACTACAA TACCGAAGGG 1980 

ATTCTCTGTC CTGACTGCCA AGGCATCCTC AAATATGAGC ATAATACCTA TGCAAACTTG 2040 

GGTGCCTATA TCTGTGAAGG TTGTGGATGT AAACGTCCTG ATCTCGACTA TCGTTTGACA 2100 

AAACTGGTTG AGTTGACCAA CAATCGCTCT CGCTTTGTCA TAGACGGCCA AGAATACGGT 2160 

ATCCAAATCG GCGGGCTCTA TAATATCTAT AACGCCCTAG CTGCTGTGGC CATCGCCCGT 2220 

TTCCTAGGTG CCGATTCGCA ACTCATCAAA CAGGGATTTG ACAAGAGCCG TCCTGTCTTT 2280 

GGACGCCAAG AAACCTTTCA TATCGGTGAC AAGGAATGTA CCCTTGTCTT GATTAAAAAT 2340 

CCAGTCGGTG CAACCCAAGC TATCGAAATG ATCAAACTAG CACCTTATCC ATTTAGCCTA 2400 

TCTGTCCTCC TTAATGCCAA CTATGCAGAT GGAATTGACA CTAGCTGGAT CTGGGATGCA 2460 

GACTTTGAAC AAATCACTGA CATGGACATT CCTGAAATCA ACGCTGGCGG TGTTCGTCAT 2520 

TCTGAAATCG CTCGTCGCCT CCGAGTGACT GGCTATCCAG CTGAGAAAAT CACTGAAACG 2580 

AGTAATCTGG AGCAAGTTCT CAAGACCATT GAGAATCAAG ACTGCAAGCA TGCCTATATT 2 640 

CTGGCAACTT ATACTGCCAT GCTGGAATTT CGTGAACTGC TGGCTAGTCG TCAGATTGTT 2700 

AGAAAGGAGA TGAACTAATG GTTTATACTT CACTTTCCTC AAAAGATGGC AATTACCCCT 2760 

ATCAGCTCAA CATTGCCCAC CTCTACGGAA ATCTCATGAA TACtACGGGG ACAATGGAAA 2820 

CATCCTCATG CTCAAGTATG TGGCTGAAAA ACTGGGAGCC CATGTGACCG TTGACATCGT 2880 

TTCTCTCCAT GATGACTTTG ATGAAAATCA CTACGACATC GCCTTTTTCG GTGGTGGTCA 2940 

AGACTTTCAA CAAAGTATCA TTGCAGACGA CCTACCTCCT AAAAAAGAGA GCATTCACAA 3000 

CTACATCCAA AACGACGGTG TAGTTCTGGC TATCTGCGGT GGTTTCCAAC TATTCGGTCA 3060 

ATATTATGTT GAAGCTTCAG GAAAACGTAT CGAAGGGCTA GGGGTCATGG GACACTACAC 3120 

GCTCAACCAG ACCAATAACC GTTTTATCGG TGACATCAAG ATTCACAATG AAGATTTCGA 3180 

TGAAACCTAC TATGGATTTG AAAATCACCA AGGTCGTACC TTCCTCTCTG ATGACCAAAA 3240 

ACCGCTGGGA CAGGTTGTCT ATGGAAATGG AAACAACGAA GAAAAGGTCG GTGAAGGGGT 3300 

TCATTATAAG AATGTCTTTG GTTCCTACTT CCACGGGCCT ATCCTCTCTC GTAATGCCAA 3360 

TCTGGCTTAT CGCCTAGTTA CTACTGCCCT CAAGAAGAAA TATGGTCAGG ACATCCAACT 3420 

CCCTGCCTAT GAGGACATTC TCAGCCAAGA AATCGCTGAA GAGTACAGTG ACGTCAAAAG 3480 

CAAGGCTGAC TTTTCTTAAA CAAAGGAAAA TGATATCAAA GAACTCCGTT ATCTTGTCGG 3540 

AGTTTTTTGT CTTTTCTTTT ACCCTTCTCC CTTGCATTTT CTCTCATTTT TTGCCAAAAT 3600 

AGAGGGGTAG AAAGAAGGTA GCATATGTCT AAATTACAAC AAATCCTAAC ATATCTTGAA 3660 
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TCAGAAAAAC TAGACGTCGC TGTCGTATCT GACCCCGTCA CAATCAATTA CCTCACTGGT 3720 

TTTTACAGTG ATCCCCATGA ACGCCAAATG TTCCTCTTTG TCCTAGCAGA TCAGGAACCT 3780 

CTCCTCTTTG TCCCAGCTCT TGAAGTAGAA CGTGCAAGTA GCACCGTTTC CTTCCCAGTA 3840 

GTGGGCTATG TCGATTCTGA AAATCCATGG CAAAAAATCA AACATGCTCT TCCACAACTT 3900 

GACTTCAAAC GTGTCGCTGT TGAGTTTGAC AATCTCATCT TGACCAAATA CCATGGTTTG 3960 

AAAACAGTTT TTGAGACTGC TGAGTTTGAC AACCTCACTC CTCGTATCCA ACGCATGCGC 4020 

CTCATCAAAT CAGCTGATGA AGTGCAAAAA ATGATGGTTG CAGGTCTTTA TCCTGACAAC 4080 

GCTGTTCATG TTGGTTTTGA CAATATTTCT CTTGATAAGA CTGAGACAGA TATCATCGCA 4X40 

CAAATCGACT TTGCCATGAA ACGTGAAGGT TATGAAATGA GCTTTGATAC CATGGTCTTG 4200 

ACTGGTGATA ATGCTGCGAA TCCACACGGC ATTCCAGCAG CTAATAAGGT TGAAAATGAT 4260 

GCTCTTCTCC TCTTTGACCT GGGTGTTCTG GTCAATGGCT ATGCGTCAGA TATGACTCGT 4320 

ACAGTCGCTG TCGGCAAACC AGACCAATTC AAGAAAGATA TTTACAACTT GACTCTTGAA 4380 

GCCCAACAAG CTGCTCTTGA CTTTATCAAG CCAGGTGTGA CTGCTCATGA AGTGGACCGC 44 40 

GCTGCCCGTG AGGTCATCGA AAAAGCTGGT TATGGTGAGT ACTTCAACCA CCGTCTCGGG 4500 

CATGGTATCG GTATGGATGT CCATGAATTC CCATCTATCA TGGAAGGAAA CGACATGGTC 4 560 

ATCGAAGAAG GCATGTGCTT CTCTGTTGAA CCAGGTATCT ATATCCCTGG TAAAGTCGGT 4620 

GTTCGTATTG AAGACTGCGG TGTTGTTACC AAGGATGGCT TCAACCTCTT TACAAGCACC 4680 

AGCAAAGATT TGCTTTATTT TGATTAAACT ATATAGCCCC TATGCTTTCC TTTCAAAATA 4740 

TCTAGGGGCT ATTTTATTGT CATTTTTCTG CTATTATGCT AAAGAAATTG GCTGCAATAA 4800 

TCTAACCCTA AGTGTCTGGA ATGATAACGA GGGTGCTCTC CGCTTTTATC AAAGACAAGG 49 60 

GATGAAACCC CAAGAAACAA CAATGGAAAT GATAATTGAT TAAGAAGTCA TCTATCAAAA 4920 

GATGTTAGAA AAAGTTCAAT TTCACTAGAA AATGAGGAAA ATCTCCCCAC AATAAAACGC 4 980 

ATAGTATCAG GTATTGTGTA CTGACCCCAA ACAGTTAGAC AATTAATTTA TCCGAAGGAT 5040 

TTAGTTCTGT ACTGCACAGG ACTAAGTCCT TTTAGTTTTA CCTTAATTCG TTTGTTGTTG 5100 

TAGTAATCAA TATAGTCTAT AATGACTTGT TCCAATTGGT TAAGTCATTT AAATCTTTTC 5160 

TCATAGCCAT AAAACATTTC GGATTTTAAA ATGCCAAAGA AAGATTCCAT CATACCGTTG 5220 

TCTTGGCTGT TTCCCTTGCG TGACATAGAT GCTTGAATTC CCTTATTCTC TAGGAACCGA 5280 

TGATAAGAAT CGTGTTGGTA TTGCCAGCCT TGGTCACTAT GGAGAATCGT ATTCTCGTAG 5340 

TGCTTCTCTT TGAATGCCTG TTCCAACATT GTTTGTACTT ATTCTAAATT AGGCGAACAA 5400 

GAAAGATTAA AAGCAATAAT TTCGCTGTTA AAGCCATCTA AAACTGGTGA TAAGTAAAGC 5460 
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TTTTGAGTAC TTGCTGGAAT GGCAAATTCA GTCACATCTG TGTAGCACTT TTCCATTGTT 5520 

TTAGAGCCTT CAAATTGGGC TTGAATGAGA TTCTCTGCCT TCTTACCAAC GTCTCCTTTA 5580 

TGAGAAGAAT ATTTTCGTTT CTTTCGCATT TTAGCTTGTA AATTGAGTAC TTTCATCAAG 5640 

CCTTGAACTC TTTTATGATT TACCAGATAA CCACGATTTC TTAGTTCTAA ATGAACCCGG 5700 

CGATAAGCAT AATTTCCCTT GTGTTCGATA AAGATGGATT GAATTTCAGT TTTAAGCTCT 5760 

TGGTCTTTAT CTGTTTTGTC TAGCTGTTTC AAGTGATACT AGTAGGTCCA ACGAGCTAGT 5820 

TTAATGGCTT CTAGAAGAAG ATCTAACGAA AACTCAGTCA TTAATTCTTG AACAATTTCT 5880 

GTCTTTCTTC TTTCTCTTTT TCCTCCTTCA ATCGGAGTTC TCTTAACTTT TTTAGGATGG 5940 

CATTCTCCGC TCTCAGGTAC TCTCCCTCTT GTTTTCTCAA CAATAGTATA CCCGTTTTTC 6000 

CTGTATTGTG CTAGCCAGTT AAGAAGTATC CTACGACTTG GGAGACCGTA TTCAAGAGAA 6060 

ACTCTATCTT TAGTCCAGCC TTCATGTCAG ACTTTATTAA CCCCAATTAT TCACCCCAAA 6120 

TCTAAAAACC ATCCAGAATC CTTGCCTTAG CTTAGATCCT GGATGGTTTC TTTTTTCACC 6180 

CAATGGGTGT TTTTTACTAG ACAAAAAAGA GTTTCCCCTT TATGGTATAA GTGTAGAAAA 6240 

AAACACAAAA AGAAAGGAAA CTCACATGAA CAGTTTACCA AATCATCACT TCCAAAACAA 6300 

GTCTTTTTAC CAACTATCTT TCGATGGACG TCATTTAACC CAGTATGGTG GTCTTATCTT 63 60 

TTTTCAGGAA CTTTTTTCCC AGTTGAAACT AAAAGAGCGG ATTTCTAAGT ATTTAGTAAC 6420 

GAATGAmCAA CGCCGCTACT GTCGTTATTC GGATTCAGAT ATCCwTCTCC AGTTCCTCTT 6480 

TCAACTGTTA ACAGGTTATG GAACGGAATA TGCTTG 6516 
(2) INFORMATION FOR SEQ ID NO: 106: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

TTTTCAACCC ATATCGTGGC TCCTGAATAC TACTTACTGA CAACTATGCT ATCAGAGACT 60 

TCTCTACTTG TTTTCTATAT CATTTTCATC CATAGAAAAC AACTCATCCA CTTGGGACAT 120 

ATCTTTAGCT ATACTGTTCG ATACTCTCTC TTTTCACTTT CCTTTGTAGC AATTTATTTC 180 

CTGATTAATT TCGTGTATCC TGTAGATATG GTCATTAATT TGCCATTTTT GATTAATACT 240 

GGTTTGATTG TCTTGCTATC AGCTATCTCT TATATTAGTC TACTTGTCTT CACAAAAGAT 300 
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AGCATTTTCT ATGAATTTTT AAACCATGTC CTAGCCTTAA AAAATAAATT TAAAAAATCA 360 

TAGGAGTTTA AAATGAAACA ACTAACCGTT GAAGATGCCA AACAAATTGA ATTAGAAATT 420 

TTGGATTATA TTGATACTCT CTGTAAAAAG CACAATATCA ACTATATTAT TAACTACGGT 480 

ACTCTGATTG GGGCGGTTCG ACATGAGGGC TTTATCCCTT GGGACGACGA TATTGATCTG 540 

TCCATGCCTA GAGAAGACTA CCAACGATTT ATTAACATTT TTCAAAAGGA AAAAAGCAAG 600 

TATAAGCTCC TATCCTTAGA AACTGATAAG AACTACTTTA ACAACTTTAT CAAGATAACC 660 

GACAGTACGA CTAAAATTAT TGATACTCGA AATACAAAAA CCTATGAGTC TGGTATCTTT 720 

ATCGATATTT TCCCTATAGA TCGCTTTGAT GATCCTAAGG TCATTGATAC TTGTTATAAA 780 

CTGGAAAGCT TCAAACTGCT GTCTTTCAGT AAACATAAAA ATATTGTCTA TAAGGATAGC 840 

CTTTTAAAAG ATTGGATACG AACAGCCTTC TGGTTACTCC TTCGACCGGT TTCTCCTCGT 900 

TATTTTGCAA ATAAAATCGA GAAAGAAATT CAAAAATATA GTCGTGAAAA TGGGCAATAT 960 

ATGGCTTTTA TCCCTTCAAA ATTTAAGGAA AAGGAAGTCT TCCCAAGTGG TACCTTTGAT 102 0 

AAAACAATCG ATTTACCCTT TGAGAATTTA AGCCTTCCTG CACCTGAAAA ATTTGATACT 1080 

ATTTTGACAC AATTTTATGG AGATTATATG ACCCTACCAC CAGAAGAAAA ACGCTTCTAC 1140 

AGTCATGAAT TTCACGCTTA TAAATTGGAG GATTAGGATC CAATATTTAG AAAAAAAAGA 1200 

AATTAAAGAA ATTCAACTAG CCCTGCTGGA CTATATTGAT GAGACTTGTA AGAAACATGA 1260 

TATTCCTTAT TTTCTCAGTT ATGGAACCAT GCTTGGACCC ATCCGCCACA AAGGTATGAT 1320 

TCCTTGGGAT GATGATATTG ATATTTCCCT TTATCCTGAG GATTATGAGC GTTTACTGAA 1380 

GATTATTGAA GAAGAAAATC ACCCTCGCTA CAAGGTTCTT TCCTACGATA CATCTTCTTG 1440 

GTACTTCCAT AATTTCGCAT CGATTTTGGA CACTTCTACT GTTATAGAAG ACCATGTTAA 1500 

GTACAAGCGT CATGATACCA GCCTTTTCAT CGATGTCTTC CCAATTGATC GATTTACAGA 1560 

CTTGAGCATT GTCGACAAGA GCTATAAGTA TGTGGCTCTT CGTCAACTAG CTTATATCAA 1620 

AAAATCACGA GCAGTTCACG GTGATAGCAA ACTAAAAGAT TTTCTTAGAT TATGTAGCTG 1680 

GTACGCTCTC CGATTTGTCA ATCCTCGCTA CTTTTACAAG AAAATTGATC AACTAGTCAA 1740 

AAATGCTGTA ACCAACACTC CTCAATATGA AGGAGGAGTT GGGATCGGTA AGGAAGGGAT 1800 

GAAAGAAATC TTCCCAGTTG ATACCTTTAA AGAACTGATT TTAACTGAGT TTGAGGGCCG 1860 

TATGTTGCCT GTTCCCAAAA AATATGACCA ATTTTTAACC CAGATGTATG GCGATTATAT 1920 

GACACCACCA TCAAAAGAAA TGCAAGAGTG GTATAGTCAT AGCATTAAAC CTTATCGCAA 1980 

AAACTGATTG AGGGGGATTA TACAAACTAC TAAGATAGAG GTTATTCAAA AACATAATTT 2040 

TAGTAGAAAA TGAAATACAT ATTCCCACAA TAAAACGCAT CATATCAACG TTTTTGAAAA 2100 
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ACCTTGATAT GATGCGTTTT ATAATTTTAA AGACTTTTTT CTATAGTAGA TTGAAATAAG 2160 

ATGCGAACAA ATCAATTAGA AAATTCAAAT TAATTTATAG AAATATTTTA GTATTCCTGT 2220 

GTACTGTTCT AAATTCAGTC TGCTATATCT TATTTTTCTA TTTAAATCGC TTCTGTAACA 2280 

AAGCTACGAC TTTCAAGTAC CTTAAGCATG GCATTAGCTG TATCTAGCGC TGTGAAGAGG 2340 

GGCACCCCGT GTTCAATGGC TGAACGACGA ATTTGCTCAC CATCTTCGTC AGCAGTTCGT 2400 

TTTGTTCCTA CTGTGTTAAT GATAGCTTGA ATTCTTCCTT TGCGTACAAA ACTTGGGATA 2460 

TCCTTATCGT CATCACCAAT CTTACCAACA GGTTGGGCTT GGAAGCCATG ACTAGCAAAG 2520 

AAGGCTGCTG TCCCTTCTGT CGCAAGGATT CCATAACCAA TGTTTTGGAA ACGACGAGCC 2580 

AAGTTCAAGG CTTCTTCTTT GCCATCATCA GCGATGGTAA AGACGACATT ACCAAAAGTT 2640 

GGCAAGTGTA GATAAGAAGC TTCAAAGGCT TTATAGAGAG CTTTTTCCAA AGTAGCATCA 2700 

GAACCCATAA CTTCACCTGT TGACTTCATT TCAGGACCGA GCAAGCTGTC TACCTTAGCT 2760 

AGTTTGGTAA AGGAGAAGAC ACGTGCCTTG ATATGAACAC GGGTGCTTTC AGGGTAAAGT 2820 

CCATTTTGGT AGCCAAGTTC TGATAAACTT TGACCAAGAA TGAGTTTGGT CGCTACTTGA 2880 

GCCATAGGAA TATTGGTTAC CTTAGATAGG AATGGAACAG TACGGCTGGC ACGTGGATTG 2940 

ACCTCAATAA CGTAGACTTT TTCATCCTTG ATAACAAACT GGATGTTCAT CATTCCAACG 3000 

CAGTGAAGAC CGATTGCTAA GCGTTTGGTG TAGTCTGCGA TGGTCTCCTG AACCTTTTGC 3060 

GACAAGGTTT GTGGTGGGTA AACAGCCATT GAGTCACCTG ACTGGACACC AGCACGTTCG 3120 

ATATGCTCCA TGATACCAGG AATGAGTACA TTTTTACCAT CTGAAATGGC ATCAACTTCG 3180 

CACTCTTGCC CAACGATATA AGAGTCGACA AGAACTGGGT GGTCTGGACT AGCCTTAACA 3240 

GCAGTTCGCA TGTAAGAACG AAGGTCTTCT TCGTTTTCAA CGATTTCCAT GGCACGTCCA 3300 

CCAAGTACAT AAGATGCGCG GACAAGAACT GGGAAGCCAA TCTTGCGAGC TCCAAGAGCT 3360 

GCTTCTTCTT CATTGGTAGC CGTTTGTCCT GGTGGCTGTG GAATATCCAA TTCTTTGAGA 3420 

GCTTGCTCGA AGAGGTCACG GTCTTCGGCA CGATCTAGGT CAGCAACCTG TGTACCAAGG 3480 

ATGGTCACAC CTGCTTTTGC CAATGGCTCC GCAAGGTTGA TGGCTGTTTG ACCACCGAAC 3540 

TGAACGATAA CTCCCTTTGG TTGTTCCAAG TCAATGACGT TCATAACATC TTCGAATGTC 3600 

AATGGCTCAA AGTAAAGCTT ATCTGATACA GAGAAGTCTG TTGAAACGGT CTCTGGGTTT 3660 

GAGTTCATGA TGATAGCTTC ATAACCAGCT GCCTGGATAG CCTTAACAGA GTGAACGGTT 3720 

GCGTAGTCAA ACTCAACCCC TTGACCGATA CGGATTGGAC CTGAACCTAG GACAAGTACA 3780 

GATTCTTTAT CAGATCTGAT AGATTCATTT TCCCAACCAT AGGTTGAATA GAAATATGGC 3840 
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GTTTCGGAGT CGAACTCTGC CGCACAAGTG TCTACCATCT TATAAACTGG AACAATCTTG 3900 

TTTTCCAAGC GAAGTTGGCG AACTTTATCA TCAGTCGTTC CCCAGACTTC AGCAATCTTA 3960 

CGGTCTGAAA AACCATTAAG TTTGGCTGTT TTCAAAACTT CTAAATCTTG TGGATGAGCA 4020 

CCCAATTCTT GCTCAATTTC AAAGATATGC AAGAGTTTAT CAAGATAGAA GATATCAATT 4080 

TTTGTAAGCT CTGCAATTTC TTCAGGTGTG TAGCCACGAC GAATGGCTTC TGATACGTAG 4140 

AAGAGACGGT CATCTTGCGC TTTGACAACC TTTTCAATCA AGGCATCATC AGAAACTGCT 4 200 

GCAAGTTCAG GTATTTCATT GTGGTGCACC CCAATTTCAA GGGAGCGGCA GGCCTTGAGA 4260 

AGAGATTCCT CGATGTTACG ACCGATTGCC ATGACTTCTC CAGTCGCCTT CATTTGTGTA 4 320 

CCGAGACGGC GTTCACCCTT TTCAAACTTG TCAAATGGGA AACGTGGAAT CTTAGCAACT 4 380 

ACGTAGTCAA GGGCTGGTTC AAACATGGCA TAGGTTGAAC CTGTAACTGG GTTTATAACC 4440 

TCATCCAAGG TCAAACCTAC TGCAATGTTG GCAGCCAACT TAGCAATCGG ATATCCTGTC 4500 

GCTTTAGAAG CAAGGGCTGA CGAACGTGAT ACACGAGGGT TTACTTCGAT AACATAATAC 4 560 

TTGAAGCTGT TAGGATCAAG AGCTAGCTGA ACATTACATC CACCTTCAAT CTTGAGGGCA 4 620 

CGAATAATGC TCAAGCTCGC ATCACGAAGC ATTTGGTTTT CATAGTCTGA CATGGTTTCC 4 680 

GCAGGGGCAA ATACAATGGA ATCCCCTGTG TGAATCCCAA CTGGGTCAAA GTTTTCCATG 4740 

TTACAAACAA CCAAGCCATT GTCAGCTGAG TCACGCATCA CTTCGTATTC AATTTCCTTG 4800 

AAACCGGCAA TCGAACGCTC AATCAAACAT TGGGTAACAG GTGACAATTT CAAACCATTT 4 860 

TCAGTGATTT CACGCAATTC TTTCTCGTTG GCACACATAC CACCACCAGT ACCACCAAGG 4920 

GTAAAGGCTG GACGAACGAT GACTGGGTAG CCAATTGTCG CTGCAAAGGC AACTGCTTCT 4 980 

TCTACTGTGT TAACAATTTC AGATTCTGGA ATGGGTTGTT CAAGCTCTTC CATCAATTGT 5040 

TTAAAGAGGT CACGGTCCTC CGCTTGGTCA ATGGCAGATA ATTTGGTACC CAGAAGTTCA 5100 

ACGCCAAGCT CGTCTAGGAT ACCATTTTTA GATAATTCCA TGGCCATGTT GAGACCTGTC 5160 

TGACCACCGA GTGTTGGTAG CAAGCCATCT CGACCTTCCT TACGAAGAAT ACGTGTCACA 5220 

AACTCAAGTG TAATCGGTTC AATGTAAACC TTGTCAGCAA TTTCCTTGTC CGTCATGATG 5280 

GTTGCAGGAT TTGAGTTAAC CAAAACAACC TCATAACCTT CCTCTTTCAA CGACAAGCAA 5340 

GCCTGAGTCC CAGCGTAGTC AAACTCAGCA GCCTGACCAA TAATAATCGG ACCAGAACCA 5400 

ATCACCATAA TTTTTTGAAT ATCAGTACGT TTAGGCATAT ATAAGATATT AAGGGTGTCA 5460 

AGCGGACAAA GCTAAAATAG GAGTTATGAC GAAGAACTGT CAGTTCTAGG AATAACTATC 5520 

TTTTTAGCAC CGTCCGTAGC CCGTATTCAG TTCAGCAAAT ACGGAGCACC CTTCTCCTTT 5580 

CTATTCGTCG CCTCTCAGGG CGACATTAAA TAAGATACAA AGGACGAATA GAAAGCGATT 5640 
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GAATTTTAGG 


AAATCAAGGA 


AGGATTGACA ATCCAAGTTG 


GTTTCTCTAC 


ATTCTGAGCT 


5700 


TTCCGTCCGT 


GTTCAGTTAC 


ATAAATTCTC 


CGACGAGCTT 


TTACTCGTTC 


TTAGTTTGAT 


5760 


TGTTTAAAAA 


CTTCCATCAT 


CTCGATAAAC 


TCGTCAAATA 


GGTAGCTAGC 


GTCGTGTGGC 


5820 


CCAGGAGCTG 


CATCTGGGTG 


GTATTGAACA 


GAGAAAGCAG 


GTTGGTATCT 


GTGGCGCACA 


5880 


CCTTCCACTG 


ACTTGTCATT 


GATTTCTTCG 


TGGGTAATAA 


TCAAGTGCTC 


TGGCAAATCC 


5940 


TCGCGGCTGA 


CTGCATAACC 


ATGG'rtC'rGG 


CTGGTGAAGT 


CTACTCGTCC 


TCTTGCGATT 


6000 


TCACGTACCG 


CATGGTTGAA 


TCCACGGTGG 


CCAAACTTCA 


TCTTATAGGT 


CTTAGCCCCG 


6060 


TTTGCCATTG 


CAAAGAGTTG 


GTGTCCCATA 


CAAATACCAA 


AGATTGGAAT 


TTTTCCTTGT 


6120 


ACACCGCGAA 


TCATGTCGAG 


TGCTTGTGGA 


ACGTCTTCTG 


GCTTACCTGG 


ACCATTTGAC 


6180 


AACATAACTC 


CGTCAGGATT 


GAGATGGAGA 


ATTTCTTCAG 


CCGTTGTCGA 


ATAAGGAACA 


6240 


ACTGTCACGT 


TACAGTTGCG 


TTTAGAAAGT 


TCACGTAGGA 


TTCAGTCCTT 


GAGACCAAAG 


6300 


TCCACTAGCA 


CCACCCTCAA 


ACCAACTCCT 


GGAGCTGGAT 


AAGACGTTTT 


AGTAGAAACC 


6360 


TGTTTGATAT 


TGTCTGTCGG 


TAAAACTGTT 


GCTTGGAGCT 


CGTCCGTCAC 


ATGGTCCATA 


6420 


CTGTCCCCAA 


CATGGGTCAA 


GGTTGCACGC 


ATAGTACCAT 


GCTTACGGAT 


AATCTTGGTA 


6480 


AGAGCACGCG 


TATCAATTCC 


TGAAATCCCT 


GGAATTTTCT 


TGGCTTTCAA 


AAATTCATCC 


6540 


AAGGTCATTT 


GGTTGCGCCA 


GTTGCTAGCT 


CTACGCGCTT 


CTTCAAAAAC 


AACGACTCCC 


6600 


TTACAAGTTG 


GAATAATGGA 


TTCATAATCA 


TCACGATTAA 


TACCATAATT 


TCCTACCAAA 


6660 


GGATAAGTAA 


AGGTCAACAT 


TTGTCCATTA 


TAAGACTGGT 


CTGTAATGGA 


TTCTTGGTAG 


6720 


CCGGTCATCC 


CTGTATTAAA 


GACGATTTCG 


CCTGTTACAT 


CAATATCTGC 


TCCGAAGGCC 


6780 


TTGCCTTCAA 


AAACTGTGCC 


ATCTTCTAAT 


ACTAGAATTC 


TTTTTGTCAT 


ATTTTCACCT 


6840 


CTCGTGGACG 


CTCACTGGCG 


TCTTTTAACG 


TCTTGTGTTT 


TACTTGGCCT 


TTCTACTCGC 


6900 


TAGTACGGAT 


TCTAAGATTG 


CCATTCGAAC 


AAAGACACCA 


TTGGTCATTT 


GTTGGACAAT 


6960 


CCGTGATTTT 


GGTGCTTCAA 


CCAAGTGGTC 


TGCTATTTCT 


ACATCACGAT 


TGATTGGAGC 


7020 


TGGGTGCATG 


AGGATTGCTG 


TTTCTTTCAA 


ACGATCGTAA 


CGTTCTTGAG 


TCAACCCATG 


7080 


TTGGGCATGG 


TAGTCTTCTT 


TTGAAAATAC 


AGCTCCACTA 


TCATGGCGTT 


CGTGTTGCAC 


7140 


ACGGAGAAAC 


ATCATGACAT 


CAACCTGATC 


AATGATTTCA 


TCAATGGTTA 


CAAACTGTCC 


7200 


ATAGTCTGCA 


AACTCTTGAC 


TTCTCCATTC 


CTCAGGTCCA 


GCGAAAAAGA 


GTTCAGCTCC 


7260 


CAAGCGTTTC 


AAAATCTGCA 


TATTGGATTT 


GGCAACGCGT 


GAGTGGTCCA 


AGTCACCTGC 


7320 


AATAGCAACT 


TTAAGACCCT 


CAAAGTGGCC 


AAATTCCTCA 


TAAATGGTCA 


TCAAATCAAG 


7380 
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CAAGCTCTGG CTAGGGTGTT GGCCCGAACC ATCTCCACCA TTGATGATGG AAGTCGTAAT 74 40 

CGTTGGACTA GCAATCAATT CTCTATAGTA GTCGACCTCT GGATGGCGAA TCACACAGAC 7 500 

ATCCACTCCT AAAGCAGACA GAGTCAAAAT GGTGTCATAA AGTGTCTCAC CCTTATTAAC 7560 

CGAGCTAGTC TTCACATCAA AGTCAAGTCG TTCCAATCCA AGTTTAATCT CTGCGACTTC 7620 

AAAGGACTTA TGTGTCCGTG TAGAATCCTC AAAGAAGAGA TTGGAAACAA TCGGATGGTC 7680 

TTCATAGGGA AGCTGGGCTC CATTTTTAAA CTCAATTCCT CGCTTGATCA ATTTCATTAC 7740 

TTGATCGACA GTGAGGTCTT CCATGGACAC CACATGGTTC AATGCTTGTT GATTTTCTGA 7800 

CATGGCTACT CCTTTAACTT TCTAAGCTTC TTCAGTAATC AGAACTCTGT CTTGGTCATC 7860 

AAGTTCTGTC ATCTCTACGA TGATTTCTTC AGAACGACTG GTTGGGATAT TTTTTCCAAC 7920 

CTAATCTGGA CGGATTGGCA ATTCTCTATG TCCACGATCG ACTAGAACTG CTAAACTCAC 7980 

ACGCGCAGGA CGACCATGAC CGACAATATT ATCAATAGCA GCACGGATGG TACGACCTGT 8040 

ATAGAGCACA TCATCCACCA AGATAACTTC GCGGTCTGTC ACATCGACAG AAACCAAAGA 8100 

AGTATCTTCT CCACTTTTAA CAT CATC ACG GAAAGCTTTA GTATCCAATT CCACAACAGG 8160 

AACTGAAAGA TTTTCTAACT GCTTCAAACG TTCTTGGATT CGGTGGGCAA TAAAGACACC 8220 

ACGAGTTTTA ATACCAGCCA AGACGATCTT ATTCAAATCT TTGTTGCGTT CGATAATCTC 8280 

ATAAGTAATA CGCGTAATCG CTCGTTTGAC GGTCAATTCG TCTACAACTT CTTTTGTTTT 8340 

CATGACAAAC CTCCAAAAAG AAAAGTCTCC TTAAACAAGG AGACTTGAAA TTTATAGCCA 8400 

AGCGAGCCCT ACTGCACACA GTATAGACTT CACCCTTCTA CTTTATCGCG CTCCTTGCCT 84 60 

GCCTCACGGG ACAGGTTTAA AGGAATATTT AGTTATCATT TACTATACCA CAAAGCATGC 8520 

TTAAAATCAA GCAAAAAGTT TCAATGTAGC ATCTTACAAA TTGCTAAAAT CATATAATTG 8580 

TGGGTACTGG TCACACTCTG GATTTTTTGG ATGGCAAATG GCTCTTCCAA AATAAATCAT 8640 

GCCCTGATGG GCAGCTAACC ACTGCTCAGG CGGCAAGATA TCCATCACCC GCTTTTCCAC 8700 

CTCAAGTGGC GTCGCTGATT TTTTGACAAT ATCGTGGTGT TTGCAAATAC GCTCCACATG 8760 

AGTATCCACT GCAAAGGCTG GAATTCCAAA TCCTACACTC ATGACAACAT TGGCTGTCTT 8820 

GCGACCAACA CCTGCCAAAC TCTCCAATTC TTCACCTGTC TGAGGGACTT GACCATCAAA 8880 

ATCGTCTAGT AACTGTTGGG CACATTTTTT AAGGAATTTA GCTTTATTCC GATACAATCC 8940 

CAAGCGAGAA ATATGTGAAG CAATCTCACT CTCTGTCGCT ACAGACATAG CTTGGGGTCT 9000 

TGGAAAGGCA ACAAAGAGAC CTGGTGTGGC CTTATTTACC GCTGCATCTG TCGTCTGGGC 9060 

TGATAACATG ACCGCAACCA GGAGTTCAAA ATGATTGGTA AAATCAAGAC TAGGCTTGGC 9120 

ATCTGGGAAG AGGGCAATGA TTTCTTCTAG CACCTTTCGT GCTCGTTTTT TTGACAAGAC 9180 



WO 98/18931 



PCT/US97/19588 



78*7 

CATTATTCAT CTCCGTCAAA TAGTCCTTGT AAGCCAGCAA AAGGACTGTT TTCTTCTTTC 9240 

TTTACTGCTT TTTGAGCTTG GTATTCTTCC TCTGTCATGA TTTGCCAGTC ATTTCCTGAG 9300 

ATAAATCCTT GACCAGCTTC TTCTTCAGCC GTCAAGACCT TGATAGGAAT GTTTAGCAGG 9360 

ATATTGTCTG ATACACTCTC AGCAAGGTCA AGCTCCCCAT TTTCGATGGG CAAGACCAAG 9420 

TCATCATCTA AAACTTCTTG ATCTAGCTGG TTAGTTGCGC CTTCCATGAA AACTTCCGTG 9480 

ACTGGATAAG ATTCAACTAA CTCAACTGGC TCCATACTGC GACTCGACGC AAGAACAATG 9540 

GTATAAGATA GTTGATAATC TAAGAAATAC ATACGGTCTT CATATTGTAC TTTCCCAACT 9600 

GCAAGGATAT CTTTTACATC TAAAATTTCT TGATTACGTG CACGCAGGTC ATCAACTAAA 9660 

TCTAACGTTT GTTCAAAGTT CAAACCTTCA GACTGCTTAC GAATTTCTTG AATATTTAAT 9720 

TTCATACTTC CTCCATAAAG ATTTACTCTC TTGATTATAC CATGAAAAGG CTACAAATCA 9780 

GCACACCAAA CTTTGTAATT AAAATTCAAA ATTTTAACAT ATTTACTATG ATAGTTTTAT 9840 

TTTTTACTCC TATACTATAG GGAAAGAGTA CATCAGATCA AGGAGGATGC TCACATGGAA 9900 

GACAAGAAAC TCATTCAACT CCTATCCAAG TTAAATAAAA GCTACCAAAA CTGTAAACAG 9960 

GGTACGGCAG ATGATATTCG ACTACAAGAG CTGCTAAACA CTACTATGCA AGAGCTCAAA 10020 

AAAACGGAAC AGTTGAACAA CAGTATCTTA ATTGATCTTG AGAAATTTTA CCAACCTACC 10080 

AGTCTTCTGA TTGGACTGGG TAGCCTAAAA CTAAACGATC AAGCACCCAC TGCTTGGCGA 10140 

AACTATGATA AATTCCATTA CGATCATGTC AAACACGTAC TAAGTCTCTA TGGACCTGTT 10200 

TTTGAATTTT AGAGCATAGA ATTTCCAGTT TTCTGTTGAC AAAATTTCCT TAAACGTATA 10260 

ATATAAAGAT ACTAATACTC GGAGGTAAGG GAGACATGAA CAACTAAGTC TATCAAATAA 10320 

AGAACCTTTA TTTAGTAGAT CTTGTTTTTG TCTCTTTTTG TGTGCTCTTT TATGCTCTTT 10380 

TTCTGGCATG TTAATACAGT TTTTTTGACA TAGACTTTGG GCTCTACTAC GTAAACTAGA 10440 

CCTTTTTCTT ATGCACTATG AACATTCTAG AAAGGGAAAT CATATGATAA AAATCAATCA 10500 

TCTAACCATC ACACAAAACA AAGATTTACG AGATCTTGTA TCTGACCTAA CCATGACCAT 10560 

CCAAGACGGG GAAAAGGTTG CTATTATTGG TGAAGAAGGA AATGGCAAAT CAACCTTACT 10620 

TAAAATTTTA ATGGGGGAAG CTTTGTCTGA TTTCACTATC AAGGGAAACA TCCAATCTGA 10680 

CTATCAGTCA CTGGCCTACA TTCCTCAAAA AGTCCCTGAG GACCTAAAAA AGAAAACTTT 10740 

ACACGACTAC TTCTTTTTAG ATTCTATTGA TTTAGACTAC AGTATCCTCT ATCGTTTGGC 10800 

GGAGGAATTG CATTTTGATA GCAATCGTTT CGCAAGTGAC CAAGAGATTG GCAATCTATC 10860 

AGGGGGCGAA GCTTTGAAAA TTCAGCTTAT CCATGAGTTA GCCAAACCCT TTGAGATTCT 10920 
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ATTTTTAGAT GAACCTTCAA ATGACCTAGA CCTTGAGACA GTTGATTGGC TAAAAGGCCA 10980 

GATTCAAAAG ACCAGGCAAA CCGTTATTTT CATTTCCCAT GATGAAGACT TTCTTTCTGA 11040 

AACGGCAGAC ACTATTGTTC ACTTGCCACT GGTCAAACAC CGTAAAGAAG CGGAAACGCT 11100 

AGTAGAGCAT TTAGACTATG ATAGCTATAG TGAGCAGAGA AAGGCTAATT TTGCCAAACA 11160 

AAGTCAGCAA GCTGCTAACA ACCAAAGAGC CTACGATAAA ACCATGGAAA AACATCGGAG 11220 

AGTTAAGCAA AATGTAGAAA CTGCGCTTCG AGCTACCAAA GATAGTACTG CCGGTCGCCT 11280 

ATTGGCTAAA AAGATGAAAA CTGTCCTCTC ACAAGAAAAA CGCTACGAAA AGGCAGCTCA 11340 

GTCCATGACT CAAAAGCCAC TTGAAGAGGA ACAAATCCAA C TT TTLT r TT CACACATCCA 11400 

ACCATTACCA GCTTCTAAAG TCTTAGTCCA ACTGGAAAAA GAAAATTTGT CCATTGACGA 11460 

CCGAGTTTTG GTTCAAAAAC TACAACTAAC TGTCCGTGGC CAAGAAAAAA TCGGTATTAT 11520 

CGGGCCAAAT GGTGTTGGGA AATCAACTCT GTTAGCCAAG TTACAGACAC TTCTGAATGA 11580 

TAAAAGAGAG ATTTCACTTG GTTTTATGCC ACAAGATTAC CACAAAAAAC TGCAATTGGA 11640 

TTTATCCCCA ATAGCCTATC TCAGTAAAAC TGGGGAAAAA GAGGAACTAC AGAAAATCCA 11700 

ATCTCACCTA GCTAGTCTCA ATTTCAGTTA TCCAGAAATG CAGCATCAAA TTCGCTCCTT 11760 

ATCTGGCGGA CAACAGGGAA AACPCCTGCT TTTGGATTTA GTCCTGCGCA AACCAAACTT 11820 

TCTCCTGCTG GATGAACCCA CACGAAACTT TTCTCCCACT TCTCAACCCC AAATCAGAAA 11880 

ACTCTTTGCT ACCTATCCAG GCGGTCTCAT CACTGTTTCG CATGACCCTC GTTTCTTAAA 11940 

AGAAGTCTGC TCGATCATCT ATCGCATGAC AGAACACGGT TTGAAGCTAG TTAATTTAGA 12000 

AGATTTATAA ATTTGCAACA TAGCAAAAAT CCAGAGACGA CCTCTGGATT CTTTTACATC 12060 

TGTTTTAAAC GTTCAATCCG TTCTGAGATA CGTGGCTGCG TATAAAAGAG TTTTTGGAAC 12120 

CCCCCACCTT TCTTAGGATC ATTGATATAA AGGGCACTGC TAGCATCATC GACGTGGCGA 12180 

CTCATAGGTT TGCTATTGTC CAACTTATCT AGGGCATTAA TCATTCCCTG GGGATTGCGA 12240 

GTCAGCTCGA CACTAGATGC ATCTGCCAGA AATTCCCTCT GACGAGAAAT AGCGAGCTGA 12300 

ACCAAGGTTG CAGCGAGAGG TGCCAGTACA ATAGCTAGTA GGGAAACCAC TAGCATAATG 12360 

ATTTCAAGAC CATTTCCATC TCGGTCATCA TCACTTCGTC TGCGACCTGC TCCACCCCAC 12420 

CACATCATAC GACCTCCCAT ACTAGAAAGC ATGGTGATAG CACTAGCAAG GGCAACTGCA 12480 

ATAGTCGAAA TACGGATATC ATAATTACGA ATATGACTGA CTTCATGTCC CATAACAGCT 12540 

TCTAGTTCTT CACGATTCAT GATAGCTAGT AGACCTGAAG TCGCAGCAAC AGCCGCATTT 12600 

TGAGGATTAG AACCTGTCGC AAAGGCATTT AAGGCTGGAT CATCAATGAT GAAAACACGG 12660 

GGCATAGGAA TCTGAGCGAC CAGAGCCATA TCTTCCACTA CATGGTAGAG GTCTGGTGCC 12720 
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GTTTGCTCAT 


CCACCTCACG 


CGCTCCATTC 


ATGGACATGA 


CAATCTCTGT 


CGATTGAAAA 


12780 


ATCATAGACA 


AAGCGTAGAT 


AAAGCCGATA 


ATCAGTGCAA 


TAACCAAACC 


ACCAAGTCCA 


12840 


GATCTTATAA 


AGAGATAACC 


AACCGCATAA 


CCAACAAGAG 


CTAAGAGTAG 


GAAAAATACC 


12900 


AGCAACAAAA 


TCCAGGTTTT 


TCGTTTATTG 


CTTGCAATTT 


GATCAAACAA 


CATCTTAGTC 


12960 


ACCTAAACCG 


CTAAAATCAA 


CTTTAGGAAC 


CGACTTTTCC 


TCTTCAGGTG 


TTTGAAGGAA 


13020 


ATCTGCCCCT 


TTAAATCCAA 


ACATTCCAGC 


GATAATATTG 


CTCGGGAAAG 


TTTCTAATTT 


13080 


TACATTGTAG 


TTGCTGACAA 


CACTGTTATA 


GAGTTGACGA 


GAGTAAGAAA 


TTTTATTTTC 


13140 


TGTGTTTGTC 


AACTCCTCTT 


GCAATTTAAC 


AAAGTTAGCA 


CTAGCTTTCA 


AATCTGGATA 


13200 


GCTTTCTGCA 


ACTGCAAAAA 


TACCTGAAAC 


CTGACGAGTG 


AGGGCATCAC 


TGGCTTTCAT 


13260 


AGCTTCTGCT 


GGTGAAGTCG 


CTGCCGCCAC 


TTGGTTACGT 


AGTTCTGCCA 


CCTTTTCAAG 


13320 


GGTAGAACCT 


TCATATTTGG 


CATAACCTTT 


TACAGTCTCA 


ATCAAGTTTG 


GCAAGAGGTC 


13380 


ATTGCGACGT 


TTCAACTGAA 


CATCAATCTG 


ACTCCAAGCC 


TCCTTGGTTT 


GCATACGATT 


13440 


TTTAACCAAA 


CCGTTATAGC 


TAACAATCAC 


AAAAATAACA 


ATAAGAGCCA 


TAACTCCAAG 


13500 


AATAATCCAA 


GTCATAATAT 


AAGTCCTTTC 


TGCTTTTAGA 


TTAGTACCAG 


TATATCAAAT 


13560 


TTTCTATGAT 


TGTGGTAAAA 


TAAGATGATA 


CTAAAGAAGG 


AAATAACTAT 


GAAACCAAAA 


13620 


ACATTTTACA 


ACTTGCTTGC 


CGAGCAGAAT 


CTTCCACTTT 


CGGACCAGCA 


AAAAGAACAA 


13680 


TTTGAACGTT 


ATTTTGAGCT 


CTTGGTCGAG 


TGGAATGAGA 


AGATTAATTT 


GACGGCGATT 


13740 


ACGGACAAGG 


\AGAAGTTTA 


TCTCAAACAT 


TTTTACGATT 


CGATTGCACC 


CATTCTTCAA 


13800 


GGTTTGATTC 


CCAATGAAAC 


TATCAAACTT 


CTTGATATCG 


GGGCTGGGGC 


AGGATTTCCT 


13860 


AGTCTACCAA 


TGAAAATTCT 


CTATCCGGAG 


TTACATGTGA 


CCATTATTGA 


TTCACTCAAT 


13920 


AAGCGCATCA 


ACTTCCTACA 


ACTCTTGGCT 


CAAGAACTGG 


ATTTGAACGG 


AGTTCATTTC 


13980 


TACCACGGAC 


GTGCCGAAGA 


TTTTGCCCAA 


GACAAGAACT 


TCCGTGCTCA 


ATATGATTTT 


14040 


GTAACAGCTC 


GTGCGGTTGC 


CCGTATGCAG 


GTCCTATCTG 


AATTGACTAT 


TCCCTACCTT 


14100 


AAGGTTGGTG 


GCAAACTATT 


AGCACTCAAG 


GCTAGCAATG 


CGCCTGAGGA 


ATTATTAGAA 


14160 


GCTAAGAATG 


CCCTCAATCT 


CCTTTTTAGT 


AAGGTCGAAG 


ACAATCTCAG 


TACGCCCTAC 


14220 


CGAATAGAGA 


TCCGCGCTAT 


ATCACAGTGG 


TAGAAAAGAA 


AAAAGAAACA 


CCAAATAAAT 


14280 


ATCCACGTAA 


GGCTGGTATC 


CCAAATAAAC 


GCCCACTTTA 


AATTTTTTAG 


TAAACAAATG 


14340 


TTTACAAAAT 


CAGCCTCGCT 


CTTTTATTTC 


TAGGCTCGGG 


AAAAAATGAT 


TTACAAAATC 


14400 


AGCCTCGCTC 


TTTTATTTCT 


AGGCTCGGGA 


AAAAATGATT 


TACAAAATCA 


TTTTTTTCTG 


14460 
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CTATACTATC CTAAGCAAAG GTTTTTAATG TCATCCCGTG AGGTGACGAA GACGCAGAAA 14520 

TATTTAAAAC TCTTTAAAAT CTAAATTTTA AAGAAGTCTT ACTCTGAGGG CCTATTGCTG 14580 

TAAAATAATG GGCTCTTTTT TGATGCCCAA AAGTGAGGTT TATATGAAAC AAGAATCAAC 14640 

TGTTGATTTG TTAC 14654 



(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 6405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

AGAAAAATCT GCTTTACAGA AAATAAAAAT AATAGGAGAA AATCTATGTC AGATTTGAAA 60 

AAATACGAAG GTGTCATTCC AGCCTTCTAC GCATGTTATG ATGATCAAGG AGAAGTAAGC 120 

CCAGAACGTA CGCGTGCCTT GGTTCAATAC TTCATTGATA AAGGTGTTCA AGGTCTTTAT 180 

GTCAATGGTT CTTCTGGTGA ATGTATCTAC CAAAGCGTTG AAGATCGCAA GTTGATTTTG 240 

GAAGAAGTCA TGGCGCTAGC AAAGGTAAAT TGACCATTAT TGCCCATGTT GCTTGCAATA 300 

ATACTAAAGA TAGTATGGAA CTTGCTCGCC ATGCTGAAAG CTTGGGAGTA GATGCTATTG 360 

CAACGATTCC ACCAATTTAT TTCCGCTTGC CAGAATACTC AGTTGCCAAA TACTGGAACG 420 

ATATCAGTTC TGCAGCTCCA AACACAGACT ACGTGATTTA CAACATTCCT CAATTGGCAG 480 

GGGTTGCTTT GACTCCAAGC CTTTACACAG AAATGTTGAA AAATCCTCGT GTTATCGGTG 540 

TGAAGAACTC TTCTATGCCA GTTCAAGATA TCCAAACCTT TGTCAGCCTT GGTGGAGAAG 600 

ACCATATCGT CTTTAATGGT CCTGATGAGC AGTTCCTAGG AGGACCCCTC ATGGGGGCTA 660 

GGGCTGGTAT CGGTGGTACT TATGGTGCTA TGCCAGAACT CTTCTTGAAA CTCAATCAGT 720 

TGATTGCGGA TAAGGACCTA GAAACAGCGC GTGAATTGCA GTATGCTATC AACGCAATCA 780 

TTGGTAAACT CACTTCTGCT CATGGAAATA TGTACGGTGT CATCAAAGAA GTCTTGAAAA 840 

TCAATGAAGG CTTGAATATT GGATCTGTTC GTTCACCATT GACACCAGTG ACTGAAGAAG 900 

ATCGTCCAGT TGTAGAAGCG GCTGCTGCCT TGATTCGTGA AACCAAGGAG CGCTTCCTCT 960 

AATCTAAAAG GAGGTATTTA TCACATATTA CGTTGCAATT GATATCGGTG GAACCAACAT 1020 

CAAGTATGGT TTGGTTGATC AAGAGGGGCA ACTTCTTGAA TCGCATGAAA TGCCAACTGA 1080 

GGCGCATAAG GGTGGACCTC ATATCTTACA AAAGACCAAA GATATCCTAG CTAGTTATTT 1140 

AGAAAAAGGC CCAGTAGCAG GTGTTGCCAT ATCTTCTGCT GGGATGGTGG ATCCGGATAA 1200 
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GGGTGAGATT TTCTATGCTG GGCCGCAAAT CCCTAACTAC GCAGGCACCC AGTTCAAAAA 1260 

GGAAATCGAA GAAAGCTTTA CTATTCCTTG TGAGATTGAA AATGATGTCA ACTGTGCAGG 1320 

TCTTGCTGAG GCAGTATCTG GTTCAGGCAA GGGAGCAAGT GTGACACTTT GCTTGACCAT 1380 

TGGAACCGGT ATCGGTGGTT GCTTGATTAT GGATAGGAAA GTCTTCCATG GTTTTAGCAA 1440 

TTCAGCCTGT GAAGTCGGGT ATATGCATAT GCAGGATGGA GCTTTTCAAG ACTTGGCTTC 1500 

TACAACAGCT TTAGTGAAAT ATGTAGCTGA ACCCCATGGA GAAGATGTTG ATCAGTGGAA 1560 

TGGCCGTAGA ATTTTCAAAG AAGCCACTGA AGGAAACAAA ATCTGCATGG AAGGTATTGA 1620 

CCGTATGGTT GACTATCTAG GAAAAGGTCT GGCAAATATT TGCTACGTTG CCAATCCAGA 1680 

ACTGGTTATT CTTGGTGGTG GTATCATGGG GCAAGAGGCT ATCCTCAAAC CTAAGATCCG 1*740 

TACAGCCTTG AAAGAGGCTT TGGTACCAAG TTTAGCAGAA AAAACACGAT TAGAATTTGC 1800 

CCATCACCAA AATACAGCAG GGATGTTGGG TGCATATTAT C ATTTT AAG A CAAAACAATC 18 60 

CTAGTTTGGC TCAGCCAAAC TAGGATTTTC TTACACGTTT TTGTCTACGA TAGCCGTTGA 1920 

G TTTTTTATT TTCCCAGTAG CTATTAAAGA TTTTTTCCTT GCTTTCGCGA TTGATTTCCA 1980 

AAAAGTAGGC ATAAATCAAA TCGATAAAGA AGAGCATAGG AAGTTGAGCG GATATTCGTT 2040 

GGATATAGGA GGGTTGGCTG TGGGTGGCTA CAAGAACAGT CTCTGTATAG GTCTGGCTAT 2100 

CTTTATTGGG AACACTTGTA AAGAGTACAG TCTTTGCCCC CATCTCCTTA GCATCTAATA 2160 

GACTATCTAA AATAGAAGGA GTTGAGCCTG AAAGTGAGAA GCCCAGTACT AGACAATTTT 2220 

CATCCATGAT GCTGGTTGTC CAGGCAAAGC CGTCTTGGTC TGTCAAAGCT TCGCAGACCA 2280 

CACCTAGTCG CATAAAACGT AATTTCATTT CACGGGCGAC GAGGCCAGAA CTCCCTGTTC 2340 

CAAAGAAGTA GATACGCTCA GCATCTTCGA TTAGCTGGGC AATTCGTTCT AGTTGGATTT 2400 

CGTCAATCAA GTCTTGTGTT TGTTCCCTCA TATTGCTATA ACTTCTGAGG ACTCGTTTGG 2 4 60 

TCAGTGGACT GTGCTTGGAG ACTTGGTTGG CTTGATTTTC TGCCTGATGT TGGTATTGGA 2 520 

AAATAAATTC TCGGTAGCCA GTAAAGCCAC ACTTTTTAGC AAAGCGGGTC AAAGCAGCTT 2580 

GAGAAATATG TAATTTTTGG GTGACTTGTT GAGAAGATAA ATCATCTGTA ATCGTTTCAG 2 640 

CTTGCAAAAA ATAGCGAGCG ATTTCTTGTT CTAGGTCTGT CATTTCTTCA AAATGTGAAT 2 700 

CAATGATAGT TGCGATATCT GGTTTGTCCA TAGGGAAAGC TCCTTTACAT GAGTCATACT 2760 

GGAAGACTAG ATCAGAGAAT AGTCACACTT CATTATAACA CATAATATAA GGATAGATAA 2820 

ATAAAAACGC ATCTCTGTTT TAAAAACGAA AAAATCGAAA AAGCTTCTCT CTTTTCCATA 2880 

ATTTTCTACT CAAATTGTGG TACAATTAAG AGTAAGATTT TAAGTTAGAA ATGAGACTGA 2940 
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TTTGTATGAG 


AAAATTTAAC 


AGCCATTCGA 


TTCCGATTCG 


CCTTAATTTA 


TTGTTTTCAA 


3000 


TCGTCATTTT 


ACTCTTTATG 


ACCATTATTG 


GTCCTTTGTT 


GTATATGCAG 


GTTTTGAACA 


3060 


AGGATTTTTA 


CGAAAAAAAG 


CTAGCTTCAG 


CTAGTCAGAC 


CAAGATTACA 


AGCAGTTCAG 


3120 


CCCGTGGGGA 


AATTTATGAT 


GCTAGTGGAA 


AACCTTTGGT 


AGAAAATACG 


TTAAAGCAGG 


3180 


TTGTTTCCIT 


TACGCGTAGC 


AATAAAATGA 


CGGCTACAGA 


CTTAAAAGAA 


ACAGCTAAAA 


3240 


AGTTACTGAC 


TTATGTGAGC 


ATCAGTTCTC 


CAAATTTGAC 


AGAACGCCAG 


CTGGCGGATT 


3300 


ACTATTTGGC 


TGATCCTGAA 


ATCTATAAAA 


AAATAGTGGA 


AGCTCTCCCA 


AGTGAGAAAC 


3 360 


GCTTGGATTC 


AGATGGCAAT 


CGTCTATCCG 


AATCAGAACT 


GTATAACAAT 


GCGGTCGATA 


3420 


GTGTACAAAC 


GAGTCAACTA 


AACTATACAG 


AGGATGAAAA 


GAAAGAAATC 


TATCTTTTTA 


3480 


GTCAGTTAAA 


TGCTGTTGGA 


AACTTTGCGA 


CAGGAACCAT 


TGCGACAGAT 


CCTCTAAATG 


3540 


ATTCTCAGGT 


GGCTGTTATT 


GCCTCTATTT 


CAAAGGAGAT 


GCCTGCCATT 


AGTATTTCTA 


3600 


CTTCTTGGGA 


TAGAAAGGTT 


TTGGAAACTT 


CCCTTTCTTC 


TATAGTTGGG 


AGTGTATCCA 


3660 


GTGAAAAAGC 


TGGTCTCCCA 


GCGGAAGAAG 


CAGAAGCCTA 


TCTTAAAAAA 


GGCTATTCTC 


3720 


TAAATGACCG 


TGTAGGAACC 


TCCTATTTGG 


AAAAGCAATA 


TGAAGAGACC 


TTACAAGGAA 


3780 


AACGCTCGGT 


AAAAGAAATC 


CATCTGGATA 


AATATGGCAA 


TATGGAAAGC 


GTGGATACAA 


3840 


TTGAGGAAGG 


TAGTAAGGGA 


AACAATATCA 


AACTGACCAT 


TGATTTGGCT 


TTCCAAGATA 


3900 


GCGTGGATGC 


TTTACTGAAA 


AGTPATTTCA 


ATTCTGAGCT 


AGAAAATGGT 


GGAGCCAAGT 


3960 


ATTCTGAAGG 


TGTCTATGCA 


GTCGCCCTTA 


ACCCAAAAAC 


AGGTGCGGTT 


TTGTCTATGT 


4020 


CAGGGATTAA 


ACATGACTTG 


AAAACGGGAG 


AGTTGACGCC 


TGATTCCTTG 


GGAACGGTAA 


4080 


CCAATGTCTT 


TGTTCCAGGT 


TCGGTTGTCA 


AGGCGGCGAC 


CATCAGCTCA 


GGTTGGGAAA 


4140 


ATGGAGTCTT 


GTCAGGAAAC 


CAGACCTTGA 


CAGACCAGTC 


CATTGTCTTC 


CAAGGTTCAG 


4200 


CTCCCATCAA 


TTCTTGGTAT 


ACTCAGGCTT 


ACGGTTCATT 


CCCTATCACA 


GCGGTCCAAG 


4260 


CTCTGGAGTA 


TTCATCAAAT 


ACCTATATGG 


TCCAAACAGC 


CTTAGGTCTT 


ATGGGGCAAA 


4320 


CCTATCAACC 


CAATATGTTT 


GTCGGCACCA 


GCAATCTAGA 


GTCTGCTATG 


GAGAAACTGC 


4380 


GTTCAACCTT 


TGGCGAATAT 


GGCTTGGGTA 


CTGCGACAGG 


AATTGACCTA 


CCAGATGAAT 


4440 


CTACTGGATT 


TGTTCCCAAA 


GAGTATAGCT 


TTGCTAATTA 


CATTACTAAT 


GCCTTTGGGC 


4500 


AGTTTGATAA 


CTATACGCCG 


ATGCAGTTGG 


CTCAGTATGT 


ACCAACTATT 


GCAAATAATG 


4560 


GTGTTCGTGT 


GGCTCCTCGT 


ATTGTTGAAG 


GCATTTATGG 


TAATAATGAT 


AAGGGAGGAC 


4620 


TGGGTGACTT 


GATTCAGCAA 


CTGCAACCGA 


CAGAGATGAA 


TAAGGTCAAT 


ATATCCGACT 


4680 


CCGATATGAG 


CATCTTGCAC 


CAAGGTTTTT 


ATCAGGTTGC 


CCATGGTACT 


AGTGGATTGA 


4740 
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CAACTGGACG TGCCTTTTCA AATGGTGCCT TGGTATCCAT TAGCGGAAAA ACAGGTACAG 4800 

CCGAAAGCTA TGTGGCAGAT GGTCAGCAAG CAACCAATAC CAATCCGGTG GCCTATGCCC 4860 

CATCTGATAA TCCCCAAATC GCTGTCGCAG TGCTCTTTCC TCATAATACC AATCTAACAA 4 920 

ATGGTGTAGG ACCTTCCATT GCGCGTGACA TTATCAATCT GTATCAAAAA TACCATCCAA 4 980 

TGAATTAGAA AGGAAATTAT GCTTTATCCA ACACCTATTG CCAAGTTGAT TGACAGTTAT 5040 

TCTAAGTTAC CAGGTATCGG GATTAAGACG GCTACGCGTC TGGCCTTTTA TACGATTGGG 5100 

ATGTCTGCTG ATGATGTCAA TGAATTTGCA AAAAATCTCC TTTCTGCTAA GAGAGAATTG 5160 

ACATATTGTT CTATTTGTGG ACGTTTGACA GACGACCATC CTTGTTCTAT CTGTACTGAT 5220 

CCGACTCGTG ACCAGACAAC AATTTTAGTT CTTGAGGATA GTAGAGATGT GGCAGCCATG 5280 

GAAAATATCC AAGAATACCA TGGACTCTAT CATGTCCTTC ATGGCCTCAT TTCTCCTATG 5340 

AATGGTATCA GTCCGGACGA TATCAATCTC AACAGCCTTA TGACTCGTCT TATGGATAGT 5400 

GAGGTTTCAG AAGTGATTGT GGCGACTAAT GCTACAGCGG ATGGTGAAGC GACTTCCATG 54 60 

TATCTTTCAC GTTTGCTCAA GCCGGCTGGT ATCAAGGTTA CGCGTCTAGC ACGAGGTCTC 5520 

GCTGTGGGAG CGGACATTGA GTATGCGGAC GAAGTGACAC TCTTACGAGC CATTGAAAAT 5580 

CGGACAGAGT TGTAAGTGTA GGCAAATTTA CGAACTCCAT TCATTTATAA AAAATCAAAG 5640 

AGGCTGAAAA TCGTTCCTAT CGGCCTCTTT TTGTATAGTC TGATGAGTAG GCTCAGGTTC 5700 

AAGTTTTAAA AAACCAAGCA AATATGATAT ACTAAAGAGC GAGTATTCTA GTAGAATTAG 57 60 

GACAAATAAT ATGAAACAAA CGATTATTCT TTTATATCGT GGACGGAGTG CGGAACGCGA 5820 

AGTCTCTGTC CTTTCAGCTG AGAGTCTCAT GCGTGCGGTC GATTACGACC GTTTCACAGT 5880 

CAAGACTTTC TTTATCAGTC AGTCAGGTGA CTTTATCAAA ACACAGGAAT TTAGTCATGC 5940 

TCCGGGGCAA GAAGACCGTC TCATGACCAA TGAAACCATT GATTGGGATA AGAAAGTTGC 6000 

ACCAAGTGCT ATCTACGAAG AAGGTGCAGT GGTCTTTCCA GTCCTTCACG GGCCAATGGG 6060 

AGAAGATGGC TCTGTTCAAG GATTCTTGGA AGTTTTGAAA ATGCCTTACG TTGGTTGCAA 6120 

CATTTTGTCA TCAAGTCTTG CCATGGATAA AATCACGACT AACCGTGTTC TGGAATCTGC 6180 

TGGTATTGCC CAAGTTCCTT ATGTGGCTAT CCTTGAAGGC GATGATGTGA CTCCTAAAAT 6240 

CGCTGAAGTG GAAGAAAAAT TGGCTTATCC AGTCTTCACT AAGCCGTCAA ACATGGGGTC 6300 

TAGTGTCGGT ATTTCTAAGT CTGAAAACCA AGAAGAACTC CGTCAAGCCT TAAAACTTGC 6360 

CTTCCGATAT GACAGCCGTG TCTTGGTTGA GCAAGGAGTG AATGC 6405 
(2) INFORMATION FOR SEQ ID NO: 108: 
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(i) SEQUENCE CHARACTERISTICS: 

(A* LENGTH: 11309 base pairs 
(B) type: nucleic acid 
tC) STRANDEDNESS : double 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

CGAGCTCCGG TACCGGCATT TTAAGGAGTT TGATATGTAT AACCTATTAT TAACCATTTT 60 

ATTAGTATTA TCTGTTGTGA TTGTGATTGC AATTTTCATG CAACCAACCA AAAAC CAATC 120 

CAGCAATGTA TTTGATGCCA GTTCAGGTGA TTTGTTTGAA CGCAGTAAAG CTCGCGGTTT 180 

TGAAGCTGTA ATGCAGCGTT TGACAGGGAT TTTAGTCTTT TTCTGGCTAG CCATTGCCTT 240 

AGCATTGACG GTATTATCAA GTAGATAAGA AAATAATGGG CAGGACTAGG TCTTTGCCTC 300 

TTTTTATTTT TAAAGGATGT TTGAGAAGGT TTTACAGTAA AAGAAAATTA AAAAATCTAG 360 

AAAGAAAATA TGAAAGATAG AATAAAAGAA TATTTACAAG ACAAGGGAAA GGTGACTGTT 420 

AATGATTTGG CTCAGCCTTT GGGAAAAGAC AGTTCCAAGG ATTTTCGTGA GTTGATTAAA 480 

ACCTTGTCCT TAATCGAAAG AAAGCACCAA ATTCGTTTTG AAGAAGATGG TAGTCTGACA 540 

TTAGAAATTA AGAAAAAACA TGAGATTACC CTCAAGGGGA TTTTTCATGC CCATAAAAAT 600 

GGCTTTGGCT TTGTTAGTCT GGAAGGCGAG GAGGACGACC TTTTTGTAGG GAAAAATGAT 660 

GTCAACTATG CTATTGATGG TGATACCGTC GAGGTAGTGA TTAAGAAAGT CGCTGACCGC 720 

AATAAGGGAA CAGCAGCAGA AGCCAAAATT ATTGATATCC TAGAACACAG TTTGACAACA 780 

GTTGTCGGGC AAATCGTTCT GGATCAGGAA AAACCTAAGT ATGCTGGCTA TATTCGTTCA 840 

AAAAATCAGA AAATCACTCA ACCGATTTAT GTTAAGAAAC CAGCCCTAAA ATTAGAAGGA 900 

ACAGAAGTTC TCAAAGTCTT TATCGATAAA TACCCAAGCA AGAAACATGA TTTCTTTGTC 960 

GCGAGTGTTC TCGATGTAGT GGGACACTCA ACGGATGTCG GAATTGATGT TCTTGAGGTC 1020 

TTGGAATCAA TGGACATTGT ATCCGAGTTT CCAGAAGCTG TTGTTAAGGA ACCAGAAAGT 1080 

GTGCCTGATG CTCCGTCTCA AAAGGATATG GAAGGTCGTC TGGATCTAAG AGATGAAATT 1140 

ACCTTTACCA TTGACGGTGC GGATGCCAAG GACTTGGACG ATGCAGTGCA TATCAAGGCT 1200 

CTGAAAAATG GCAATCTGGA GTTTGGGGTT CACATCGCAC ATGTTTCTTA TTATGTGACC 1260 

GAGGGGTCTG CCCTTGACAA GGAAGCCCTT AACCGTGCGA CTTCTGTTTA CGTGACAGAC 1320 

CGAGTGGTGC CAATCCTTCC AGAACGACTA TCAAATGGCA TCTGCTCTCT CAATCCCCAA 1380 

GTTGACCGCC TGACCCAGTC TGCTATTATG GAGATTGATA AACATGGTCG TGTGGTCAAC 1440 

TATACCATTA CACAAACAGT TATCAAGACC AGTTTTCGTA TGACCTATAG CGATGTCAAT 1500 
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GATATCCTAG CTGGCGATGA AGAAAAGAGA AAAGAATATC ATAAAATTGT ATCAAGTATC 1560 

GAACTCATGG CCAAGCTTCA TGAAACTTTA GAAAACATGC GTGTGAAACG TGGAGCTCTC 1620 

AATTTTGATA CCAATGAAGC GAAGATTTTA GTGGATAAAC AAGGTAAGCC TGTTGATATC 1680 

GTTCTTCGGC AGCGTGGTAT TGCCGAGCGG ATGATTGAGT CTTTTATGTT GATGGCTAAT 1740 

GAAACAGTTG CCGAACATTT CAGCAAGTTG GATTTGCCTT TTATCTATCG AATTCACGAG 1800 

CAGCCTAACG CTGAAAAGGT TCAGAAGTTT ATTGATTATG CTTCGAGTTT TGGCTTGCGC 1860 

ATTTATGGAA CTGCCAGTGA GATTAGTCAG GAGGCACTTC AAGACATCAT GCGTGCTGTT 1920 

GAGGGAGAAC CTTATGCAGA TGTATTGTCC ATGATGCTTC TTCGCTCTAT GCAGCAGGCT 1980 

CGTTATTCGG AGCACAATCA CGGCCACTAT GGACTAGCTG CTGACTATTA TACTCACTTT 2040 

ACCAGTCCAA TTCGTCGTTA TCCAGACCTT CTTGTTCACC GTATGATTCG GGATTACGGC 2100 

CGTTCTAAGG AAATAGCAGA GCATTTTGAA CAAGTGATTC CAGAGATTGC GACCCAGTCT 2160 

TCCAACCGTG AACCTCGTGC CATAGAAGCT GAGCGTGAAG TCCAAGCCAT GAAAAAGCCT 2220 

GAGTATATGG AAGAATACGT GGCTGAAGAG TATGATGCAG TTGTATCAAG TATTGTCAAA 2280 

TTCGGTCTCT TTGTCGAATT GCCAAACACA GTTGAAGGCT TGATTCACAT CACTAATCTG 2340 

CCTGAATTTT ATCATTTCAA TGAGCGTGAT TTCACTCTTC GTGGAGAAAA ATCAGGTATC 2400 

ACTTTCCGAG TGGGTCAGCA GATCCGTATC CGTGTTGAAA GAGCGGATAA AATGACTGGA 2460 

GAGATTGATT TTTCATTCGT ACCTACTGAG TTTGATGTGA TTGAAAAAGG CTTGAAACAG 2520 

TCTAGTCGTA GTGGCAGAGG GCGTGATTCA AATCGTCGTT CGGATAAGAA GGAAGACAAG 2580 

AGAAAATCAG GACGCTCAAA TGATAAGCGT AAGCATTCAC AAAAAGACAA GAAGAAAAAA 2640 

GGAAAGAAAC CTTTTTACAA GGAAGTAGCT AAGAAAGGAG CCAAGCATGG CAAAGGGCGA 2700 

GGGAAAGGTC GTCGCACAAA ATAAAAAGGC ACGCCACGAC TATACAATCG TAGATACGCT 2760 

AGAGGCAGGG ATGGTCCTGA CTGGAACTGA AATCAAGAGT GTACGAGCTG CTCGAATTAA 2820 

TCTCAAGGAT GGCTTTGCTC AAGTGAAAAA TGGAGAAGTT TGGCTGAGCA ATGTTCATAT 2880 

CGCGCCTTAC GAAGAGGGCA ATATCTGGAA CCAGGAACCA GAACGTCGTC GTAAACTCCT 2940 

GCTCCATAAA AAGCAAATTC AAAAATTGGA ACAAGAGATC AAAGGGACAG GAATGACCTT 3000 

AGTTCCCCTT AAGGTCTATA TAAAAGATGG CTACGCTAAG CTTCTTTTAG GACTTGCCAA 3060 

AGGGAAGCAT GACTATGACA AACGGGAGTC TATCAAACGT CGTGAGCAAA ATCGAGATAT 3120 

CGCGCGTGTG ATGAAAGCTG TTAATCAGCG ATAAAAAGAG GAATTGAAAA TGCAAAAATT 3180 

AGTTGCCTAT AAACGCATGC CTTTGTGGAA TAAACAAACA ATGCCTGAAG CTGTTCAGCA 3240 
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AAAGCACAAT 


ACAAAAGTTG 


GGACTTGGGG 


GAAAATTACT 


GTCTTGAAGG 


GAGCTCTCAA 


3300 


GTTTATTGAA 


TTGACAGAAG 


AAGGGGAAGT 


TCTAGCTGAA 


CACCTCTTTG 


AAGCAGGGGC 


3360 


AGACAATCCA 


ATGGCCCAAC 


CTCAAGCCTG 


GCACCGAGTG 


GAAGCTGCCA 


CAGATGATGT 


3420 


GGAATGGTAC 


TTGGAATTTT 


ATTGTAAACC 


TGAGGATTAT 


TTTGCTAAAA 


AATACAATAC 


3480 


CAATCCTGTT 


CATTCAGAGG 


TCCTAGAGGC 


CATGCAGACA 


GTGAAACAAG 


GGAAAGCTTT 


3540 


GGATTTGGGT 


TGTGGTCAGG 


GGCGTAATTC 


TCTTTTTCTA 


GCCCAGCAAG 


ATTTTGATGT 


3600 


GACGGCTGTA 


GATCAAAATG 


GACTAGCTCT 


TGAAATCTTG 


CAAAGCATTG 


TGGAGCAGGA 


3660 


AGATTTGGAC 


ATGCCTGTTG 


GCCTTTACGA 


TATCAATTCA 


GCTAGCATTG 


AACAAGAATA 


3720 


TGATTTTATC 


GTTTCAACAG 


TTGTTCTCAT 


GTTTCTACAA 


GCGGACCGCA 


TTCCAGCTAT 


3780 


TATTCAAAAT 


ATGCAGGAGA 


AAACCAGTGT 


TGGTGGTTAC 


AACCTTATCG 


TTTGTGCCAT 


3840 


GGACACGGAG 


GATTATCCTT 


GCTCGGTTAA 


CTTCCCATTC 


ACCTTTAAAG 


AAGGAGAACT 


3900 


GGCAGACTAT 


TACAAGGATT 


GGGAATTGGT 


TAAGTACAAT 


GAAAATCCAG 


GCCATTTGCA 


3960 


CCGTCGCGAT 


GAGAATGGCA 


ATCGTATTCA 


ACTACGCTTT 


GCGACCTTAC 


TAGCTAAGAA 


4020 


AATCAAGTAA 


ACACACATGA 


AGATTAGGAA 


TTTTCCTGAT 


CTTTTTTCTT 


TTTTACGAAT 


4080 


GATATAGAAA 


AGGAGGGAAT 


TCATGTTTGT 


TGCGAGAGAT 


GCTAGGGGAG 


AATTGGTAAA 


4140 


TGTGTTAGAG 


GATAAACTTG 


AGAAGCAAGC 


ATACACCTGC 


CCAGCTTGTG 


GAGGCCAGCT 


4200 


CCATTTGCGT 


CAAGGACCAA 


GTGTACGGAC 


GCATTTTGCC 


CATAAATCCT 


TAAAAGACTG 


4260 


TGATTTTTTC 


TTTGAAAATG 


AAAGTCCAGA 


ACACCTGGCC 


AATAAGGAAT 


CCCTCTATCA 


4320 


CTGGTTGAAA 


AAAGAGACAA 


AGGTTCAATT 


AGAGTACCCG 


CTTTCAGAAC 


TTAAACAGAT 


4380 


TGCGGATGTA 


TTTGTAAATG 


GCAATCTAGC 


TCTAGAAGTT 


CAGTGTAGTC 


CCTTGCCTCA 


4440 


GAAAGTCCTT 


AAAGAGCGAA 


GTGAGGGCTA 


TCGTAGTCAG 


GGTTACCAAG 


TACTGTGGTT 


4500 


GCTGGGTCAA 


AAACTGTGGC 


TCAAGGAGCG 


TTTGACTCGT 


CTACAGCAAG 


GTTTTCTTTA 


4560 


TTTCAGTCAA 


AACATGGGCT 




ndAATTAflAf* 
vninn i i n\?tv*f 




Aau 1111 f\4\Kj 




ACTCAAATAC 


CTGATTTACC 


AGGATCTCCG 


CGGTAAACTC 


CATTATCAAA 


TCAAGGAATT 


4680 


TTCCTATGGT 


CAAGGTAGTT 


TATTGGAAAT 


ATTCCGTCTT 


CCCTATAAGA 


GACAAAAAAT 


4740 


ATCTCATTTT 


ACAGTTTCTG 


AGGACAAGGA 


CATCTGTCGC 


TATATCCGGC 


AACAACTTTA 


4800 


TTATCAAAAT 


CTCTTTTGGA 


TGAAAGAACA 


AGCAGAAGCC 


TATCAAAAGG 


GAGAAAATAT 


4860 


CCTGACTTAT 


GGACTGAAAG 


AATGGTATCC 


ACAAATTCGA 


CCAATAGTGG 


GCAAATTTTT 


4920 


CCAGATTGAA 


CAAGACTTGA 


CTAGCTATTA 


TCAGCACTTT 


TATACCTATT 


ACCAAAAAAA 


4980 


TCCTCAAAAT 


GATTGGCAAA 


AGCTTTATCC 


ACCAGCCTTT 


TATCAGCAAT 


ATTTCTTGAA 


5040 
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AAATATGGTA GAATAGAAAG GATGGAGGAA TCTAATGGTA TTACAAAGAA ATGAAATAAA 5100 

TGAAAAAGAT ACATGGGATC TATCAACGAT CTACCCAACT GACCAGGCTT GGGAAGAAGC 5160 

CTTAAAAGAT TTAACAGAAC AATTGGAGAC AGTAGCCCAG TATGAAGGCC ATCTCTTGGA 5220 

TAGTGCGGAT AACCTACTAG AAATCACTGA ATTTTCTCTT GAAATGGAAC GCCAGATAGA 5280 

GAAGCTTTAC GCTTATGCTC ATATGAAGAA TGACCAGGAT ACACGTGAAG CTAAGTATCA 5340 

AGAGTACTAT GCCAAGGCCA TGACACTCTA CAGCCAGTTA CACCAAGCCT TTTCATTCTA 5400 

TGAGCCTGAA TTTATGGAGA TTAGCGAAAA GCAGTATGCT GACTTTTTAG AAGCTCAACC 54 60 

AAAGCTGCAG GTTTATCAAC ACTATTTTGA CAAGCTTTTG CAAGGCAAGG ATCACGTTCT 5520 

TTCACAACGT GAAGAAGAAT TATTGGCTGG AGCTGGAGAA ATCTTTGGTT CAGCAAGTGA 5580 

AACCTTCGCT ATCTTGGACA ATGCGGATAT TGTGTTCCCT TATGTCCTAG ACGATGATGG 5640 

TAAAGAAGTT CAGCTATCTC ATGGGACTTA CACACGTTTG ATGGAGTCTA AAAAACGTGA 5700 

GGTTCGCCGT GGTGCCTATC AAGCTCTTTA TGCGACTTAC GAACAATTCC AACACACCTA 5760 

TGCCAAAACC TTGCAAACCA ATGTTAAGGT GCAAAATTAC CGTGCTAAAG TTCGTAACTA 5820 

CAAGAGTGCT CGTCATGCAG CCCTCGCAGC GAATTTTGTT CCAGAAAGTG TTTATGACAA 5880 

TTTGGTAGCA GCAGTTCGCA AGCATTTGCC ACTCTTACAT CGCTATCTTG AGCTTCGTTC 5940 

AAAAATCTTG GGGATTTCAG ATCTCAAGAT GTACGATGTC TACACACCGC TTTCATCTGT 6000 

TGAATACAGT TTTACCTACC AAGAAGCCTT GAAAAAAGCA GAAGATGCTT TGGCAGTCTT 6060 

GGGTGAGGAT TACTTGAGCC CTGTTAAACG TGCCTTCAGC GAGCGTTGGA TTGATGTTTA 6120 

CGAAAATCAA GGCAAGCGTT CAGGTGCCTA CTCTGGTGGT TCTTATGATA CCAATGCCTT 6180 

TATGCTTCTC AACTGGCAAG ACAATCTGGA CAATCTCTTT ACTCTTGTTC ATGAAACAGG 6240 

TCACAGTATG CATTCAAGCT ATACTCGTGA AACTCAGCCT TATGTTTACG GGGATTACTC 6300 

TATCTTTTTG GCTGAGATTG CCTCAACTAC CAATGAAAAT ATCTTGACGG AGAAATTATT 6360 

GGAAGAAGTG GAAGACGACG CAACACGCTT TGCTATTCTC AATAACTTCC TAGATGGTTT 6420 

CCGTGGAACA GTTTTCCGCC AAACTCAATT TGCTGAGTTT GAACACGCCA TTCACCAAGC 6480 

AGATCAAAAT GGGGAGGTCT TGACAAGCGA TTTCCTAAAT AAACTCTACG CAGACTTGAA 6540 

CCAAGAGTAT TATGGTTTGA GTAAGGAAGA CAATCCTGAA ATCCAATACG AGTGGGCTCG 6600 

CATTCCACAC TTCTACTATA ACTACTATGT ATATCAATAT TCAACTGGCT TTCCGGCCGC 6660 

CTCAGCCTTG GCTGAAAAAA TTGTCCATGG TAGTCAAGAA GACCGTGACC GCTATATCGA 6720 

CTACCTCAAG GCAGGTAAGT CGGACTATCC ACTTAATCTC ATGAGAAAAG CTGGTGTTGA 6780 
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TATGGAGAAG GAAGACTACC TCAACGATGC CTTTGCACTC TTTGAACGCC GTTTAAATGA 6840 

GTTTGAAGCC CTTGTTGAAA AATTAGGATT GGCATAAAAT GGTTGAATCG TATAGTAAGA 6900 

ATGCTAACCA TAACATGCGT CGTCCTGTCG TCAAAGAAGA AATTGTAGAC TTGATGCGTC 6960 

AGCGTCAAAA GCAGGTCACA GGTTTCTTGA AAGAATTGGA AGACTTTGCC CGCAAGGAAA 7020 

ATATTCCTAT TATTCCCCAT GAAACGGTTG CTTATTTCCG - TTTTCTTATG GAAACCATGC 7080 

AGCCTAAAAA TATTCTGGAA ATTGGGACGG CTATCGGTTT TTCAGCTCTC TTGATGGCTG 7140 

AACATGCGCC AAATGCTAAG ATTACAACTA TTGATCGTAA TCCAGAAATG ATTGGTTTTG 7200 

CCAAGGAAAA TTTTGCCCAG TTTGACAGTC GCAAGCAAAT CACTCTCCTA GAGGGAGATG 7260 

CGGTGGATGT CTTATCTACA CTGACAGAGT CTTATGATTT CGTCTTTATG GATTCTGCCA 7320 

AGTCTAAATA CATCGTCTTT CTGCCAGAAA TCCTCAAACA TTTGGAAGTT GGTGCTGTCG 7380 

TTGTCTTGGA TGATATTTTT CAAGGTGGTG ATGTTGCCAA GGATATTATG GAAGTCCGTC 7440 

GTGGTCAGCG AACCATTTAT CGAGGCCTTC AAAAATTATT TGATGCAACC TTAGACAATC 7 500 

CAGAACTCAC CGCAACATTA GTGCCTTTAG GAGATGGTAT TCTCATGCTT CGTAAAAATG 7560 

TAGCAGATGT TCAACTGTCT GAAAGCGAAT GATTTTCAGA AAAATTTAAG AAAAAATAGT 7620 

AAAATAGATA GAGTAACACT TATCTCAAAG GAGTAGACAT GAAGAAAAAA TTATTGGCAG 7680 

GTGCCATCAC ACTATTATCA GTAGCAACTT TAGCAGCTTG TTCGAAAGGG TCAGAAGGTG 7740 

CAGACCTTAT CAGCATGAAA GGGGATGTCA TTACAGAACA TCAATTTTAT GAGCAAGTGA 7800 

AAAGCAACCC TTCAGCCCAA CAAGTCTTGT TAAATATGAC CATCCAAAAA CTTTTTGAAA 7860 

AACAATATGG CTCAGAGCTT GATGATAAAG AGGTTGATGA TACTATTGCC GAAGAAAAAA 7920 

AACAATATGG CGAAAACTAC CAACGTGTCT TGTCACAAGC AGGTATGACT CTTGAAACAC 7980 

GTAAAGCTCA AATTCGTACA AGTAAATTAG TTGAGTTGGC AGTTAAGAAG GTAGCAGAAG 8040 

CTGAATTGAC AGATGAAGCC TATAAGAAAG CCTTTGATCA GTACACTCCA GATGTAACGG 8100 

CTCAAATCAT CCGTCTTAAT AATGAAGATA ACGCCAAAGA AGTTCTCGAA AAAGCCAAGG 8160 

CAGAAGGTGC TGATTTTGCT CAATTAGCCA AAGATAATTC AACTGATGAA AAAACAAAAG 8220 

AAAATGGTGG AGAAATTACC TTTGATTCTG CTTCAACAGA AGTACCTGAG CAAGTCAAAA 8280 

AAGCCGCTTT CGCTTTAGAT GTGGATGGTG TTTCTGATGT GATTACAGCA ACTGGCACAC 8340 

AAGCCTACAG TAGCCAATAT TACATTGTAA AACTCACTAA GAAAACAGAA AAATCATCTA 8400 

ATATTGATGA CTACAAAGAA AAATTAAAAA CTGTTATCTT GACTCAAAAA CAAAATGATT 8460 

CAACATTTGT TCAAAGCATT ATCGGAAAAG AATTGCAAGC AGCCAATATC AAGGTTAAGG 8520 

ACCAAGCCTT CCAAAATATC TTTACCCAAT ATATCGGTGG TGGAGATTCA AGCTCAAGCA 8580 
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GTAGTACATC AAACGAATAG TCCAAATCAA TGAGTCAGGG AAAAAACTCG ACTTCAGGAA 8640 

AAAATGAAGC AAACATTCCC ACAATAAAAC GCATAGTACA AGGTTTGTAC TGCCCCCCAA 8700 

AAAGTTAGAC AATTAATTTA TCCGAAGGAT TTAGTTCTGT ATTGCACAGA GCTAAGTCCT 8760 

TTTAGTTTTA TCTTAATTCT CTTATTGTTG TAATAATCAA TATAGTCTAT AATGGCTCGT 8820 

TCCAATTGAT TAAGTGATTT AAATGTTTTC TCATAGCCAT AAAACATTTC GGATTTTAAA 8880 

ATGCCAAAGA AAGATTCCAT CCTACCGTTG TCTTGGCTGT TGCCCTTACG TGACATGGAT 8940 

GCTTGAATTC CCTTACTCTC TAGGAAGCGA TGATAAGAAT CGTGTTGATA TTGCCAGCCT 9000 

TGGTCACTAT GGAGAATCGT ATTCTCGTAG TGCTTCTCTT TGAATGCCTG TTCCAACATT 9060 

AACGATCAAT CAATTTAATC ATGTACCTAA GATTAGAATT GTTTATCCCA AATTTATTTG 9120 

AAAGCTTCTC TAAGCTATAT CCTTGTTTTC TAAGTTCATA GATCTGAACT TTATCATCAT 9180 

AAGTTAATTT CATAATAAAA ACACCCCAAA AGTTAGATTT TTTCTGTCTA ACTTTTGGGG 9240 

TGTAGTTCAT GTACACCTGA TATGATGCGT TTTATAATTT TAAAGACTTT TTGACCAGCC 9300 

TCATTTTTTT AACTTGATAC TCAGTGAAAA GCAAAGATTA AACTAGGAAG CTAGCTGTAG 9360 

GCTGCTCAAA GAACAGCTTT GAGGTTGTAG ATAAAACTTG TGACGTCACC AACATATATA 9420 
ATGTGAAGCT GACGTGGTTT GAATAGATTT TAGAAGAGTA TGAGTCTGGA AGTTTTAATG * 9480 

GATAATGCAA GATTCCATAG AATGGGTAAG CTAGAGTTCT TATGTGAAGA CTTTGGGCAT 9540 

AAACTTTTAC CTTTTCCTCC CTACTCATCT TAGTATAGAA AAGTGAATCT GAAATAGTAC 9600 

ATAACTGCTT CTAAAACATT CTTATAAATT GATTTAAATT CTCAAATCAT ATTATTCAGT 9660 

TCTTATTTCA TTTTGTTCTA CAATCCTGTT GAGAAGACAC GTGTTCATAT CAAAAAGGTA 9720 

TTGGCAAGTT GCAATACCTT TTTACGAGGC TCTGTTGTCT TATTTTTGTT TCAACTGACT 9780 

ATATCTCCTA TGGTTCTAGT TCAGAAGGCT AGGCTATAAT TATGATTGAT AAGAACTATC 9840 

ATTCCAAGTA TTGGGAGTGA ATGTTTCAAA ATCATGGGTT TCTATAATGC TCAGGCTGGC 9900 

ATTTGCTAGA CCGCCATCTT TACGAAGAAG TGGTTCTTTA TAGCCTAGGA GAGTACGAAG 9960 

ACTGGCAGTA AGATTGGCGC CGTGTCCGAC AATTAGAATA CGTTCAGCTG GACTATCTTT 10020 

TAATGATTTG ATAAATTGGA TGCTCCGTTC AGTTGTACTA TAGAGGGATT CGGCTCCGAA 10080 

CATTCGAGTG TCAAATTGAG CAAGATTTGA ACGAAAAGCC TGGATTTGTT GCGGGTAAAT 10140 

AGCTTCCAAG GTTGCAATTT TCAAACCTTC TAACTTCCCA AGTTGCCATT CACGGAGATT 10200 

AGGAACGATT TCTAAAGAAC AGGGGGTATA GAGTTGACTT TGGATAATCT CAGCAGATTT 10260 

GACCGCTCGA GGTAAATCAC TTGAATAAAT CTGATCAAAA GGAATTTCCT TGAGATACTG 10320 
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ACCAAGTCGT TTTAGGGTTT CAATGGATTC AGGAAGAAGA 
TTGAAAACGA CCTTCTTGGT TCCAGAGGGT ACGACCGTGG 
TACTTGATGT CCTCCAAAAT ATCTACAAAG TCTGCCTTTA 
GGCGCGACGA TAATGCTGTG TCCGACTTCG CCTGCAGAGA 
AGAGCAATTT TATCGATAAA AATGGGATAA TTGTGTTTCT 
TTGGCTCCAT GAATGTAACC AGTTGTTTTT TCTAAGTCCT 



GGAGAATCAC CACTAGCACC 
CGGACAAAGT AGAGTTTCAT 
CAAAGCTAGC CAAGTCTTGT 
CAATCATTTG ATCCAAATCT 
GACGAATTCC GACAGGATTA 
TTTGTGGAAT CATGCTCACT 
TTTTTATTGC CAGAAATTTT AGCTAGTTTC TTTTCAGACA AGTCCTGAGT GATAGGGACA 
ATTCCGATAA TCGGTCCGGT CTTGTCTCCC AAAAGCGCCA AGGTTTTGAA AATCTGATCT 
CGTTCATAAC CTTGAGGAAG CTCTCCTTCT AGGGCATTGA TTTGAATCCC CTGATGAGGG 
ATAGCTGCTT TAGATAGGAT TTGTTCCACC AATGTTTTTT TGATTTTAAC TTTTTTTGCC 
ATTATTTATA TTTATCCTCC AATTGACTCA TCCAAATACC AAGCCAGATT CCCAGCGCAA 
AGAAGAAGGC CATGATGACA TAACCGACAA GTGAAAGTCC TGTGTATTGG ATACTTTCAG 
CGTTTCCTGC ATTTGGAATT AAGATCAAAA GGGTACTTGA TAGGACGATA CCGATGATGA 
AATGATAGAC GAACTGTTTA CGGAGTTCTT CTAGTTCTCC GTCCGTCCAA GCGTAGGCCA 
CTTCTTCTTT CTTGCCTTTA CCTTTGGACA TCTTGTAAAG AGGTGGGAGG GCAATATAGA 
CATGACCTGC CTCGACTAGC GGACGCATGT AACGGTAGAA AAATGTCAAG AGCAAGGTCT 
GGATATGGGC ACCGTCGGTA TCCGCATCG 
(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5548 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



10380 

10440 

10SOO 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

1U00 

11160 

11220 

11280 

11309 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
CCATAGTCTA ACAAGTCTTT CTAAAGGTTT ATCCCTGATT CATGTAAAGA TTGTGTAAAG 
AATCAAAAAA AGCCACTTTT GAAAAATGGC TGCTCCTAAA AATAGCTTTA AAAATTATTA 
GTCCTGTGCG AAAGATTGGT TAGGAAGAAA AATCGTGAAG CAACTGCCTC TGCCAAGCTG 
ACTCGTCACC GTGACTTGGC CACCTAATAA TTGACTGAGT TCTTTGACAA TGGCAAGGCC 
AAGACCAGTG CCACCAGTTT GTCTGCTTCG ACCTTTATTA ACTCGGTAAA AACGTTCAAA 
AATACGATCC TGCTCTAATT GACTAATACC AATCCCTGTA TCTGATACAG AAATCTTAAT 
CCCTTCGTTC ACCTTTTGGG TCTTGACCTC AATTTTTCCC CCTTGTTCAG TGTAACGGAT 



60 
120 
180 
240 
300 
360 
420 
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GGCATTGGAT AAAAGATTGA GTAAGATTTG GGAAAGTAAT TGACTATCTG ATACGAGGGT 480 

GACATCATCT GGCACCTGCA CCTTTAGCTG TAAATCCTTC TTCTTGAGCT GAGGTTGCAA 540 

GCTTTGAGTC AAATCCTGTA CAAATTCTGC CAAAGAAAGG GTCGTCCATT GTATAGGCAT 600 

TTGTTGAGCC TTAGATAAGG TAAGAAGATG CTCAACAATA TGCTCAAGAC GCAAACTTTC 660 

TTTGTAAATA ATGTCTAGAA AGTCATCCTT GAGCGCTTCT TCTTCAGCTG ACATCCCCTT 720 

AATGGTTTCA GCAAAGCCCT TAATCGAAGT AACTGGTGTC CTCAATTCAT GGGAGGCATT 780 

TGAGACAAAG GCTAAATTTA ACTTTTCATA AGTTCTAATC GTTGTTAAAT CATATAGCAA 840 

GACGAGCACA GCTTCCACAG ATTGGGTGGG GCTAAAAACG GGAACTGCTG TCACTTCTAA 900 

AATCAAGTCA CCCTCATGAA ACCCACTTAC TTCTTGTTTT AACCTTGTTT TTTGATCAAA 960 

GGCTTGGTGA ACTAAATTCC GAATATCCAT CCGTTTGAGG TCATCAAGTG AACTTATGTC 1020 

GCCGTCCACA TCGGGAAAAT AATGAGGCAG AGAGCGACTG GATAATAACA TCTGACCTTG 1080 

AGCGGAAACT AAAAACGTCC CCATGGTTAG GTGCGACAGA AGAACCTCCA TTGTTTCGGC U40 

TAGATCCTTG TATTGCTGAT CCTGTTGGGA GACTTTGGTT TTTAGGCCAG ACACATACTG 1200 

AGCCAAAGAC TTTAAGTCTT CTTGCCCTTT TTCTAAAAAG TATTCACTAC TGGTCAAGAG 1260 

AGGTTGGTGC AAGGTCTCAA AAGCAACTTC CCATTTCCAA AGCCAAAAGA GCCAGTAGCC 1320 

ACCTAGTCCC AAAGAAAGGG CTAGAAGAAA GAGACCGATG CCTTTACTGA TCCAAGTTAA 1380 

TGCCATCCCT GCAATCAGAA TGAGGCTAAC ACTTAGATTG ACTAGCCAAA ATTGAAGGTA 1440 

GCGTTTCATC TATAACTCCT TGAACTTATA ACCATAACCC CGAATGGTTC GAATAAATTG 1500 

AGGGGCTTTA GGATTGTCTT CAATTTTTTC CCTCAACTTA CCAATATGAA CGTCCACCAA 1560 

ACGTGTTTCC TGCCCAAAGT CATACCCCCA GATACGTTCC AAAAGACGCT CTCTAGTCAG 1620 

TGTCATGTTG GGATGTTTCA TAAGATAGAG CAAGAGTTCA AATTCTTTTG GGGTCAAACT 1680 

CAGTAACTTA TTCGCCTTGT AGACTTCATG ACGCTCAGGG TATACTTTCA AGGTCCCAAA 1740 

TAGCCAAGAA TCGTCAGCGA TATTATCTGA ATCATCTCCT TCTTGTTCTC CTTTAGTTCG 1800 

CCTGAGGACA GCCTTGACAC GCGCCAGCAA TTCTCTAGGG CTAAAAGGCT TGGTCAGGTA 1860 

GTCATCAGCC CCTAATTCCA AGGCCAAAAC CTTATCAAAT TCATCACTTT TCGCAGAAAC 1920 

CATCATAATT GGAGTTTTGA CGCCTTTGGC TCTCAGCCGC TTACAAACTT CCATGCCATC 1980 

TAATTGTGGT AACATGATAT CAAGCAAGAT AAAATCAAAG GGTTCTGTTT CTGCCAAAGC 2040 

TAAGGCCTTC CGTCCATTTG TCACCAATTG AGTAGAAAAG CCTTCCTTAC TTAAATGGTA 2100 

GTCAAGCAAT TTCAGAATGT GTTCTTCATC ATCCACTAAT AAGACTTGTT TTGTCATCTA 2160 
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TTATCTCCTA TTGGTAACAT TATAACACAA TTATCAGAAA TCCTAACATT GCTAAATCAG 2220 

ATTAAATTTG CCTATCAAGA CTAGTATCTG GTCAAACGCT CAATCATCTC CTTGTGCTCT 2280 

GGATAGGTCG CCAGTAGATC TACCCTTTCA AATAATTCAA AATCCTCAAA TTCAAAACCA 2340 

GGAGCAACAA GACAAGAAAC CAGAGCATCA TCCTTATCAA CTGTTGATCC CCAAATAGTG 2400 

CCCTTAGGAA CACAGTAGTG AAGTTGTTGC CCTTTGGATA TGTCCAGGCC TAAAGTGACT 2460 

GCTTCGTAGT GACCATCTGC TGTAATCATG TGAACAGTAA GTGGGGATCC TGCATGAAAA 2520 

TACCAGATTT CATCTGCTGT CAATCGGTGA AAATGTGAAG GATTCGTTTC TTCTAATAAG 2580 

AAATAAATAC TGGTATAAAG CGCCCTTCCC TTACCAGCAA GGTTTATAGT GTCTGAAGCT 2640 

TTTTTTGTTT GTCTAAAATA GCCACCTTCA ATATGGGGAG CTAACTCTAG AGTTCTTATC 2700 

AAGTCTTCTT TATCCGTCGG AGCCAATGGG TTGAAGTAAC TCTTGTTCAA AGTGGTTTTA 2760 

CGATTTCAAG AACTCCTCTC AGTTCTGAGG ACACGGTAAT GATTGATGCG ACGGAAGTAC 2820 

AAATCAATCG CCCTAAAAAA AGAATTAGCG AATGATTCTG GTAAAAAAAA TGCCACGCTA 2880 

TGAAGGCTCA AGCGATTGTC ACAAGTCAAG GGAGAATTGT TTCTTTGGAT ATCGCTGTGA 2940 

ACTATTGTCA TGATATGAAG TTGTTCAAAA TGAGTCGCAG AAATATCGGA CAAGCTGGTA 3000 

AAATCTTGGC TGACAGTGGT TATCAAGGGC TCATGAAGAT ATATCCTCAA GCACAAACTC 3060 

CACGTAAATC CAGCAAACTC AAGCCACTAA CAGTTGAAGA TAAAGCCTAT AACCATGCGC 3120 

TATCCAAGGA GAGAAGCAAG GTTGAGAACA TCTTTGCCAA ACTAAAAACG TTTAAAATGA 31 BO 

TTTCAACAAC CTATCGAAAT CATCGTAAAC ACTTCGGATT ACGAATGAAT TTGATTGCTG 3240 

GCATTATCAA TCATGAACTA GGATTCTAGT TTTGCAGGAA GTCTATTATT TGGTTAGGTG 3300 

AATTAGTGAA GCGTTTAGGC AAGTGTCTCT GGTTACGACG TCATGGACTC TAAATCGATT 3360 

ATATTTAGGG GTCATGACTA GTGAAGCAGT TAGCTAGTTC GCATATAAGC GGCTAGCGTC 3420 

TAACAATTAG GAACTTTAGT TCCAATAACT TTAAGATTAC GACGTTTTAG GACATAAATC 3480 

GATCATATTT ATGTCCTAAA ACTAGTGAAG CGCCTAGCCA AAGTCCGAAT AGGATTTiiGC 3540 

GTTAGTTACT TAGATTGCTT TGCAATCAAG TAACTTTGGC GATTTACATC TTCTCTGGCG 3600 

CTTCTACTCC AAGCAAGCGA AGGGCTTCTT TGAGAACGAC TGCGGTTGCG TAGCTGAGGG 3660 

CTAGACGGCT GTCGCGTTCT GGGCTTTCAT CCAAGATACG TGTATGTGCA TAGTATTTGT 3720 

TAAAGGATTG AGCCAGGCTA ATTGCAAATT TAGCAATGAT AGAAGGTTCA AAGTTATCTG 3780 

CCGCACGGTT GATAATACGT GGGAAGTCTT GAATGAGTTT AATGATTTCC CAGCTTTCAG 3840 

TATCATTCAA GCTATAGTTG CCAGCTGTTT CTGGTTTGAA ATCGGCTTTG CGTAAGATAG 3900 

ATTGGATACG AGCGTAGGCA TATTGAACGT AAGGTCCAGT TTCACCCTCG AAGGATACCA 3960 
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TAGCCTCTAG 


GTCGAAGTCG 


TATCCATTTG 


TACGGTCGGT 


TTTGAGGTCA 


TAGAATTTAA 


4020 


TGGCTCCAAT 


CCCAACAGCA 


TGTGCTACTT 


GGTCTTTGTT 


TTCTAGTTCA 


GGATTTTTAG 


4080 


CCTCGATTTG 


GACCTTGGCA 


CGGCTAACAG 


CCTCTGCAAC 


AGTAGGCTCT 


AGCAAGATGA 


4140 


CATTCCCTTT 


ACGAGTAGAG 


AGTTTCTTCC 


CTTCTTTTGT 


AACCAAACCA 


AAAGGAACGT 


4200 


GAGTAATGTC 


GTCAOTCCAG 


TCGTAGCCCA 


TCTCTTGCAA 


GACAGCTTTG 


AGCTGTTTAA 


4260 


AGTGGGCAGA 


TTGTTCTTGA 


CCAACGACAT 


AGATAGATTT 


AGCAAATTGG 


TATTCGTTTT 


4320 


TACGGTAGAG 


GGCTGCAGCC 


AAGTCACGTG 


TGATATAGAG 


AGTTGCACCA 


TCAGACTTCT 


4380 


TGATGAGGGC 


TGGATGTTCA 


ATTCCATATT 


TCTCAAGATT 


CACAACTTGG 


GCACCTTCTG 


4440 


ATTCAAGAAG 


TAGTCCTTTT 


TCAGAAAGAA 


TGTCTACAAC 


TGCATCCATC 


TTATCATTGT 


4500 


AGAAGGCTTC 


TCCGTTATAG 


CTGTCAAATT 


CAACCTTCAA 


TTCATTGTAA 


AGGCGGTTAA 


4S60 


ATTCCACTAA 


ACTTTCATCG 


CGGAACCATT 


GCCAAAGAGC 


GAGAGCTTCC 


TCATCTCCAT 


4620 


TTTCAAGTTT 


ACGGAACCAT 


TCGCGCGCTT 


CTTCATCCAA 


GCTAGGGTCA 


TTTTCAGCTT 


4680 


CAGCGTTGAT 


GCGGACATAG 


AGTTTAAGGA 


GTTCATCGAT 


TGCATGAGCT 


TTTACAGCTT 


4740 


CTTCGTCGCC 


CCATTTTTTG 


TAGGCAACAA 


TCAACATCCC 


AAATTGTTTA 


CCCCAGTCTC 


4 800 


CCAAATGGTT 


GACCTTGACC 


GTTTGATAAC 


CGATTTTTTG 


GAAAATATGT 


GACAAGCTAT 


4860 


CTCCGATAAC 


AGTTGAACGC 


AGGTGGCCAA 


TAGAAAATGG 


TTTAGCGATA 


TTCGGACTAG 


4920 


ACATGTCGAT 


AACAACATTT 


rCTTGTTTAC 


CAATATTTTG 


GTCAGCATAG 


TGTTCTTTTT 


4980 


CAGTGGTAAC 


AGCTTGCAAT 


ACTTGAGCAG 


AAATGGCAGA 


TTTATCAAGG 


AAAAAGTTAA 


5040 


CGTAAGGTCC 


TGTTGCGACA 


ACTTTTTCAA 


AGGCTTGGCT 


GTTCATTTTT 


TCAGCCAGTT 


5100 


CAGCCGCAAT 


CATTTGTGGT 


GCTTTACGTT 


CGACTTTTGC 


AAGAGAAAAA 


GCAGGGAAAG 


5160 


CAATGTCTCC 


CATTTCTGAG 


TTTTTAGGGG 


TTTCCAGTAA 


CTTTAAAATA 


GCCTCTTGGT 


5220 


CCAGGCTATC 


AATGATGCTA 


GATAATTCGC 


TAGCAATCAA 


TTCTTTTGTA 


TTCATTAAGA 


5280 


GCTCCTTTTT 


GGACTTTTCT 


ACTATTTTAT 


CACAATTTTA 


AAGAAAGAAG 


AAAAAATTTT 


5340 


TCAAATCTCC 


TGTTTTTTTG 


GTATAATATG 


GTTATAAATA 


TAGTTATAAA 


TATGCACGCA 


5400 


AGAGGATTTT 


ATGAGAAAAA 


GAGATCGTCA 


TCAGTTAATA 


AAAAAAATGA 


TTACTGAGGA 


5460 


GAAATTAAGT 


ACACAAAAAG 


AAATTCAAGA 


TCGGTTGGAG 


GCGCACAATG 


TTTGTGTGAC 


5520 


GCAGACAACC 


TTGTCTCGTG 


ATTTGCGG 








5548 


(2) INFORMATION FOR SEQ ID NO: 110: 









(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3132 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 



TACCCGGTAC TCTTAGCAGA CACATCTAGC TCTGAAGATG CTTTAAACAT CTCTGATAAA 60 

GAAAAACTAG CAGAAAATAA AGAGAAACAT GAAAATATCC ATAGTGCTAT GGAAACTTCA 120 * 

CAGGATTTTA AAGAGAAGAA AACAGCAGTC ATTAACGAAA AAGAAGTTC? TAGTAAAAAT 180 

CCTGTGATAG ACAATAACAC TAGCAATGAA GAAGCAAAAA TCAAAGAAGA AAATTCCAAT 240 

AAATCCCAAG GAGATTATAC GGACTCATTT GTGAATAAAA ACACAGAAAA TCCCAAAAAA 300 

GAAGATAAAG TTGTCTATAT TGCTGAATTT AAAGATAAAG AATCTGGAGA AAAAGCAATC 360 

AAGGAACTAT CCAGTCTTAA GAATACAAAA GTTTTATATA CTTATGATAG AATTTTTAAC 420 

GGTAGTGCCA TAGAAACAAC TCCAGATAAC TTGGACAAAA TTAAACAAAT AGAAGGTATT 480 

TCATCGGTTG AAAGGGCACA AAAAGTCCAA CCCATGATGA ATCATGCCAG AAAGGAAATT 540 

GGAGTTGAGG AAGCTATTGA TTACCTAAAG TCTATCAATG CTCCGTTTGG GAAAAATTTT 600 

GATGGTAGAG GTATGGTCAT TTCAAATATC GATACTGGAA CAGATTATAG ACATAAGGCT 660 

ATGAGAATCG ATGATGATGC CAAAGCCTCA ATGAGATTTA AAAAAGAAGA CTTAAAAGGC 720 

ACTGATAAAA ATTATTGGTT GAGTGATAAA ATCCCTCATG CGTTCAATTA TTATAATGGT 780 

GGCAAAATCA CTGTAGAAAA ATATGATGAT GGAAGGGATT ATTTTGACCC ACATGGGATG 840 

CATATTGCAG GGATTCTTGC TGGAAATGAT ACTGAACAAG ACATCAAAAA CTTTAACGGC 900 

ATAGATGGAA TTGCACCTAA TGCACAAATT TTCTCTTACA AAATCTATTC TGACGCAGGA 960 

TCTGGGTTTG CGGGTGATGA AACAATGTTT CATGCTATTG AAGATTCTAT CAAACACAAC 1020 

GTTGATGTTG TTTCGGTATC ATCTGGTTTT ACAGGAACAG GTCTTGTAGG TGAGAAATAT 1080 

TGGCAAGCTA TTCGGGCATT AAGAAAAGCA GGCATTCCAA TGGTTGTCGC TACGGGTAAC 1140 

TATGCGACTT CTGCTTCAAG TTCTTCATGG GATTTAGTAG CAAATAATCA TCTGAAAATG 1200 

ACCGACACTG GAAATGTAAC ACGAACTGCA GCACATGAAG ATGCGATAGC GGTCGCTTCT 1260 

GCTAAAAATC AAACAGTTGA GTTTGATAAA GTTAACATAG GTGGAGAAAG TTTTAAATAC 1320 

AGAAATATAG GGGCCTTTTT CGATAAGAGT AAAATCACAA CAAATGAAGA TGGAACAAAA 1380 

GCTCCTAGTA AATTAAAATT TGTATATATA GGCAAGGGGC AAGACCAAGA TTTGATAGGT 1440 

TTGGATCTTA GGGGCAAAAT TGCAGTAATG GATAGAATTT ATACAAAGGA TTTAAAAAAT 1500 

GCTTTTAAAA AAGCTATGGA TAAGGGTGCA CGCGCCATTA TGGTTGTAAA TACTGTAAAT 1560 
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TACTACAATA 


CAGATAATTG 


GACAGAGCTT 


CCAGCTATGG 


GATATGAAGC 


GGATGAAGGT 


1620 


ACTAAAAGTC 


AAGTGTTTTC 


AATTTCAGGA 


GATGATGCTG 


TAAAGCTATG 


GAACATGATT 


1680 


AATCCTGATA 


AAAAAACTGA 


AGTCAAAAGA 


AATAATAAAG 


AAGATTTTAA 


AGATAAATTG 


1740 


GAGCAATACT 


ATCCAATTGA 


TATGGAAAGT 


TTTAATTCCA 


ACAAACCGAA 


TGTAGGTGAC 


1800 


GAAAAAGAGA 


TTGACTTTAA 


GTTTGCACCT 


GACACAGACA 


AAGAACTCTA 


TAAAGAAGAT 


1860 


ATCATCGTTC 


CAGCAGGATC 


TACATCTTGG 


GGGCCAAGAA 


TAGATTTACT 


TTTAAAACCC 


1920 


GATGTTTCAG 


CACCTGGTAA 


AAATATTAAA 


TCCACGCTTA 


ATGTTATTAA 


TGGCAAATCA 


1980 


ACTTATGGCT 


ATATGTCAGG 


AACTAGTATG 


GCGACTCCAA 


TCGTGGCAGC 


TTCTACTGTT 


2040 


TTGATTAGAC 


CGAAATTAAA 


GGAAATGCTT 


GAAAGACCTG 


TATTGAAAAA 


TCTTAAGGGA 


2100 


GATGACAAAA 


TAGATCTTAC 


AAGTCTTACA 


AAAATTGCCC 


TACAAAATAC 


TGCGCGACCT 


2160 


ATGATGGATG 


CAACTTCTTG 


GAAAGAAAAA 


AGTCAATACT 


TTGCATCACC 


TAGACAACAG 


2220 


GGAGCAGGCC 


TAATTAATGT 


GGCCAATGCT 


TTGAGAAATG 


AAGTTGTAGC 


AACTTTCAAA 


2280 


AACACTGATT 


CTAAAGGTTT 


GGTAAACTCA 


TATGGTTCCA 


TTTCTCTTAA 


AGAAATAAAA 


2340 


GGTGATAAAA 


AATACTTTAC 


AATCAAGCTT 


CACAATACAT 


CAAACAGACC 


TTTGACTTTT 


2400 


AAAGTTTCAG 


CATCAGCGAT 


AACTACAGAT 


TCTCTAACTG 


ACAGATTAAA 


ACTTGATGAA 


2460 


ACATATAAAG 


ATGAAAAATC 


TCCAGATGGT 


AAGCAAATTG 


TTCCAGAAAT 


TCACCCAGAA 


2520 


AAAGTCAAAG 


GAGCAAATAT 


CACATTTGAG 


CATGATACTT 


TCACTATAGG 


CGCAAATTCT 


2580 


AGCTTTGATT 


TGAATGCGGT 


TATAAATGTT 


GGAGAGGCCA 


AAAACAAAAA 


TAAATTTGTA 


2640 


GAATCATTTA 


TTCATTTTGA 


GTCAGTGGAA 


GCGATGGAAG 


CTCTAAACTC 


CAGCGGGAAG 


2700 


AAAATAAACT 


TCCAACCTTC 


TTTGTCGATG 


CCTCTAATGG 


GATTTGCTGG 


GAATTGGAAC 


2760 


CACGAACCAA 


TCCTTGATAA 


ATGGGCTTGG 


GAAGAAGGGT 


CAAGATCAAA 


AACACTGGGA 


2820 


GGTTATGATG 


ATGATGGTAA 


ACCGAAAATT 


CCAGGAACCT 


TAAATAAGGG 


AATTGGTGGA 


2880 


GAACATGGTA 


TAGATAAATT 


TAATCCAGCA 


GGAGTTATAC 


AAAATAGAAA 


AGATAAAAAT 


2940 


ACAACATCCC 


TGGATCAAAA 


TCCAGAATTA 


TTTCCTTTCA 


ATAACGAAGG 


GATCAACGCT 


3000 


CCATCATCAA 


GTGGTTCTAA 


GATTGCTAAC 


ATTTATCCTT 


TAGATTCAAA 


TGGAAATCCT 


3060 


CAAGATGCTC 


AACTTGAAAG 


AGGATTAACA 


CCTTCTCCAC 


TTGTATTAAG 


AAGTGCAGAA 


3120 


GAAGGATTGA 


TT 










3132 


(2) INFORMATION FOR SEQ ID NO: 111: 









( i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 14672 base pairs 
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(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xii SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

CGAGATTTCT TTAAATGAAC TACGTGAAAT CTACCCATCA TCCAGATCTG GATATTCTCT 60 

CCTATCTATA AGTAAAGTTT TAGGAGATTT TAATATAAGT TCTCATGCTT TTAAAGCTTC 120 

GGTAAGAGAT TTAAAACCGC TCAGTTTCCC ACTCATTTGC TTCTGGGAGA GTTCTCATTT 180 

TATTATTCTT GAAAAAATTA GTAAAAACAA GTTTTATATT TTAGATCCTG CAAAAGGCAG . 240 

GCAGAGAATG TCAATAAGTG AATTTGAAAG GCATTATTCA AATATCATTT TAACATTTAA 300 

AAAGTTAGAT AGCTTTATGT CTCGTAAAGA TAATAAGAAG TCGCCTGTTT TAAAGTATTT 360 

TTTTAAGTAT AGGAATAAGC TAGGGATTTT ATTTTTTGTA ACAGCATTAT TGTATGTAAT 420 

ACAATCATTA GTACCTATAG CTAATAGATA CATAATTGAC ACGAATTTCA AGGACGATTC 480 

GTATTCGTCT AGAATGTTAT TTACTATATT ATTTATATTT ACTGTTTCAT TCTCACTAAT 540 

GTATTTATTA AGACAGATAT ATGTTGCATC CTTAAAATAT ATAATGGATA AAGAGATTAG 600 

CTATGATTTT ATGAAACATT TGATATATTT ACCTTACAGT TTTTATGAAA AACGTACTTT 660 

AGGGGATATA CTTTTTAGAG CTAACTCTAT TGTTTATATA AGAGAAATAC TATCAAATAA 720 

TTTTATAGCA CCTATACTTG ATTTGTTAAT GATTGTGGTT TATGCTGTGG TTTTATTTAG 780 

CTTTTCTAAG TACATGGTAA TCTTTTTAAT ATCACTAAGT CTAGCTCTAT CTATTGTAAT 840 

GTATCCAATC ATAAAAATCT CAAAAAATTT AATTGATAAA AATATAAAAG AAAAGGTTAA 900 

TGTTCAAAAT ATTACTTCCG AAGTAATTTC TAAAAATAGT GATATTAAGC TAACTGGAGA 960 

AGAGGAATTT TGGATTAACA AATGGGATAA TTTTAATACA AAACAGCTCA TCATAGGTCG 1020 

AAAACTTGAT ATACATTTAT CAATTGTTAG TAGTATAACG AATGTTTTAC AAATTATTCT 1080 

CCCTGTTTTG ACCCTTATTG TAGGTGTAAA TATAAAAACA TTCGAACAAT TGACGTTAviG 1140 

ACAAATTGTA GCAATAAGTA CAGTCTCACC ATACTTTATT TCTCCTATAA TTTCTTTAAG 1200 

TGATAACTAT ATACAATTAA TGTTATTAAA GGGATATTTT TTAAGAATAG AGGATGTGTT 1260 

TAATACTAAA TCCGAATTAA TTCCAGAAAG AGTCAGTCAA GATATAAAAT TTGATAAAAA 1320 

AATAGAATTA AAAGATATTT GGTATAAATA TGGATTATTT GATGATTATG TTTTGAAAGG 1380 

AATAAATGTT ACTATTAAAA AAGGAGAAAC TGTTCCTATT GTTGGAGAAT CAGGTTCAGG 1440 

TAAGAGTACA TTAGCTAAAA TTTTATTAGG TTTATTAGAA CCTAATATTG GTTCAATAGA 1500 

AGTTGATGGA GTAGAAAAAG AAGAAATTGG TCAAACATTG TATAGAAAGA TTTTTGGAGC 1560 
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AGTGTTACAA 


AATTCAACCC 


TAAGTTATGG 


TACCTTAAGA 


GAGAATTTGA 


CATTTGGACA 


1620 


CTTTGTTTCA 


GATGAAGAAT 


TAATGACAAA 


TCTAAATTCA 


ATTGGTCTTA 


GCAATGTAGT 


1680 


TAAATCTTTA 


CCTCTTGGAT 


TAGAGACAAT 


CATCGCTGAA 


GAAGGTAATA 


ACTTTTCTGG 


1740 


AGGGCAGCAG 


CAAATGATAC 


TTTTAGCTCG 


TTGTCTTTTG 


TCGAAACCTT 


CGGTAGTTGT 


1800 


TTTGGACGAA 


GCAACAAGTA 


GTTTAGATAA 


TTTATCTCAA 


CAAATTACAA 


CTTCTTACTT 


1860 


AAGTGAAATC 


GGTACCACTA 


AGATTTTAAT 


TGCCCATCGA 


CT AG AT ACTA 


TCAAGTCTGC 


1920 


AGATAAGATC 


TTAGTAATGC 


ATAATGGTGA 


AATTGTAGAG 


ATTGGGACCC 


ATAGAGAACT 


1980 


TCTTGAACTA 


GGAGGCATTT 


ATAAGCAATT 


GTATTCAAAT 


AATTAGTTTT 


TGATTAAAAG 


2040 


GGTAAATTTA 


TGAAGATTAT 


GAAAAAAAAA 


TATTGGACTT 


TAGCGATATT 


ATTCTTTTGT 


2100 


TTGTTCAATA 


ATTCTGTTAC 


TGCTCAAGAA 


ATACCTAAAA 


ATCTTGATGG 


CAATATAACT 


2160 


CACACTCAGA 


CTAGCGAAAG 


TTTTTCTGAA 


TCTGATGAAA 


AACAGGTTGA 


CTATTCTAAT 


2220 


AAAAATCAAG 


AAGAAGTAGA 


CCAAAATAAA 


TTTCGTATTC 


AAATCGATAA 


GACAGAATTA 


2280 


TTTGTAACAA 


CAGATAAACA 


TTTAGAAAAA 


AACTGTTGTA 


AATTGGAACT 


TGAACCACAA 


2340 


ATAAATAACG 


ATATTGTTAA 


CTCTGAAAGT 


AATAATTTAC 


TAGGCCAAGA 


TAATTTAGAT 


2400 


AATAAAATTA 


AGGAAAATGT 


TTCTCATCTA 


GATAATAGAG 


GAGGAAATAT 


AGAGCATGAC 


2460 


AAAGATAACT 


TAGAATCGTC 


GATTGTAAGA 


AAATATGAAT 


GGGATATAGA 


TAAAGTTACT 


2520 


GGTGGAGGCG 


AAAGTTATAA 


ATTATATTCT 


AAAAGTAATT 


CTAAAGTTTC 


AATTGCTATT 


2580 


TTAGATTCAG 


GAGTCGATTT 


ACAAAATACT 


GGATTACTGA 


AAAATCTTTC 


AAATCACTCA 


2640 


AAAAACTATG 


TCCCCAATAA 


AGGATATTTA 


GGAAAAGAGG 


AGGGAGAGGA 


AGGAATAATA 


2700 


TCAGATATTC 


AAGATAGATT 


AGGTCATGGT 


ACGGCTGTTG 


TAGCTCAAAT 


TGTAGGGGAT 


2760 


GACAATATTA 


ATGGAGTAAA 


TCCTCACGTT 


AATATTAACG 


TCTATAGAAT 


ATTTCGTAAG 


2820 


TCGTCAGCTA 


GTCCAGATTG 


GATTGTAAAA 


GCAATTTTTG 


ATGCTGTAGA 


TGATGGCAAT 


2880 


GATATTATCA 


ATCTTAGTAC 


TGGACAATAT 


TTAATGATTG 


ATGGAGAATA 


TGAGGACGGA 


2940 


ACAAATGATT 


TTGAAACATT 


TTTGAAGTAT 


AAAAAGGCTA 


TTGATTACGC 


GAATCAAAAA 


3000 


GGAGTAATTA 


TAGTAGCTGC 


ATTAGGGAAT 


GACTCCCTAA 


ATGTATCAAA 


TCAGTCAGAT 


3060 


TTATTGAAAC 


TTATTAGTTC 


ACGCAAAAAA 


GTAAGAAAAC 


CAGGATTAGT 


AGTTCATGTT 


3120 


CCAAGTTATT 


TCTCATCTAC 


AATTTCGGTC 


GGAGGCATAG 


ATCGCTTAGG 


TAATTTATCA 


3180 


GATTTTAGCA 


ATAAAGGGGA 


TTCTGATGCA 


ATATATGCGC 


CTGCAGGCTC 


AACATTATCT 


3240 


CTTTCAGAAT 


TAGGACTTAA 


TAACTTTATT 


AATGCAGAAA 


AATATAAAGA 


AGATTGGATT 


3300 
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TTTTCGGCAA CACTAGGAGG ATATACGTAT CTTTATGGAA ACTCATTTGC TGCTCCTAAA 3360 

GTTTCTGGTG CGATTGCAAT GATTATTGAT AAATACAAAT TAAAAGATCA GCCCTATAAT 3420 

TATATGTTTG TAAAAAAATT CTGGAAGAAA CATTACCAGT AAAAAATGGT ATAAAAGTGT 3480 

TAAATATACC AAACGTATTG AGATATGATT TGAATATGTT ACAATTAGAA TATAAAAATG 3540 

AACAAAGTTG GGATAGTTTC ATAGATAATG TTAATTTAAT TGAGTTGGAA GAGAGAATTC 3S00 

AAACTACTAT TGGAATTAAA CAAATAAACA CACACAATAT TATTACTATT GCCCGAGAAG 3660 

GGTACTCTCA AAATTATTTA CCTAACACTT CAGAAAATAC ATATAATTCA TTACAAGTCA 3720 

GTTTAGTTGG AGT ATT ACTA CTTTTTATAA GTATGGTAAA TATTTTATGG GCTAAAAAAA 3780 

GTAAATGAAA ATAAAATTTG GAGCCCTCTG AAAAAGTAAG TCCTACACTT CAACTAAAAT 3840 

GAGTCAAAAG ATGAATCACC TTGATGTAGG GGAGTTTGTC TTATTGCTGC CTGAACACCT 3900 

CCGTTCAGAG GAAGAACATT ATAAATCTGT TTTTGAAGAC GACTTAACCA GTCGCATATC 3960 

TAGTCAAGAT GAACGACAGC AAATGACTGC TACGGTAGGT TATTTAGAAT CAGGTCAGGA 4020 

TCGTTTTGTG TATAATACGA CCCCTATTTC TTACCAGCAG TTTTTGAAAG ATCCAATCAT 4080 

CATTGTTATA ACACCCCAAT CAACTGGTCC ACAGTCCATT- TTGTTTTGGA TAGACGCAGT 4140 

ACAGAACTAC GTTCTCTTTA ATCAATTGTC TGATGCCCAG GAGCTTATCC AGAGACAAGG 4200 

CATTGAAAAT TGGGTCTCAG AAATGCAAAC AGGTTACCAC AACTACATCA CATTATTGGA 4260 

TAATATCCAG AGGGAACGTT GGGTAATGCT AGCAGGAGCT GTGCTTGGGA TTGCAACTTC 4320 

AATCTTGTTG TTTAACACTA TGAATAGGCT CTACTTTGAA GAATTTACAC GTGCCATTTT 4380 

TATCAAACGC ATTGCAGGTC TCAGGTTCTT AGAAATCCAT CGCACTTATC TCTTTGCTCA 4440 

ACTGGGTGTG TTTTTACTGG GATTTGTTGC GAGTGTATTT CTTCAGGTAG AGATAGGAGT 4500 

TGCTTTCTTA GTCTTGTTAC TCTTTACTGG TCTATCTCTT TTACAGTTAC ATGTCCAAAT 4560 

GCAGAAAGAA AACAAGATGT CCATGCTTGT TTTGAAGGGA GGTTAATATG ATTGAACTTA 4620 

AACAGGTGAG TAAATCTTTT GGAGAACGAG AGTTATTTTC GAATCTTTCA ATGACATTTG 4680 

AGGCTGGAAA AGTCTATGCC TTAATTGGTT CAAGTGGTAG CGGAAAAACA ACCTTGATGA 4740 

ACATGATTGG GAAATTAGAA CCTTATGATG GGACGATTTT TTACCGAGGT AAAGACTTGG 4800 

CCAATTATAA ATCAAGTGAT TTTTTCCGTC ACGAATTGGG CTACCTCTTC CAGAACTTTG 4860 

GCTTAATTGA AAACCAAAGT ATTGAAGAAA ACCTTAAGCT AGGTCTCATT GGTCAAAAGT 4920 

TGAGTCGGTC GGAACAGCGG TTGAGGCAGA AGCAGGCTTT AGAACAGGTC GGCCTGGTTT 4980 

ATCTTGACCT AGATAAGCGC ATCTTTGAGT TATCGGGCGG AGAATCGCAA CGGGTTGCCT 5040 

TGGCAAAAAT TATCTTAAAG AATCCACCCT TTATTCTGGC AGATGAGCCA ACAGCTTCAA 5100 
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TAGACCCAGC 


AACCTCTCAG 


TTGATTATGG 


AGATTTTGCT 


ATCTCTTCGA 


GATGATAATA 


S160 


GGCTAATCAT 


TATCGCAACA 


CATAATCCGG 


CAATTTGGGA 


GATGGCTGAT 


GAAGTGTTCA 


5220 


CGATGGATCA 


TCTGAAATAA 


AAATCCTTGT 


TTTTAATTGC 


ACGATGAGTT 


ACTGAAATAT 


5280 


TATCATGAAT 


CAAGAATTGG 


AGTTAATTTA 


GAATTGTACT 


TAATTTAGAA 


TTGTACTTTA 


5340 


TTAATATTGA 


GGTAACTM-r 


TCTTGATAAA 


GGAAGAAATA 


ATGGAGAGGA 


AGTTAGAATG 


5400 


AAAAAATTCG 


ACAATTATAT 


TATTGAGAAG 


CCTTGCGATT 


CTAATTCAGA 


TAAACTGCAA 


5460 


AAAATCTTAA 


TAATTGAAAG 


TTTGGTAGAT 


GATATTTTGC 


AATTTTCTCT 


CAGAATCAAT 


5520 


AATAGTGTAG 


GAGAGATTTT 


CCTCCTACAA 


CCGTTTTAAA 


AGAAAACTAT 


CTTTATTCCA 


5580 


TGTTATTTTG 


AGGAAGATAT 


TGTGAAAGTC 


AAAGATGATG 


ATAAAGTTGA 


GTGGAATTTG 


5640 


TTAGAATTTC 


AAAAATTTAG 


AGCATTTTTG 


GCTTAGTAAT 


CTGTGTTGAA 


GGCTCAAAAC 


5700 


CTATGGTAAA 


AAAGTAGCTT 


TGAAAACGTA 


TTGCCTCCAA 


AGATTTAGTT 


AAATAATGAT 


5760 


TTAACACAAA 


AAGAAATTAT 


TGAAGTTCTG 


GAAAGATGTT 


GTTTCAGTAT 


TGAGAAAAGG 


5820 


TGGGAAAAAC 


TTGCGATTTT 


CACAGAGAAA 


GGAAGAAAAA 


GTATAGAAAT 


ATAGTCAATT 


5880 


GAAACAAGAA 


CAGGATAAAA 


GAACCTTTTG 


TGCCATATTT 


TTCTCCTTTC 


GCTTTACAAT 


5940 


TGGATTGAAC 


ACCTTTATTG 


TATCGCGTTT 


GGAGTTTTTT 


TGGTATAACC 


TTCGACGCAC 


6000 


ACCCGCATAG 


CGGGTGTTTT 


TTTTGTCTCG 


CACCTAACGG 


AGCGAGACAA 


ACTAATAGTC 


6060 


ACTTAATCAA 


AAAACGCACC 


ATATCAAAAA 


CTAAAAAGTT 


TGATATCATG 


CGTCATGTCT 


6120 


TAAACTAATT 


GACTATACTT 


TCTATTCAAA 


TGACCTTTTA 


ACCAATTGAT 


TGAGCCAATC 


6180 


CACTCTTAAA 


ACCAAAGAGC 


AATTTCTCGC 


TTAGCTGACT 


CTTCTGAATC 


TGAACCATGT 


6240 


ACAACATTTT 


GGATAATCTC 


ATTTTCTCCA 


GCAGCTTTTG 


CAAAATCACC 


TCGAATAGTG 


6300 


CCTGGTAAAG 


CTTCTTC'1'GG 


ACGAGTTGCA 


CCCATCATGG 


TCCGCCAAGT 


TTCGATTACT 


6360 


TTGGGACCAG 


AAATGACACC 


CACAAGAACT 


GGACCTGAAG 


TCATGAATTC 


ACGAATCGGT 


6420 


GGGTAAAAAC 


TCTGACCAAC 


CAAGTCCTGA 


TAGTGCTGCT 


CAATCAACTC 


TTCTGAAACC 


6480 


TGTGAACGAA 


ACTCCAATTT 


TTCGATTGTA 


AATCCACGTT 


GTTCGATGCG 


CTTTAACACT 


6540 


TCACCCACTA 


GCCCTCTTTT 


TACACCATCT 


GCTTTGATGA 


TAAAGAATGT 


TTGTTCCATA 


6600 


CCCGTCTCCT 


TTGTCAGCTT 


CTTTCTTTTA 


TTTTACCACA 


TTTCGTGGAA 


AAATGGAGAA 


6660 


AGTTTTCAGA 


AGAGAGAATG 


AGAGAACCCT 


CGGGTTCTCT 


CATTCTCTCT 


TATTCTACTG 


6720 


TTTCTTCCAC 


AGTTTCAACG 


GCAGTATCCA 


CAACTACTTC 


TGTTGTTTCT 


TCATTTCCTT 


6780 


CTTCCTCTAC 


TGGAGGATTA 


AGGTATTCTT 


CTTCGTTGAC 


AGCATGTGGT 


TCAAGGTTAC 


6840 
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GGTAACGCGC CATACCAGTA CCAGCTGGGA TGATCTTACC GATGATAACA TTTTCTTTAA 6900 

GTCCAAGGAG ATGGTCTTTC TTACCACGGA TAGCTGCCTC AGTAAGGACA CGAGTTGTTT 6960 

CCTGGAAGGA AGCCGCTGAC AAGAAACTGT TTGTTTCAAG TGAGGCTTTG GTAATTCCCA 7020 

TAAGGACTGG GCGACCTGTC GCTGGAACTC CACCTGCGAT AAGGACATCT TTGTTGGCAT 7080 

CTGTAAAGTC ATTGATATCC ATGAGGGTAC CCATGAGAAG ATCTGTATCA CCTGGATCCA 7140 

TGACACGGAC TTTACGGATC ATTTGACGAA CCATTACCTC GATGTGTTTG TCACCGATTT 7200 

CTACCCCTTG GCTACGGTAA ACTTTTTGTA CTTCACCGAG AAGGTACGTT TCAACTGACA 7260 

AGACATCACG AACTGCAAGG AGACGTTTTG GTTGGATAGA ACCTTCTGTC AGAGCAGCAC 7320 

CACGCGCTAC TTGGCCCCCA ACTTCGACAC GCATACGAGC TGTAAATGGA ACGACATATT 7380 

CACCTTCGCC AGTTTCACCC TTAACAAAGA CTTTCTTGGT ACGAGTTGAT GCATCTTCTT 7440 

CGATAGCAGT AACTTGTCCT TTAACCTCTG TAATAACCGC TTCCCCTTTA GCATTGCGGG 7500 

CTTCAAAGAT TTCTTGGACA CGAGGAAGAC CCTGAGTGAT ATCGGTATTT GAGGCAACCC 7560 

CACCTGTGTG GAAGGTACGC ATTGTAAGCT GTGTACCAGG TTCCCCGATA GATTGGGCAG 7620 

CGATTGTACC AACTGCTTCA CCAACTTCAA CCCCATCACC AGTCGCCAAG TTGATACCGT 7680 

AACAGTGACG GCAGACACCG TGACGAGTGT TACATGTAAA TACAGAACGG ATAGTCACTT 7740 

CTTCCACACC AGCATTGACA ATTTCACGCG CCTTGTCTTC TGTAATCAAT TCATTTGGAC 7800 

CAATAATCAC TGCACCAGTT TCTGGATGTT TAACAGTTTT CTTAGTGTAA CGACCGTTGA 7860 

GACGCTCTTC GAGAGACTCG ATCATCTCTT TTCCTTCTGC GATAGAACGG ATCAAGAGAC 7920 

CACGGTCAGT TCCACAGTCG TCCTCACGGA TGATAACGTC TTGGGCAACC TCGACCAAAC 7980 

GACGAGTCAA GTAACCTGAG TCGGCTGTCT TAAGGGCCGT ATCGGTCATA CCTTTACGAG 8040 

CACCGTGAGT TGAGAAGAAC ATTTCCAATA CCGACAAACC TTCGCGGAAG TTTGAAAGGA 8100 

TTGGCAATTC CATGATACGT CCATTCGGAG CAGCCATCAG ACCACGCATA CCGGCAAGCT 8160 

GTGAGAAGTT TGAGATGTTA CCACGGGCTC CAGAGTCCAT CATCATAACG ATTGGCTTCT 8220 

TAGGATCTTG GTTAGCAATC AAGCGTTTCT CAAGTTTTTC ACGGGCAGCA CCCCATTCAG 8280 

CTGTAACAGC ATTGTAACGC TCGTCGTCTG TGATCATACC ACGACGGAAT TGTTTGGTGA 8340 

TTTGTTCGAC ACGTTTGTGT GATTCTTCAA TGATTTCAGC CTTGTCATCA ACGACTGGGA 8400 

TATCGGCAAT ACCCACTGTC AATCCTGCAA GAGTTGAGTG GTGGTAACCG AGGTTCTTCA 8460 

TGCGGTCAAG TAGGGCAGAA GTTTCTGTCG TACGGAAACG TTTGAAGATT TCAGCGATGA 8520 

TATTTCCAAG GTTTTTCTTC TTGAATGGAG GGTTGAGCTC AAGATTGCTG ATAGCTTCCT 8580 

TGATATCTCC ACCAAGTGGC AAGAAGTATT TAGCTGGAAC ACCTTCTGTC AAGTTGGCAT 8640 
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TGTTTGGTTC TTGCAACTAT GGTAGCCCCT CTGGCATGAT ATCGTTGAAG AGAATTTTAC 8*700 

CAACTGTTGT AAGCAAGACC TTATGTCTTT GCTCTTCTGT CCAAGGCTTG TTGAGGCTGT 8760 

CTGTTGCGAT ACCAACACGT GAGTGGAGGT GAACATAACC ATTGCGGTAA GCCATAACCG 8820 

CTTCGTCACG GTCTTTGAAG ACCATTCCTT CACCTTCGCG ACCAGCTTCT TCCATGGTCA 8880 

AGTAGTAGTT ACCCAAAACC ATGTCCTGAG ATGGAGTAAC TACCGGTTTC CCATCTTTCG 8940 

GGTTCAAGAT GTGCTCAGCA GCTAGCATGA GGATACGAGC TTCTGCTTGT GCTTCTTCTG 9000 

AAAGTGGTAC GTGGATGGCC ATTTGGTCCC CGTCAAAGTC AGCATTGTAG GCTTCACAGA 9060 

CAAGTGGGTG CAAGCGAAGA GCCTTACCAT CAATCAACAC TGGCTCGAAG GCTTGGATAC 9120 

CCAAACGGTG AAGGGTCGGT GCGCGGTTCA AAAGCACTGG GTGTTCTTTA ATCACTTCTT 9180 

CAAGGATATC CCAGATACGC TCATCTCCGC GTTCCACCAA GCGTTTAGCT CCTTTGACGT 9240 

TTTGCACGAT ATCACGGCCA ACGATTTCAC GCATGACAAA TGGTTTAAAG AGTTCAATCG 9300 

CCATTTCACG CGGCACACCA CATTGGTACA TCTTAAGAGT TGGACCAACG GCGATAACTG 93 60- 

AACGTCCTGA GAAGTCAACA CGTTTACCGA GCAAGTTTTG ACGGAAGCGT CCTTGTTTAC 9420 

CTTTAAGCAT GTGGCTCAAT GATTTCAATG GACCGCTACC TGGTCCTGTG ATTGGACGAC 9480 

CACGACGACC ATTGTCAATC AAAGCGTCAA CTGCTTCTTG AAGCATACGC TTCTCATTTT 9540 

GAACGATGAT ACCTGGTGCA TTTAACTCAA GCAAACGAGC CAAACGGTTG TTACGGTTGA 9600 

TAACACGGCG GTAAAGGTCA TTCAAGTCAG ATGAGGCAAA ACGGCCACCA TCCAACTGCA 9660 

ACATTGGACG AAGATCTGGT GGGATAACCG GAAGGATCTT AAGAATCATC CATTCAGGTT 9720 

TGTTTCCAGA CTTCTAAAAG GCATCCAAAA CATCCAAACG ACGGATGGCT TTGACACGCT 9780 

TTTGTCCAGT AGCTGTTTTC AATTCTTCTT TCAGTTCAGC AATTTCTTTT TCAAGATCTA 9840 

CTTGCTTCAA AAGGTCTTGG ATGGCTTCCG CACCCATCTT GGCAACAAAT GAACCATAAC 9900 

CATATTCACG CAAGCGCTCT CGGTATTCGC GCTCTGTCAT GATAGACTTG TGCTCAAGTG 9960 

GTGTATCCTT AGGATCAATC ACCACATAAC CCGCAAAGTA GATAACTTCC TCCAGGGCAC 10020 

GAGGGCTCAT ATCAAGGGTC AAGCCCATAC GGCTTGGAAT CCCCTTGAAG TACCAGATGT 10080 

GAGATACAGG AGCTTTCAAT TCGATATGTC CCATACGCTC ACGACGAACT TTCGTACGCG 10140 

TTACTTCAAC CCCACAGCGG TCACAAACAA TTCCTCTGTA ACGAATGCGT TTGTACTTAC 10200 

CACAAGCACA TTCCCAGTCT TTTGTAGGAC CAAAGATCAC TTCATCAAAG AGTCCTTCAC 10260 

GTTCTGGTTT CAAGGTACGA TAATTGATTG TTTCAGGTTT TTTGACTTCT CCATAAGACC 10320 

ATGAACGGAC TTTACTTGGA GAAGCTAGGG TGATTTGCAT ACTTTTAAAA CGATTTACAT 10380 
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CAACCACTAT TTCTTCCCTT TCTATTCTAA GTGAACTGCT TATTCTTGTT CAGCAGCTTC 10440 
TTCTGTTGCT TCCGCTTTTG TTGCTTTCTC AGCTTCTTCA GCTTCAAAGG CTGCTTTAGC 10500 
CTCTTGGGCT GCTTTTTCGC GGGCTTTTTC AAGGTCATCT ACGTGGATGA CATCTTCGTC 10560 
CATTCCTTCA TCCAAGTCGC GAAGTTCCAC TTCTTGGTCA TCTTCGTCTA GGACACGCAT 10620 
GTCAAGACCA AGAGATTGCA ATTCTTTGAC AAGAACTCGG AAGGATTCTG GAACACCTGG 10680 
TTTTGGAATT GGTTTGCCTT TTGTAATAGC TTCATAGGCT TTCAAACGTC CGTTGATATC 10740 
GTCCGACTTG TAAGTCAAGA TTTCTTGAAG GACATTTGAC GCACCGTAGG CTTCAAGAGC 10800 
CCAAACCTCC ATCTCACCGA AACGTTGTCC ACCAAACTGA GCCTTACCTC CGAGTGGTTG 10860 

TTGGCTAACA GTTGAGTATG GTCCGACTGA ACGCGCGTGC AATTTATCAT CAACCATGTG 10920 

GTGGAGTTTG ATCATGTACA TGACTCCGAC AGAAACACGG TTATCAAACG GTTCACCAGT 10980 

ACGTCCATCG TAAAGGATCG TTTTGGCATC GCTATCCATA CCTGCTTCTT TAACAGTTGA 11040 

CCAAAGATCT TCAGAACTTG CTCCATCAAA GACTGGTGTA GCGATGTGAA TACCAAGAGT 11100 

ACGAGCTGCC ATACCAAGGT GAAGCTCCAT AACCTGACCG ATATTCATAC GTGATGGTAC 11160 

CCCAAGTGGG TTCAACATGA TGTCGACTGG AGTTCCGTCT GGAAGGTAAG GCATGTCTTC 11220 

TACAGGAACG ATACGAGAGA CAACCCCTTT GTTTCCGTGA CGTCCGGCCA TTTTATCTCC 11280 

GACCTTAATC TTACGTTTTT GAGCGATGTA AACACGAACC AACATGTTAA CACCTGATTG 11340 

CAACTCATCT CCATTTACAC GTGTAAACAT CTTAACATCA CGAACGACAC CATCGGCACC 11400 

GTGTGGTACA CGAAGAGAAG TATCACGCAC TTCACGAGAC TTGTCTCCAA AGATAGCGTG 11460 

CAAGAGACGT TCTTCAGCTG AAAGATCTTT CTCACCCTTA GGTGTTACTT TACCTACAAG 11520 

AATATCACCT TCTTTAACCT CAGCACCAAT ACGGATAATC CCCATTTCGT CAAGGTCTTT 11580 

GAGGGCATCT TCACCAACGT TTGGAATTTC GCGAGTGATT TCTTCAGGCC CAAGCTTTGT 11640 

ATCGCGCGTT TCTGATTCGT ATTCTTCAAG GTGAACAGAT GTGTAGACAT CGTCCTTCAC 11700 

CAAGCGTTCG CTCATGATAA CGGCATCCTC GAAGTTGTAA CCTTCCCAAG TCATGTAGGC 11760 

AACGATTGGG TTTTGTCCAA GCGCCATTTC TCCATTTTCC ATAGAAGGTC CGTCAGCGAT 11820 

GAAATCGCCT TTTTCAACGA CATCACCAAC TTTTACGAGA GTGCGTTGGT TGTAAGCACT 11880 

ACCTGAGTTT GAACGACGGA ATTTTTGGAT GTGGTAAACA TCCAATGAAC CATCTTCACG 11940 

ACGAACTTCT ACCTTGTCAG CATCTGCGTA AGTAACTTTA CCATCATACT GAGCAATCAC 12000 

AGCCGCACCA GAATCGTGGG CTGCTTGGTA TTCCATACCA GTACCAACGT AAGGTGCCTG 12060 

AGGATTAATC AATGGCACAG CCTGACGTTG CATATTGGCT CCCATGAGGG CACGGTTGGA 12120 

GTCATCGTTT TCCAAGAAAG GAATACATGC TCTCGCAACG GCAACTACCT GTTTTGGTGA 12180 
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AACGTCCATG 


TAGTCAACAA 


> TATTAGCTGG 


ATACTCTTGC 


TTGACCCCTT 


' GGTGACGTCC 


12240 


CATGACAATC 


TTCTCAGCAA 


AGGTTCCATC 


TTCATTCAGA 


CGAGAGTTAG 


CCTGAGCTAC 


12300 


AGTATATTCA 


TCTTCTTCAT 


CAGCTGTCAA 


CCAAACAATT 


TCCTTCGTGA 


CAACACCTGT 


12360 


TTCACGGTCA 


ACCTTACGGT 


ATGGTGTTTG 


AACAAAACCA 


TATTTGTTCA 


AGTGTCCATA 


12420 


AGATGACAAG 


TTATTGATCA 


AACCGATGTT 


AGGTCCTTCA 


GGTGTCTCGA 


TTGGACACAT 


12480 


ACGACCATAG 


TGAGTGTAGT 


GCACGTCACG 


TACTTCATAT 


CCAGCACGGT 


CACGAGTCAA 


12540 


ACCACCAGGT 


CCTAAGGCTG 


ACAAACGGCG 


TTTGTGAGAC 


AACTCAGAAA 


GCGGGTTGTG 


12600 


TTGGTCCATG 


AACTGTGACA 


ACTGTGATGA 


ACCAAAGAAT 


TCTTTAACTG 


CAGCTGTTAC 


12660 


AGGACGGATA 


TTGATAATTT 


GTTGTGGTGT 


CAAGACTTCA 


TTGTCCTGAA 


CAGACATACG 


12720 


TTCACGGACA 


TTACGTTCCA 


TACGAGAAAG 


TCCCAAACGT 


ACTTGGTTGG 


CAAGCAATTC 


12780 


ACCAACCGCA 


CGGATACGAC 


GATTTCCAAG 


GTGGTCGATA 


TCATCTACAC 


GGCCAAGTCC 


12840 


TTCAGCCAAG 


TTGAGGAAGT 


AGCTCATCTC 


AGCAAGGATA 


TCTGCAGGAG 


TCACCGTACG 


12900 


AACCTTGTCA 


TCTGGGTTAG 


CATTACCAAT 


GATCGTTACG 


ACGCGATCTG 


GATCAGTTGG 


12960 


AGCAATAACC 


TTGAATTTTT 


GAAGAACAAC 


AGGCTCAGTC 


ACAACGGCTC 


CATCGTTTGG 


13020 


GATGTAGACA 


ATCTTGTTCA 


AGTCGCCATC 


CAAATGGCTT 


TCAATGCTTT 


CAATCACGCT 


13080 


ACGAGTCATA 


ATCGTACCAG 


CTTCTACCAA 


GATTTCTCCA 


GTTTCAGGGT 


CTACCAATGG 


13140 


CTCTGCAATG 


GTTTGGTTGA 


GCAAACGTGT 


TTTAACATTG 


AGTTTTTTAT 


TGATTTTGTA 


13200 


ACGACCAACT 


GCTGCCAAGT 


CATAACGACG 


TGGGTCAAAG 


AAGCGAGCTA 


CAACCAAGCT 


13260 


ACGTGAGCTT 


TCAGCCGTCT 


TAGGCTCACC 


TGGACGAAGG 


CGTTCGTAAA 


TTTCTTTCAA 


13320 


GGCTTCGTCT 


GTACGAGAGT 


CCATTGGATT 


CTTGTGGATA 


TCTTTTTCAA 


CAGTGTTGCG 


13380 


AACCAATTCG 


CTGTCACCAA 


AGATATCAAA 


GATTTCATCA 


TCACCTGAGA 


AACCAAGAGC 


13440 


ACGAACCAAG 


GTTGTAAATG 


GAATCTTACG 


AGTACGGTCG 


ATACGAGTGT 


AGGTGATATC 


13500 


TTTTGAGTCG 


CTTTCAAGTT 


CCAACCAAGC 


TCCACGGTTA 


GGGATAACAG 


TTCAACCATA 


13560 


GCCCACCTTA 


CCATTTTTGT 


CTACTTTGTC 


GTTAAAGTAA 


ACACCTGGTG 


AGCGGACCAA 


13620 


CTGAGAAACG 


ATAATACGTT 


CACCACCATT 


GATGATGAAA 


GTACCCATTT 


CTGTCATGAT 


13680 


TGGGAAATCA 


CCAAAGAAAA 


CTTCTTGGGT 


CTTGATTTCG 


CTTGTTTCTT 


TATTGATCAA 


13740 


ACGGAAGGTT 


ACAAAAATTG 


GTGCTGAGTA GCTAGCATCG 


TGGATACGAG 


CTTCTTCTAG 


13800 


CGTATATTTT 


GGTTCCTTGA 


TTTCATATCC 


AACAAATTCC 


AACTCCATTG 


TGTCTGTGAA 


13860 


GTTTGAAATT < 


GGCAATACAT 


CTTCAAACAC 


TTCCTTAAGA 


CCGTGGTCTA ' 


GGAAAGCTTT 


13920 
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GAATGAGTCA GTTTGAATTT CAATCAAATT TGGTAAGTCA AGAACTTCTT TGATTCTTGA 13980 

AAAACTACGA CGGGTACGAT GTTTCCCGTA TTGAACGTCA TGTCCTCCCA AGATGATTCT 14040 

CCTTTGTAAA TAAGTTCCAA GCCTTGTCAA TCAGGCTTTT CTAATCGTCA TATGGTTGTA 14100 

AACCCCTTAT CACCGTGTCC TCTTGACGAA TTTTCAGAAT CTTTAAGCCT CTGTTACAAA 14160 

TGCTCAAAAT CTTGAAAAAA AGCACAAAAA GAGCAGCTAA ATCTGACTTT TTCAGAAGAT 14220 

TTAACTGCTG TGAGCCTTGT CTGGACAATA TTTCAGACAA AACCTACGAC AAATGATTAC 14280 

CCATATTATA CCCTATTTAG CTAGATTTTT CAAGGGGTTT CAGTAGGTTT TTGGTAAATT 14340 

TTTTCCCATA GAAAACTTGG CATCACATTC GAATCACGCT ATGGTACAAA AAACTGAAAA 14400 

AACTATTGAC T G AAAAT CAT TTTCAAGGTA TAATAATAAA CGTTAAGGCG GTATAGCCAA 14460 

GTGGTAAGGC ACGGCTCTGC AAAAGCTTGA TCGTCGGTTC AAATCCGTCT ACCGCCTTCT 14520 

ATAACTTGAT TTATCAGGTT TCAAATGAAC AGAAAGCCCA ATTTGAAGGG CTTTTTTTAT 14580 

TTTCCCTCGA ATAAATACGT ATAACTTTAA AAACTTTTGG AGCGAGTTTG TGGCAGAGTT 14 640 

CTTTCCATGG CATAATTCCC TTTTGAAATC AG 14672 



(2) INFORMATION FOR SEQ ID NO: 112: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7902 base pairs 

(B) TYPE: nucleic acid 
tC) STRANDEDNESS : double 
ID) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

AGGAGACTAT TCAAGCCCAA ATTgAGTAGC CCAGCAAAGA CTGTATAGAC TGTGATACGT 60 

TTTTCATAGC CATTGGTAAA GAGAATTTGG CAACCAAGAA TCGTATCTAA GGCCAGGATA 120 

ATCGTACGAA AAGCGAAGAG AGAGGTCAAG ATGCCGCCTC CGATATATTT TTCACTACCG 180 

TAAAGTAGGA TGGCATTTGG TCCTAAAACC ATGAGTCCAA AACTCAGTGG AATGATAAAG 240 

AAGTTAAAGA TTCGACTACC TCTATTAACC AGAGAAACAT AGGCTTCTTT GTCTCCTTTC 300 

CCCAGATAGT AACTGAGACG AGGCACACTC ACTCCAATTG CACCTGTTAC AACCCCAGCT 360 

ATAACGGTCA CAATTCGCTG AGCTATGGTA TAGTAACTAA CGTTGACATC AATCCCTGTT 420 

TTAACGAGGA AGAGGCGATC TAAAAAAGTG AAGAGCATAT TGGCATTGGC AAAGACTAAC 480 

ATGGCTGTCA GAGGGAGAAA GAGTGGTTTA AAATCACTTA GGTGAATTTT AACAAGTTTG 540 

ATGTCTCTTT TAATCCAAAA ATAACTAATC AGGTAGTTAA TCAGCGTCGA TAAACTCATC 600 

ACAAGTGTAT AGACAACAAT ATCGTGTTCA TTTTTAACAA ATAAGAAAAT AGAGACCAGC 660 
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ATCAGGATAC GGATGAAGGC AGTTTTGTAA AAGAGAAAAC TGTAATTTTC CAGAGCTTCA 720 

TTGACCCATT CGATTGAAAA AATCTGGGCA ATGAGTTGAA TCCCCATAAC AAGGTAGACC 780 

TTTTTGACGA TTGGATTATC AGTAAAGAAG AGAGGATAGG CTAGGATATA GACAGCAGTG 840 

GTCAAAATCG TACAAGCGAT GCACAAATAA AAAAGACTAG AAAAGGTTCT GTTAAGATCT 900 

TTTTTGTTAT CCTTGACATT ACTGATAGCC CTTAAACCGT AGTTATAGAC ACCATAAGTT 960 

GCAAAGGGCA AGAAAAATGA CAAAATAGTG TCGACTGAGT TGAAGTAACC ATAGTCAGTT 1020 

CGGTCCAAGA CACGCGCGAC ATAGGTTCCA GTTAGGATGG GAAAAATAAT ATTCAAGACA 1080 

CGAATTCCCA TGTAAGATAG AGCATTTAAT TTTATACTTT TCATTCAATT TACCTCGTTT 1140 

TTCATTATAT catAAACTTA GCTAATAAGA AATGAAGGGC AGTAAGTCAA GTAATCACTT 1200 

TGAAGTTTCA AATCTTAAGT TTTAAGTTTT CTTTAAGGAA AGTATATTAT TCTGAAGGAC 1260 

TCTAAAATTT CGCAGCCATT TATTAGTAAT TGCTACAGAA TTCCTAGTCA TT ACT AG AAA 1320 

TGGACTAGTT TCTTTGAATA ATAGAACTGC ATAATTCTCC TATTCTAGAA GGGGAGGACC 1380 

AGTATTTCTT TTATGATAGG ACTAGATTGT GGTATAATAG AGAGAATAAG TTTTTTTAGT 1440 

AAGACAAAGG AGAAAATAGA TGATTTATGC AGGAATTCTT GCCGCTGGAA CTGGCACACG 1500 

CATGGGGATC AGTAACTTCC CAAAACAATT TTTAGAGCTA GGTGATCGAC CTATTTTGAT 1560 

TCATACAATT GAAAAATTTG TCTTGGAGCC AAGTATTGAA AAAATTGTAG TTGGTGTTCA 1620 

TGGAGACTGG GTTTCTCATG CAGAAGATCT TGTAGATAAA TATCTTCCTC TTTATAAGGA 1680 

ACGTATCATC ATTACAAAGG GTGGTGCTGA CCCCAATACA AGTATTAAGA ACATCATTGA 1740 

AGCCATTGAT GCTTATCGTC CGCTTACTCC AGAGGATATC GTTGTTACCC ACGATTCTGT 1800 

TCGTCCATTT ATTACACTTC GCATGATTCA GGACAATATC CAACTTGCCC AAAATCATGA 1860 

CGCAGTGGAC ACAGTGGTAG AAGCGGTTGA TACTATCGTT GAAAGTACCA ATGGTCAATT 1920 

TATTACAGAT ATTCCAAATC GTGCTCACCT TTATCAAGGA CAAACACCTC AAACATTCCG 1980 

TTGCAAGGAC TTCATGGACC TTTATGGATC TCTTTCTGAT GAAGAGAAGG AAATCTTGAC 2040 

AGATGCATGT AAAATCTTTG TGATCAAAGG AAAAGATGTG GCTTTGGCCA AAGGTGAATA 2100 

CTCAAATCTG AAGATTACAA CCGTAACAGA TTTGAAGATT GCAAAAAGTA TGATTGAGAA 2160 

AGACTAGTAA AATGATTAAT CAAATTTATC AACTAACTAA GCCTAAGTTT ATCAATGTCA 2220 

AATATCAGGA AGAGGCTATT GACCAAGAGA ATCATATCCT TATCCGTCCC AACTACATGG 2280 

CTGTCTGTCA TGCGGATCAG CGTTACTATC AGGGAAAACG TGATCCCAAG ATTTTGAATA 2340 

AAAAGCTTCC AATGGCAATG ATTCACGAGT CATGTGGAAC CGTCATTTCT GACCCGACCG 2400 
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GAACCTACGA GGTTGGTCAA AAAGTTGTCA TGATTCCCAA TCAGTCTCCT ATGCAGAGTG 2460 

ATGAAGAATT CTATGAAAAC TACATGACAG GGACCCATTT CTTGTCTAGT GGATTTGATG 2520 

GCTTTATGAG AGAGTTTGTT TCTCTCCCTA AAGATCGTGT GGTGGCTTAT GATGCTATTG 2580 

AAGATACGGT TGCAGCCATT ACAGAGTTTG TCAGTGTGGG CATGCACGCT ATGAATCGTC 2640 

TATTGACTCT TGCTCATAGC AAGCGGGAGC GGATCCCCCT TATTGGAGAT GGAAGTTTAG 2700 

CTTTTGTGGT TGCCAATATT ATCAACTATA CTTTGCCAGA AGCAGAGATT GTGGTTATTG 2760 

GTCCTCATTG GGAAAAGTTG GAACTCTTCT CATTTGCCAA AGAATGCTAT ATTACGGATA 2820 

ATATTCCTGA AGATTTGGCC TTTGACCATG CTTTTGAATG TTGTGGTGGT GATGGTACTG 2880 

GACCAGCTAT TAATGACTTG ATTCGCTACA TTCGTCCTCA GGGAACGATT CTCATGATGG 2940 

GAGTTAGCGA ATATAAAGTC AATCTCAATA CTCCCGATCC CTTAGAAAAG GGCTTGATTT 3000 

TGGTTGGGTC ATCTCGTTCT GGTCGCATTG ATTTTGAAAA TGCTATCCAA ATGATGGAAG 3060 

TCAAGAAATT TCCCAATCGT CTTAAAAATA TCCTTTATCT AGAAGAACCT GTAAGAGAAA 3120 

TTAAAGATAT TCATCGTGTC TTTGCAACCG ATTTAAACAC AGCCTTTAAA ACAGTGTTTA 3180 

AGTGGGAAGT ATAAGTACTG GAGGTTAATT GTGGAGAAAA TCATTAAAGA AAAAATTTCT 3240 

TCCTTACTTA GTCAAGAAGA GGAAGTCCTC AGTGTTGAAC AACTGGGTGG AATGACCAAT 3300 

CAAAACTATT TGGCCAAAAC AACAAATAAG CAATACATTG TTAAATTCTT TGGTAAAGGG 3360 

ACAGAAAAGC TTATCAATCG ACAAGATGAA AAGTACAATC TTGAACTACT AAAGGATTTA 3420 

GGCTTAGATG TAAAAAATTA TCTTTTTGAT ATTGAAGCTG GTATCAAAGT AAATGAGTAT 3480 

ATCGAATCTG CGATTACGCT TGATTCAACG TCAATCAAGA CCAAGTTCGA CAAAATTACT 3540 

CCAATATTAC AAACTATTCA TACGTCTGCT AAGGAATTAA GAGGAGAATT TGCTCCTTTT 3600 

GAAGAAATCA AAAAATACGA ATCCTTGATT GAAGAACAAA TTCCTTATGC CAACTATGAA 3660 

TCTGTTAGAA ATGCAGTCTT CTCCTTAGAG AAAAGACTGG CTGACTTAGG TGTTGACAGA 3720 

AAATCTTGTC ATATCGATTT GGTGCCTGAA AACTTTATCG AATCACCTCA AGGACGACTT 3780 

TATTTCATTG ACTGGGAATA TTCATCAATG AATGATCCAA TGTGGGATTT GGCTGCCCTC 3B40 

TTTTTAGAGT CTGAATTCAC TTCCCAAGAG GAAGAAACTT TCTTATCTCA CTATGAGAGT 3900 

GACCAAACAC CGGTTTCTCA TGAAAAGATT GCTATTTATA AAATTTTACA AGATACTATT 3960 

TGGAGTCTAT GGACTCTCTA TAAGGAAGAG CAAGGTGAAG ATTTTGGTGA CTATGGTGTG 4020 

AATCGTTACC AAAGAGCTAT TAAAGGTTTG GCTTCTTATG GAGGTTCAGA TGAAAAGTAA 4080 

AAACGGAGTT CCTTTTGGCC TTCTCTCAGG TATTTTCTGG GGCTTGGGTC TAACGGTTAG 4140 

TGCTTATATC TTTTCGATTT TTACAGATTT GTCACCCTTT GTGGTGGCTG CAACTCATGA 4200 
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TTTTTTGACC ATCTTTATCT TACTAGCTTT TCTCTTGGTA AAAGAAGGGA AAGTTCGCCT 4260 

CTCAATTTTC TTAAATATTC GCAATGTCAG TGTTATCATC GGAGCCTTGC TAGCAGGCCC 4320 

TATCGGTATG CAGGCCAATC TTTATGCAGT TAAGTATATC GGAAGTTCTT TAGCTTCATC 4380 

TGTATCGGCT ATTTACCCTG CGATTTCAGT TCTATTGGCT TTCTTCTTTT TGAAGCACAA 4440 

GATTTCGAAA AATACTGTAT TTGGGATTGT CTTGATTATT GGAGGGATTA TTGCTCAGAC 4500 

CTATAAGGTT GAACAGGTTA ATTCTTTCTA CATTGGGATT CTTTGTGCTT TGGTTTGTGC 4 560 

TATTGCATGG GGAAGTGAGA GTGTTCTTAG CTCTTTTGCC ATGGAAAGTG AATTGAGTGA 4620 

AATCGAAGCC CTCTTAATCC GTCAAGTAAC TTCGTTCTTG TCCTATCTTG TGATTGTGCT 4680 

CTTCTCTCAT CAGTCATTTA CTGCAGTAGC CAATGGACAA TTGCTAGGTC TCATGATTGT 4740 

TTTTGCAGCC TTTGATATGA TTTCCTACTT GGCTTATTAT ATCGCTATCA ATCGCTTGCA 4800 

ACCAGCCAAG GCTACAGGCT TGAACGTGAG CTATGTAGTA TGGACGGTCT TGTTTGCAGT 4860 

TGTTTTCTTG GGTGCACCGC TAGATATGCT GACCATTATG ACGTCACTTG TCGTCATTGC 4920 

TGGAGTTTAT ATTATTATTA AAGAATAAAG GAGATTCGTG TGAAAGCCAT TATCTTAGCA 4980 

GCGGGATTGG GAACTCGCTT GCGTCCTATG ACTGAAAATA CCCCTAAAGC CTTGGTTCAG 5040 

GTTAATCAAA AACCTTTGAT TGAGTACCAA ATTGAGTTTC TCAAAGAAAA AGGAATCAAT 5100 

G AC AT CATC A TCATTGTTGG TTATCTTAAA GAACAATTCG ATTACTTGAA AGAGAAATAC 5160 

GGTGTTCGTC TCGTTTTCAA TGATAAATAC GCTGACTACA ATAACTTTTA CTCTCTCTAT 5220 

CTTGTAAAAG AAGAATTGGC CAACAGCTAT GTTATTGATG CTGACAATTA TCTCTTTAAA 5280 

AATATGTTCC GCAATGATTT GACACGTTCG ACTTATTTTA GTGTTTATCG TGAAGATTGT 5340 

ACCAACGAAT GGTTCTTGGT TTATGGAGAT CACTACAAGG TTCAAGACAT TATTGTTGAT 5400 

AGCAAGGCAG GTCGCATCCT TAGTGGTGTA TCCTTCTGGG ATGCTCCAAC TGCAGAAAAG 5460 

ATTGTCAGCT TTATCGACAA GGCTTATGTA AGTGGTGAAT TTGTTGATCT CTATTGGGAC 5520 

AATATGGTTA AGGATAATAT CAAAGAGCTA GATGTCTATG TTGAAGAATT AGAAGGCAAT 5580 

AGCATTTATG AGATCGATAG TGTCCAAGAC TATCGTAAAT TAGAAGAAAT TCTTAAAAAC 5640 

GAAAATTAAA GATTCCAACA TCTGACAAAA TAGTCGGATG TTTTTTGATT TTTTACGAAC 5700 

TTTTACGAAT AGATAGATGA GTAGAAAAAG AAATGGAGTT ATTTATGAAA ATCACAAACT 5760 

ATGAAATCTA TAAGTTAAAA AAATCAGGTT TGACCAATCA ACAGATTTTG AAAGTGCTAG 5820 

AATACGGTGA AAATGTTGAT CAGGAGCTTT TGTTGGGTGA TATTGCAGAT ATCTCAGGTT 5880 

GCCGTAATCC AGCCGTTTTT ATGGAACGTT ATTTTCAGAT AGACGATCCG CATTTGTCGA 5940 
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AAGAGTTTCA AAAATTTCCA TCTTTCTCTA TTTTAGATCA CTGTTATCCT TGGGATTTGA 6000 

GTGAAATATA TGATGCGCCT GTACTTTTAT TTTACAAGGG AAATCTTGAC CTCCTGAAAT 6060 

TCCCGAAGGT AGCGGTCGTG GGCAGTCGTG CTTGTAGCAA ACAGGGAGCT AAGTCAGTTG 6120 

AAAAAGTCAT TCAAGGCTTG GAAAATGAAC TGGTTATTGT CAGTGGTCTG GCCAAGGGCA 6180 

TTGACACAGC AGCTCATATG GCAGCTCTTC AGAATGGCGG AAAAACCATT GCAGTGATTG 6240 

GAACAGGACT GGATGTGTTT TATCCTAAAG CCAATAAACG CTTGCAAGAC TACATCGGCA 6300 

ATGACCATCT GGTTCTAAGT GAATATGGAC CTGGTGAACA ACCTCTGAAA TTTCATTTTC 6360 

CTGCCCGTAA TCGCATCATT GCTGGACTTT GTCGTGGTGT GATTGTAGCA GAGGCTAAGA 6420 

TGCGTTCAGG TAGTCTCATT ACGTGTGAGC GAGCAATGGA AGAAGGACGC GATGTCTTTG 6480 

CTATTCCTGG TAGCATTTTA GATGGACTAT CAGACGGTTG CCATCATTTG ATTCAAGAAG 6540 

GAGCAAAATT GGTCACCAGT GGGCAAGATG TTCTTGCGGA ATTTGAATTT TAAAAATGAC 6600 

CTAAGCTAGA ATTCTAAGAA AAAATCAATT TTAAGAGAAA ATGAACCCAA CATTTCCATA 6660 

ATAAAACGCA TATTAGCAAG TTTTTAACAC TTGATAATAT GC G TTTTTTC TAAGTGGATT 6720 

AGTAGAGTAG ACGATTTTTC TCATATAATA CTCTTCGAAA ATCTCTTCAA ACTACGTCAG 6780 

CTTCCATCTG CAACCTCAAA ACAGTATTTT GAGCgaCTtC GTCAGTCTTA TCTACAACCT 6840 

CAAAGCAGTG CTTTGAGCAA CCTGTGGCTA GCTTCCTAGT TTGCGCTTTG ATTTTCATTG 6900 

AGTATAAGGG AAAGTATAGT GAATTGAAAT AAGATGTGAA CAACTCTATC AGGAAAGTCA 6960 

AATTAATTTA TAGAAATATT TTACCAGCCA AGGTGTACTG TTATAGATTC AATTACACTA 7020 

TAATTTAGTG TAATTGAGAA AGGAGAAATG ATTGTGATTG ATGTTGGCTA GGTTATGTTC 7080 

AATGATTCCT ACCGTCTCAA ATCTTGTCAG TAAGGAAAAA TAAATTCTTC AAAAGTAGAG 7140 

ATTACAAGGC TTGTTTAAGA AAGAATTCAA AGACCTTGAC AAATAAAAAT AAAATGGTTA 7200 

TTATAAAAAA TGGTCTGAAA TAGATGATGA TACTTTTCGA AAATCTCTTC AAATACGTCA 7260 

GCTCAGCTTT GCCTTGCTGT GTTTTGAGCA AGCTACGGTT AGCTTCCGAG TTTGATTT'l'C 7320 

ATTTACTAGA AATGAAACTG ATGAGAGATA TCAGTAGACA TTTGAGTCAG GATATTATGG 7 380 

AAAATGATAA AAAGAGCTCG TGAGATTGGC ATATCAGACT ACTAAAGTAT TGAGTTTGTT 744 0 

AGGATTTTAG CGACTAGTTA GCTGGGAAAG GAAGATATTT GTGACAAATA ATAAACTGTA 7500 

TTCGTTGATA GAATTTAGAA ATAAAATATA TGAAGAATTA GAACTTTCCA GAAGTGATTT 7560 

AGCGATTTTA CTATGTCCCA TGCTTATCGC CTCTATCGGA TTAAATATGG ATTCGACTCC 7620 

CGTGATTATT GGAGCCATGT TAATCTCTCC TTTGATGACA CCTATTCTGG GAGTGGGGCT 7680 

CTCTCTAGCT ATATTTGATT TTAAATTGTT AAGAAAATCT TTTAAAATAT TAGCTATTCA 7740 
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AATTCTTGCC AGTCTAATAG CTTCAACACT TTATTTTTAT CTTTCTCCCA TTTCGTATGC 



7800 



TAGTTCGGAG ATTGTTGCTA GAACCTCTCC GACTATTTGG GATGTTCTCA TTGCTTTTGT 



7860 



AGGAGGGATA GCAGGTATCA TTGGTGCTAG GAAAAAAGAG AC 



7902 



(2) INFORMATION FOR SEQ ID MO: 113: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18627 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

GAAGTTGAAA TGGCCAGCTG ATGAGCAATA TCGGTCATAG AAATCTTCTC AATCAACTTT 60 

TGCGCAATTT TTTGGTTGAT AATACGAGGA ATTTGCTGAT TTTTCTTGAC GATAGAAGTT 120 

TCAGCGACCA TCATTTTTGA ACAGTGATAG CACTTGAAAC GACGCTTTCT AAGTAGAATT 180 

CTAGTAGGCA TACCAGTTGT CTCAAGGTAA GGAATCTTAG ACGGTTTTTG AAAGTCATAT 240 

TTCTTCAATT GGTTTCCGCA CTCAGGGCAA GATGGGGCGT CGTAGTCCAG TTTGGCGATG 300 

ATTTCCTTGT GTGTATCTTT ATTGATGATG TCTAAAATCT GGATATTAGG GTCTTTAATG 360 

TCTAGTAATT TTGTGATAAA ATGTAATTGT TCCATATGAA TCTTTCTAAT GAGTTGTTTG 420 

GTCGCTTTTC ATTATAGGTC ATATGGGACT TTTTTTCTAC AATAAAATAG GCTCCATAAT 480 

ATCTATAAGG GATTTACCCA CTACAAATAT TATAGAGCCA AAAATCCTTT GTTTACTAAA 540 

CAAGGGATTT TTCTTTTGTC TCTGCTCCTT TTTTGATATA ATAGTTCTAT GTTAAAATCA 600 

GAAAAACAAT CACGTTATCA AATGTTAAAT GAAGAATTGT CCTTCCTATT GGAAGGCGAA 660 

ACCAATGTTT TGGCTAATCT TTCCAACGCC AGTGCTCTCA TAAAATCACG TTTTCCTAAT 720 

ACCGTATTTG CAGGCTTTTA TTTGTTCGAT GGAAAGGAAT TGGTTTTAGG CCCCTTCCAA 780 

GGAGGTGTTT CCTGCATCCG TATTGCACTA GGCAAGGGTG TTTGTGGTGA GGCAGCTCAC 840 

TTTCAGGAAA CTGTTATTGT TGGAGATGTG ACGACCTATC TCAACTATAT TTCTTGTGAT 900 

AGTCTAGCTA AAAGTGAAAT TGTGGTGCCG ATGATGAAGA ATGGTCACTT ACTTGGAGTT 960 

CTGGATCTGG ATTCTTCAGA GATTGAGGAT TACGATCCTA TGGATCGAGA TTATTTGGAA 1020 

CAATTTGTCG CTATTTTGCT TGAAAAGACA GCATGGGACT TTACGATGTT TGAGGAAAAA 10 B0 

TCTTAATGTA TCAAGCACTT TATCGAAAAT ATAGAAGTCA AAACTTCTCC CAGTTAGTTG 1140 

GTCAAGAAGT TGTGGCTAAG ACTCTTAAAC AAGCGGTGGA GCAAGAGAAA ATAAGTCACG 1200 
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CTTATCTTTT TTCTGGTCCT CGTGGAACGG GAAAAACCAG TGTTGCTAAA ATCTTTGCCA 1260 

AGGCTATGAA CTGTCCCAAT CAAGTGGGTG GCGAACCTTG CAATAACTGC TATATTTGTC 1320 

AAGCAGTGAC GGACGGTAGT TTAGAAGATG TCATTGAAAT GGATGCAGCT TCTAATAATG 1380 

GGGTAGATGA AATTCGCGAA ATTCGTGATA AATCTACCTA TGCGCCTAGC CTTGCTCGTT 1440 

ATAAGGTTTA TATCATAGAT GAGGTTCACA TGCTGTCTAC AGGGGCTTTT AATGCCCTCC 1500 

TAAAGACGCT GGAAGAACCA ACACAGAATG TAGTCTTTAT TTTGGCCACT ACTGAATTGC 1560 

ACAAGATTCC TGCTACTATT CTATCCCGTG TGCAACGTTT TGAGTTTAAA TCAATTAAGA 1620 

CACAGGATAT TAAGGAACAT ATTCACTATA TCTTAGAAAA AGAAAATATC AGTTCTGAAC 1680 

CAGAGGCTGT GGAAATCATT GCCAGACGGG CCGAAGGTGG AATGCGGGAC GCCTTGTCTA 1740 

TTTTGGATCA AGCCCTGAGT TTGACACAGG GAAATGAGCT GACGACTGCT ATCTCTGAAG 1800 

AAATTACTGG CACCATTAGC CTATCAGCCT TGGATGATTA TGTGGCGGCC TTGTCTCAAC I860 

AGGATGTTCC CAAAGCTTTG TCTTGCTTGA ATCTTCTTTT TGACAATGGT AAGAGCATGA 1920 

CTCGTTTTGT GACCGATCTT TTGCACTATT TAAGAGACTT GTTAATTGTT CAAACAGGGG 1980 

GAGCAAATAC TCATCATAGT TCAGTCTTTG TAGAAAATTT GGCACTTCCT CAAAAAAATC 2040 

TGTTTGAAAT GATTCGCTTA GCAACAGTGA GTTTAGCAGA TATTAAGTCT AGTTTGCAAC 2100 

CCAAGATTTA TGCTGAAATG ATGACCGTCC GTTTGGCGGA AATCAAGTCC GAACCAGCTC 2160 

TATCAGGAGC GGTTGAAAAT GAAATTGCTA CGCTGAGACA GGAAGTTGCC CGTCTCAAAC 2220 

AAGAGCTTTC TAATGTAGGT GCGGTTCCTA AACAACTTGC ACCAGCTCCT AGTCGACCAG 2280 

CTACGGGCAA AACAGTCTAT CGTGTCGATC GCAATAAAGT GCAATCTATC TTACAAGAGG 2340 

CCGTCGAAAA TCCTGATTTA GCACGTCAAA ATTTAATTCG TTTGCAGAAT GCCTGGGGAG 2400 

AGGTAATTGA AAGTCTAGGT GGGCCGGACA AGGCTCTGCT AGTTGGTTCT CAACCGGTTC 2460 

CTGCCAATGA ACACCATGCT ATTCTTGCTT TTGAGTCTAA CTTCAATCCT GGTCAAACTA 2520 

TGAAACGAGA CAATCTCAAT ACCATGTTTG GTAATATCCT CAGTCAGGCG GCAGGTTTTT 2580 

CACCTGAGAT TTTAGCTATT TCCATGGAGG AATGGAAAGA AGTTCGCGCA GCCTTTTCAG 2640 

CCAAAGCCAA ATCTTCTCAA ACTGAAAAAG AAGTAGAAGA AAGCCTGATT CCAGAAGGAT 2700 

TTGAATTTTT GGCTGATAAA GTGAAGGTAG AGGAAGACTA AAGAAAGATT TCATGATACA 2760 

ATAAGTTTAT GAATAAACAA CAATTTATTA TTATGGCGCT GTTTACAGCT CCTGAGACCT 2820 

ATTTTTTCAA TGAAGCCTGG ATGACTGGCC GCTATATTAT GGCAGCCTTT TGGGCAATTT 2880 

TACTCTTTAG AAATTTCCGA GTCAGTTATG TGATGGGCAA AATCGTTGAT GTCATCGATC 2940 

AGCATTTTAA TAGGAAAGAC TAGCCCTCAG CTTCCAGACA AAATCAAAGC CTTTTAGGCT 3000 
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TTTTTTTGTT ATACTAGAAA AGTATATTTA TAGAATTTTT GCTCTATTTC TGGCGAAATC 3060 
AGACGTTTTT CTAGTAAGTA CTGTAAAAGT TTTGAAAAAG AAAGGAACTA TCATGTCAGT 3120 
ATTAGAGATC AAAGATCTTC ACGTTGAGAT TGAAGGAAAA GAAATTTTAA AAGGGGTTAA 3180 
CCTGACCCTG AAAACAGGAG AAATTGCCGC TATCATGGGA CCAAATGGTA CAGGTAAATC 3240 
GACTCTTTCT GCCGCTATCA TGGGAAATCC AAACTATGAA GTAACTAAAG GTGAAGTTTT 3300 
GTTTGATGGC GTAAACATCC TTGAGTTGGA AGTGGATGAG CGTGCGCGTA TCGGACTTTT 3360 
CCTTGCTATG CAATACCCAT CAGAAATCCC TGGAATTACC AATGCTGAGT TTCTTCGTGC 34 20 

CGCTATGAAT GCGGGTAAAG AAGATGATGA GAAGATTTCA GTTCGTGAGT TTATTACTAA 34 80 

GCTAGATGAA AAAATGGAAT TGCTCAACAT GAAAGAAGAA ATGGCAGAGC GTTACCTCAA 3540 

CGAAGGCTTC TCTGGTGGTG AGAAAAAACG CAATGAAATT CTTCAACTTT TGATGTTGCA 3600 

GCCAACATTT GCTCTTTTGG ACGAGATTGA CTCAGGTCTT GATATTGACG CTCTTAAAGT 3660 

TGTGTCTAAA GGTGTCAATG CCATGCGTGG TGAAGGTTTT GCTCCTATGA TCATCACTCA 3720 

CTACCAACGT CTTTTGAACT ATATCACACC TGATGTGGTA CACGTGATGA TGGAAGGTCG 3780 

TGTTGTCCTT TCTGGTGGTC CAGAATTGGC TGCGCGTTTG GAACGTGAAG GATACGCAAA 3840 

ATTAGCTGAA GAACTTGGCT ACGACTACAA GGAAGAATTG TAATTCCCTC GTATCTTTTA 3900 

GGAGAAGTAA ATGACTAGAG AAAATATTAA ACTTTTTTCA GAAATGCACG CTGAACCAAG 3960 

CTGGTTGGCT GATCTCCGTC AAAAACCTTT TCACAAGATT GAGACTTTGG AATTACCAGT 4020 

TATTGAGTGT GTCAAATTCC ACCGTTGGAA TCTGGGTGAT GGAACGATTA CAGAAAATGA 4 080 

GCCATCAGCA AATGTTCCAG ATTTCACACC TTTAGATCAT CACTTGAAGT TGGTGCAAGT 4140 

AGGAACTCAA ACTGTTTTCG AACAAACTCC AGTTGAGTTA GCTGAACAGG GTGTTGTCTT 4200 

CACAGACTTT CACTCAGCTT TAGAAGAAAT TCCAGAGCTG ATCGAAGAAT TCTTCATGTC 4260 

ATCTGTTAAG TATGATGATG ACAAGTTGGC CGCTTACCAC ACAGCTTACT TTAACAGTGG 4 320 

TGCTGTACTC TATATTCCAG ATAACGTAGA AATCACAGAG CCAATTGAAG GAATTTTCTA 4380 

CCAAGATAGC GATAGCAATG TGCCGTTTAA CAAGCATATT ATGATTATCG TTGGTAAAAA 4440 

TTCTAAGATT AGTTATCTGG AGCGTTTAGA GTCACGCGGT GAAGGAAGTG ACAAAGCAAC 4500 

TGCCAATATC ACAGTGGAAG TGATTGCACG TTCTGGTGCC CAAGTCAAGT TTGCTGCTAT 4 560 

CGACCGTCTA GGTGAAAACG TCACTGCCTA CATTAGCCGT CGTGGTAAAT TAGGCAACGA 4620 

TGCAAGTATT GACTGGGCTA TCGGTGTCAT GAACGAAGGA AATGTCGTTG CTGATTTTGA 4680 

TAGTGACTTG ATTGGTAATG GTAGCCATGC TGACCTCAAG GTTGTAGCTC TTTCAAGTGG 4740 
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TCGTCAGGTA CAAGGGATTG ATACTCGTGT AACTAACTAT GGCTGCAACT CAATCGGAAA 4 800 

CATTCTACAA CATGGGGTTA TCCTTGAAAA AGCAACTTTG ACTTTCAATG GTATCGGCCA 4860 

CATCATCAAG GGTGCTAAGG GAGCAGATGC GCAACAAGAG AGCCGTGTTC TCATGCTTTC 4920 

AGACCAAGCG CGTTCAGATG CTAACCCAAT TCTTTTGATT GATGAAAATG ACGTAACTGC 4980 

AGGCCATGCA GCCTCTATTG GTCAGGTAGA TCCAGAAGAT ATGTACTACC TCATGAGTCG 5040 

TGGCTTGGAT AAGGCAACTG CAGAGCGTTT GGTTGTTCGT GGTTTCCTTG GATCTGTTAT 510O 

CGTGGAGATT CCAGTCAAGG AAGTTCGTGA TGAAATGATT GCAACTATCG AAGAGAAATT 5160 

GTCAAAACGC TAAGGGGCAG CCTATGTTAG ATGTAGAAGC GATTCGCAAG GATTTTCCAA 5220 

TTTTAGATCA GATTGTCAAT GATGAACCTC TGGTCTATCT GGACAATGCT GCGACGACAC 5280 

AAAAACCACT AGTAGTTCTG AAAGCTATTA ACAGCTACTA TGAGCAGGAC AATGCCAATG 5340 

TTCACCGTGG TGTCCATACC TTAGCGGAAC GAGCGACAGC TTCTTATGAA GCTGCTCGTG 5400 

AAACCATTCG TAAGTTTATT AATGCAGGCT CTACAAAGGA AGTTCTCTTT ACCAGAGGAA 5460 

CGACAACCAG CCTTAACTGG GTCGCACGCT TTGCTGAGGA AATTCTCACT GAGGGAGACC 5520 

AGGTCTTGAT TTCAGTAATG GAACACCATT CTAATATCAT TCCATGGCAG GAAGCTTGTC 5580 

GAAAGACTGG AGCAGAGCTT GTCTATGTCT ATCTTAAAGA CGGTGCCTTG GATATGGAGG 5640 

ATTTGCGAGC TAAATTGACT GATAAGGTTA AATTTGTTTC CCTAGCTCAT GCCTCCAATG 5700 

TTCTTGGTGT GGTCAATCCG ATCAAGGAAA TCACTCAATT AGCCCACCAA GTTGGGGCAA 5760 

TTATGGTAGT GGATGGTGCT CAATCTACAC CTCATATGAA GATTGATGTC CAGGACTTGG 5820 

ATCTGGACTT TTTCGCCTTT TCGGGTCACA AGATGGCTGG TCCGACTGGT ATCGCTGTCC 5880 

TTTACGGCAA AGAAAAGTAT CTTGAGCAAA TGTCTCCAGT AGAATTTGGC GGCGAGATGA 5940 

TTGATTTTGT CTACGAGCAA TTTGCTAGTT GGAAGGAATT GCCTTGGAAA TTTGAGGCTG 6000 

GAACGCCAAA TATGGCAGGA GCTATTGGAC TTGCGACTGC AGTTGATTAT CTGGAAAAGA 6060 

TTGGTATGGA TGCCGTTGAA GCTCATGAAC AGGAATTGAT TGCGTACGTC TATCCAAAAC 6120 

TGCAGGCAAT TGAGGGATTG ACCATTTACG GTTCTCAGGA TTTGGCTCAA CGTTCGGGTG '5180 

TTATTGCCTT TAACCTAGGT GATCTCCATC CTCACGATCT TGCGACGGCT CTGGATTATG 6240 

AAGGAGTGGC TGTTCGTGCT GGTCACCATT GTGCGCAACC CTTGCTTCAG TATTTGGAAG 6300 

TCCCAGCAAC AGCTCCTGCA AGTTTTTATA TCTACAATAC CAAGGCAGAT TGCGACAAAC 6360 

TAGTCGATGC CCTACAAAAG ACAAAGGAGT TTTTCAATGG CACTTTCTAA ACTAGATAGC 6420 

CTTTATATGG CAGTGGTAGC AGACCATTCG AAAAATCCAC ATCACCAAGG GAAGTTAGAA 6480 

GATGCTGAGC AAATCAGTCT CAACAATCCG ACTTGTGGGG ATGTCATCAA CCTCTCTGTC 6540 
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AAGTTTGATG CAGAGGACCG TTTGGAAGAT ATTGCTTTTC TAAATTCAGG ATGCACGATT 6600 

TCAACTGCTT CTGCTAGTAT GATGACAGAT GCCGTTTTAG GAAAAACCAA ACAAGAAATT 6660 

TTACAACTGG CGACTATTTT TTCTGAAATC GTTCAAGGGC AAAAAGATGA GCGTCAAGAC 6720 

CAACTTGGAG ACGCGGCATT CTTGTCAGGT GTTGCCAAAT TCCCTCAAAG AATCAAGTGT 6780 

GCAACCCTAG CTTGGAATGC CCTTAAGAAA ACAATTGAAA ATCAAGAAAA ACAGTAAGAC 6840 

AAGTTTCTTT TGTCTTATGA ATTATTAGAA ATGAAGAAAG AAAGGATACT ATGGCTGAAG 6900 

AAAGAGTAGA ACCAAAACCA ATTGACCTTG GTGAATATAA ATTTGGTTTC CATGACGATG 6960 

TAGAGCCTGT CTTATCGACA GGAAAAGGAC TCAACGAAGG TGTTATTCGT GAATTATCTG 7020 

CTGCTAAGGG TGAGCCTGAG TGGATGTTGG AGTTCCGTTT GAAGTCTTAT GAAACCTTCA 7080 

AAAAAATGCC CATGCAAACT TGGGGAGCAG ACTTGTCAGA GATTGACTTT GATGACTTAA 7140 

TCTACTACCA AAAACCATCT GACAAACCAG CCCGTTCTTG GGATGATGTA CCTGAAAAGA 7200 

TTAAAGAAAC CTTTGAACGT ATCGGGATTC CAGAAGCTGA ACGTGCTTAT TTAGCAGGGG 7260 

CTTCTGCCCA GTACGAGTCA GAAGTGGTTT ACCACAACAT GAAGGAAGAG TTCCAAAAAT 7320 

TAGGTATTAT CTTTACAGAT ACAGATTCCG CACTCAAGGA ATACCCAGAC TTATTTAAAC 7 380 

AATACTTTGC GAAGTTGGTA CCGCCGACAG ATAACAAGTT GGCAGCCCTC AACTCAGCAG 7440 

TATGGTCGGG TGGAACTTTT ATCTACGTGC CAAAAGGTGT CAAGGTAGAT ATTCCACTTC 7500 

AAACTTATTT CCGTATCAAT AACGAAAATA TAGGTCAGTT CGAACGTACC TTGATTATCG 7560 

TTGATGAGGG AGCAAGCGTC TACTACGTAG AACGATGTAC AGCACCAACA TATTCAAGCA 7620 

ATAGCTTACA CGCTGCCATT GTAGAAATTT TTGCTTTGGA CGGAGCTTAT ATCCCTTATA 7680 

CAACTATCCA AAACTGGTCT GATAACGTCT ATAACTTGGT AACAAAGCGT GCTAAGGCTC 7740 

AAAAGGATGC CACTGTTGAG TGGATTGATG GAAACTTGGG TGCCAAAACG ACTATGAAAT- 7800 

ATCCATCTGT TTACCTTGAT GGAGAAGGAG CGCGTGGTAC CATCCTCTCT ATCGCCTTTG 7860 

CTAATGCAGG GCAACACCAA GACACGGGTG CTAAGATGAT TCACAATGCT CCACATACCA 7920 

GCTCGTCTAT TGTGTCTAAA TCCATCGCTA AAGGTGGAGG AAAGGTTGAC TACCGTGGAC 7980 

AAGTCACCTT TAACAAGAAC TCTAAGAAAT CTGTTTCCCA CATTGAATGT GATACCATTA 8040 

TCATGGATGA CTTGTCAGCA TCAGATACTA TTCCATTTAA TGAAATTCAC AACTCGCAAG 8100 

TGGCTTTGGA ACACGAAGCC AAAGTATCTA AGATTTCAGA AGAGCAATTG TATTATCTCA 8160 

TGAGCCGTGG ATTGTCAGAA TCTGAGGCAA CTGAAATGAT TGTCATGGGA TTTGTAGAAC 8220 

CCTTTACAAA AGAACTTCCA ATGGAATACG CAGTTGAGCT GAACCGCTTG ATTAGCTATG 8280 
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AAATGGAGGG ATCAGTTGGA TAAAATTTGA TTTTATACTC TTCGAAAATC TCTTCAAACC 834 0 

ACGTCAGCAT CGCCTTACCG TATGTATGGT TwCTGAtTCG TCAGTTTCAT CTACAACCTC 8400 

AAAACAGTGT TTTGAGCAAC C.GCGGCTAGC TTCCTAGTTT GTTCTTTGAT TTTGAGTATT 8460 

AGATTTACTC AAAATCAAGG ATTTTGAAGA TGAACTTGTA TCAAAAAATC GCGGTTTAAA 8520 

ATCGCGATTT TTTATAATTT CTCGTTAACA AAGCGGACAA ACTGATTCCA CCAAACTTTT 8580 

AAGAAGAAGG CTTTTTCAAT TTTCTTGTCT GCTACCATTT CGAAACTAGG GCGCTCTGTG 8640 

GTGATGTAAC CTTGACCAAT CAAGTCCTTG TCTTCATAAG TCAAATGGCC AACCACTGTT 8700 

CCAGCTTCAA GTGGTGCTGG GATTGCTTTG GAATCAGGTG TGAATTGAAC AGATTGGGAA 8760 

GATTGATTCC CAACACGTTC GATTAGATAG ATATCCTCTG GAGCCACTGC AGTTACTGTA 8820 

TCTTCTTTTC CATCTTGTAC AGGGGCTTTG CTATCTTGAT AGGCATCGCC TTGTTGAACG 8880 

ATTTTGCGAA GTGTAAATGT AGAAGAAATA TAATCCATTA GGGAAGATGT AGCTGTAAAT 8940 

CGAGCGTAAG GATTATTGTC TTGATGATCT GCATTTAAAA CAACTGTGAT GACTCTCATG 9000 

CCTTTTTCGA CAGTAGTACC AACAAAAGAC TCTCCAGCCT TATCTGTTGT TCCTGTTTTT 9060 

AGCCCATCAA AACCACCACG GTAAGCAGGC ATACCTTCTA ACATGTAGTT GGTTGAAGTG 9120 

ATTGTCATCC CAGCAAAAGT AGAAGAAGGT TTTTTGGTGA TTTCTAAGAC TTGTGGGTAT 9180 

TTTTTGATGA GGTTGCGAGC AACGATAGCG ACATCATAAG CACTAAGCTT ATTTTCCTCA 9240 

TCTTTTTTAG AACCTGGGTA AATGTTATCC CCTAGAGTTT CATTGTTAAG ACCTGTCGTA 9300 

TTGACAACAG TGGCATCCTG AATTCCCCAT TCCAAGAGTT TTGCCCGCAT CATATCGACG 9360 

AAATCTTTTT CTGAGCCAGC AATTTTCTCA GCTAGGGCAA TAGCGGCGCT GTTGGCACTA 9420 

GATACCAGAG TTGCTTCAAG CAACTCTTCG ACAGTATAAT TACGGGCCTC CATAGGAATA 9480 

TTACTGGCTT CAGAATTTGT CGTCAATTGA TAAGGATAAT CAGAAATATC TACAGGAGTG 9540 

GAGAGGGTAA TACTTCCGTT TTCCAAAGCT TCATAGACCA GATAAACAGT AATCAATTTT 9600 

GTTATGGAAG CAATTTCGAC AGGTTGCGTT GCATCCTTCT CATAGAGAAT TTTACCAGTA 9660 

TTTGCCTCAA CAGCAATCGC ATGTTTAGCG GCAATGGTAA AATCTTGAGC AACAGCAGTA 9720 

GAAGCACCCC CTAAAAGAGA GACAGTTAAC AAAGTTAAAA ATATTTTTTT CATAGTAGTC 9780 

TTATTCTATC ATAAAGAAAA AAAATATTCT TGCTTTAATA ATTCATCTGT TAAGCTTTTT 9840 

GAAAATATGG TAAAATAAAG TAAGGGAGGT AACTCATGTT TCGTAGAAAT AAATTATTTT 9900 

TTTGGACCAC AGAAATTTTA CTCTTAACCA TCATCTTTTA CCTATGGAGA CAGATGGGGT 9960 

CTTTGATTAA CCCTTTTGTT AGCGTGCTTA ATACAATTAT GATTCCATTT TTATTAGGGG 10020 

GCTTTTTTTA TTATTTGACA AACCCTATTG TTACTTTCTT AAATAAAGTC TGTAAACTCA 10080 
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ATCGTTTGCT TGGTATTTTA ATTACCTTGT GTACTTTGGT CTGGGGAATG GTCATAGGTG 10140 

TTGTCTATCT CTTACCTATT TTGATTAATC AGTTATCTAG TTTGATTATA TCTAGTCAAA 10200 

CTATTTATAG TCGAGTACAA GACTTAATCA TAGACTTATC TAATTATCCT GCGCTCCAGA 10260 

ATTTGGATGT AGAAGCTACA ATTCAGCAGT TAAACTTATC CTATGTTGAT ATTCTTCAAA 10320 

ATATCCTAAA TAGCGTATCA AATAGTGTGG GGAGCGTCTT GTCAGCTCTT ATCAGTACTG 10380 

TTTTGATTTT GATTATGACT CCAGTTTTTT TGGTTTATTT CTTATTAGAT GGACATAAAT 10440 

TCTTGCCCAT GCTTGAAAGA ACGATTCTAA AGAGGGATCG CTTGCATATT GCAGGCTTAT 10500 

TAAAGAATTT AAATGCGACG ATTGCTCGCT ATATTAGTGG AGTTTCGATT GACGCAATCA 10560 

TTATAGGTTG TTTGGCTTAT ATTGGCTATA GTATTATTGG TTTAAAATAT GCTTTAGTTT 10620 

TTGCCATTTT TTCTGGTGTA GCCAATTTAA TTCCTTATGT GGGGCCAAGT ATTGGTTTGA 10680 

TTCCTATGAT CATCGCAAAT ATATTCACTG ATCCCCATAG ACTGCTGATT GCAGTGATTT 10740 

ATATGCTTGT TGTTCAGCAG GTAGATGGCA ATATCTTATA TCCTCGAATC GTAGGAAGTG 10800 

TTATGAAGGT TCATCCAATC ACGATTTTAG TTTTACTTTT GTTGTCAAGC AATATCTATG 10860 

GTGTAGTTGG AATGATTGTC GCAGTGCCAA CCTATTCTAT CTTGAAAGAA ATTTCTAAGT 10920 

TCTTATCCCA TTTGTATGAA AATCATAAAA TAATGAAAGA ACGAGAAAGA GAATTAGCTA 10980 

AGTAAAAGTC AGGAGAACCC TGATTTTTCT TTACTGCAAG TGGCCTTTAG ATTAGAAGAC 11040 

TGAAAATAAG TTAAAGTCTT AAACTAATTT TCACAGCTAA CAATAGTACA AGTTAATCTG 11100 

ATAAAAATCG AAAAAACCAG TGGAATTCTG TGTCAGGGTA AGTTCCACTG GTTTTCATAG 11160 

TCTATTAAAG TTCGAATGAA ACCTATTTAT AGTAGATTGA AACTAGAATA GTACACCTCT 11220 

AATTCTAAAA CATTGTTAGA AATCGATTTG ACTGTCCTGA TCTATTCGTT CTATTCTTAT 11280 

TTCATTTTAC TATATTTTGG TGCAATAAGT GAAAAGTAGT CCGAATAATA TAAGGATTGA 11340 

TTTTATAGTT TTTAAACTCA AATGAATTCA AATAAAGAGA GTACGAAAAT TCTCATCTGA 11400 

AAGTATTTTA GAATAATTCT CTTCGTGAAT TTCTTCAAAA CAGATAGCTT CATCTTAGGT 11460 

ATGTGATTTC TTTTTGCATT TTTGAGTTAG ATAAGGTATA ATGATTTTAT TGTCTTTTGG 11520 

GGTCGTTACG GATTCGACAG GCATTATGAG GCATATTTTG CGACTCGTGT GGCGACGTAA 11580 

ACGCTCAGTT AAATATAACT GCAAAAAATA ACACTTCTTA CGCTCTAGCT GCCTAAAAAC 11640 

CAGCAGGCGT GACCCGATTT GGATTGCTCG TGTTCAATGA CAGCTCTTAT TATTAGCGAG 11700 

ATACGATTAA GCCTTGTCTA GCGGTTTGAT AAGAGATTGA TAGACTCGCA GTTTCTAGAC 11760 

TTGAGTTATG TGTCGAGGGG CTGTTAAAAT AATACATAAC CTATGGTTGT AGACAAATAT 11820 
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GTTGGCAGGT GTTTGGACGT GGGTTCGACT CCCACCGGCT CCATTATTCC TTTGCATTCT 11880 

TTTGCATTCC TTGGTAAAAC GTTGTTAAAT CAACGTTTTT TATTTTTATC TTTGGTATTC 11940 

CTTTGCATTC TTTTGCTAAA AAGGGAGTCA CAAACAGACC CTATTTTAAA AAAGGATAGA 12000 

AAAAAGGATA CAACATTTGT CGCATCCTAA AAATAATCTT TTTTCGACGG AAGACATGGG 12060 

ATTCGAACCC ACGCACGCTA TTACACGCCT ACCGCGTTTC CAACACGGCC TCTTAAGCCT 12120 

CTTGAGTAAT CTTCCAATAC TTACTCAAAT AGTCTACCAT AAAGGCTCTT ATCTTGCAAT 12180 

AAAAATTCTA GAAATAAGAA AAATGATAGA TTTTGAAAGA AAATGATAAA AAATGCTTGA 12240 

CTTCGAAAGA AAGTATGATA GAATGAATAG TGTAAACGAT AACAGGAGGT GATTCAGTGT 12300 

TAAAAACAGA ACGTAAACAA CTAATTTTAG AGGAGTTAAA TCAACATCAT GTAGTTTCTC 12360 

TAGAAAAATT AGTTAGTTTG CTAGAAACGT CAGAATCAAC GGTTCGAAGA GACTTGGATG 12420 

AGTTGGAAGC GGAAAACAAG CTTCGTCGTG TGCATGGTGG AGCAGAACTC CCCTACTCCT 12480 

TACAGGAAGA AGAAACCATT CAAGAAAAAT CTGTCAAAAA CCTTCAAGAA AAGAAATTGC 12540 

TGGCTCAGAA AGCAGCCTCT CTCATTAAAG AAAAAGATGT CATCTTTATC GATGCTGGAA 12600 

CAACAACTGC TTTTTTGATT CATGAATTGG TCAATAAGAA TGTTACAGTT GTGACCAACT 12660 

CCATTCACCA TGCCGCTCAG TTGGTTGAAA AGCAGAwTCC AACTGTCATG GTTGGAGGAA 12720 

ACGTCAAGAC GGCGACAGAT GCTAGTATCG GGGGCGTTGC TCTTAACCAG ATTAACCAAT 12780 

TGCACTTTGA CCGTGCCTTT ATCGGAATAA ATGGTGTTGA CGATGGCTAT TATACGACTC 12840 

CTGATATGGA GGAGGGAGCT GTGAAAAGAG CTATTTTGGA GAATGCCAAG CAGACCTACG 12900 

TCTTGGTGGA TTCGTCAAAA ATTGGACAAA CTTGCTTTGC CAAGGTAGCC CCACTCAAAC 12960 

GCGCTATCGT TATCACTAGT CAAGGGCATG AGCTCTTGCA GGTTATTAAG GAGAAAACGG 13020 

AGGTAATAGA AGTATGATTT ATACAGTCAC ACTCAATCCA TCCATTGACT ATATCGTTCG 13080 

TTTGGACCAA GTCAAAGTTG GTAGTGTCAA TCGTATGGAC AGTGATGATA AGTTTGCTGG 13140 

TGGGAAAGGA ATCAATGTCA GCCGTGTCTT GAAACGTTTG AATATACCAA ATACAGCGAC 13200 

GGGATTTATC GGTGGCTTTA CTGGTAAATT TATCACAGAT ACTTTAGCAG AGGAAGAAAT 13260 

CGAGACACGT TTTGTCCAGG TGGCAGAAGA TACTCGTATC AATGTTAAAA TCAAAGCAGA 13320 

CCAAGAAACA GAAATCAACG GAACGGGTCC AACTGTTGAA TCGGTTCAGC TAGAAGAATT 13380 

GAAAGCTATT TTATCTAGTC TGACAGCAGA AGATACAGTT GTCTTTGCAG GTTCAAGTGC 13440 

TAAAAATCTA GGCAATGTTA TCTATAAGGA TTTGATTTCC TTGACGCGCC AGACTGGTGC 13500 

GCAAGTGGTC TGTGACTTTG AAGGACAGAC CTTAATTGAT AGTTTGGACT ACCAGCCTCT 13560 

TCTTGTAAAA CCAAACAATC ATGAACTTGG AGCGATTTTT GGGGTTAAAC TCGAAAGTTT 13620 
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AGATGAAATT 


GAGAAATACG 


CTCGTGAGTT 


ACTGGCTAAG 


GGTGCTCAAA 


ATGTTATTAT 


13680 


CTCTATGGCT 


GGTGATGGTG 


CCCTTCTTGT 


CACATCTGAG 


GGAGCTTACT 


TCGCTAAACC 


13740 


AATCAAAGGA 


ACAGTCAAAA 


ATTCAGTTGG 


AGCTGGTGAT 


TCTATCGTTG 


CTGGATTCAC 


13800 


AGGTGAATTT 


GTCAAATCAA 


AAGACGTAGT 


AGAAGCCTTC 


AAATGGGGAG 


TGGCTTGCGG 


13860 


AACGGCAACT 


ACCTTCTCAG 


ATGACTTGGC 


AACGGCGGAA 


TTTATTAAAG 


AAACATATGG 


13920 


AAAAGTTGAG 


GTAGAAAAAC 


GATGAAAATT 


CAAGACCTAT 


TGAGAAAAGA 


TGTCATGTTG 


13980 


CTAGATTTGC 


AGGCAACTGA 


AAAAACAGCT 


GTCATCGACG 


AGATGATTAA 


AAATTTGACA 


14040 


GACCACGGTT 


ATGTAACAGA 


ttttgaaaca 


TTTAAAGAAG 


GAATTTTGGC 


GCGTGAAGCT 


14100 


TTGACTTCTA 


CTGGTTTGGG 


TGATGGAATC 


GCAATGCCTC 


ACAGCAAAAA 


CGCTGCTGTC 


14160 


AAAGAAGCGA 


CAGTTCTATT 


TGCTAAGTCA 


AATAAGGGTG 


TTGACTACGA 


GAGCTTGCAT 


14220 


GGACAAGCAA 


CTGACCTCTT 


CTTCATGATT 


GCAGCTCCAG 


AAGGTGCCAA 


TGATACTCAC 


14280 


TTGGCAGCCT 


TGGCAGAATT 


GTCTCAATAC 


TTGATGAAAG 


ACGGTTTTGC 


AGACAAACTT 


14340 


CGTCAAGCAA 


CATCTGCAGA 


CCAAGTTATC 


GAACTTTTTG 


ACCAAGCTTC 


AGAAAAAACT 


14400 


GAGGAACTTG 


TTCAAGCACC 


TGCTAATGAC 


TCTGGTGACT 


TTATCGTAGC 


TGTTACAGCT 


14460 


TGTACAACAG 


GTATTGCCCA 


CACTTACATG 


GCCCAAGAAG 


CCCTTCAAAA 


AGTAGCTGCT 


14S20 


GAAATGGGGG 


TTGGTATCAA 


GGTCGAAACC 


AACGGTGCTA 


GCGGTGTTGG 


AAATCAACTA 


14580 


ACTGCAGAAG 


ATATCCGTAA 


GGCTAAAGCT 


ATTATCATTG 


CAGCAGACAA 


GGCCGTTGAA 


14640 


ATGGATCGAT 


TTGATGGAAA 


ACCATTGATC 


AATCGTCCAG 


TTGCTGACGG 


TATCCGTAAG 


14700 


ACAGAAGAGC 


TAATTAACTT 


GGCTCTTTCA 


GGAGATACTG 


AAGTCTACCG 


TGCCGCTAAT 


14760 


GGTGCCAAAG 


CTGCAACAGC 


CTCTAACGAA 


AAACAAAGCC 


TTGGTGGTGC 


CTTGTACAAA 


14820 


CACTTGATGA 


GTGGTGTATC 


TCAAATGTTA 


CCATTCGTTA 


TCGGTGGTGG 


TATCATGATT 


14880 


GCCCTTGCCT 


TCTTGATTGA 


CGGTGCTTTG 


GGTGTTCCAA 


ATGAAAACCT 


TGGCAATCTT 


14940 


GGTTCTTACC 


ATGAGTTAGC 


TTCTATGTTC 


ATGAAAATTG 


GTGGAGCTCC 


CTTTGGTTTG 


15000 


ATGCTTCCAG 


TCTTTGCGGG 


TTATGTTGCC 


TACTCTATTG 


CTGAAAAACC 


GGGTTTGGTA 


15060 


GCAGGTTTCG 


TGGCTGGTGC 


TATTGCCAAA 


GAAGGTTTTG 


CCTTTGGTAA 


AATTCCTTAT 


15120 


GCCGCAGGTG 


GTGAAGCAAC 


TTCAACTCTT 


GCAGGTGTCT 


CATCTGGTTT 


CCTAGGTGCC 


15180 


CTTGTTGGTG 


GATTTATCGC 


AGGTGCCTTG 


GTTCTTGCCA 


TCAAGAAATA 


CGTTAAAGTT 


15240 


CCTCGTTCAC 


TCGAAGGTGC 


TAAATCAATC 


CTTCTATTGC 


CACTTCTTGG 


AACAATCTTG 


15300 


ACAGGATTTG 


TTATGCTAGC 


TGTGAATATC 


CCAATGGCTG 


CAATCAACAC 


TGCTATGAAT 


15360 
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GACTTCCTAG 


GCGGTCTTGG 


AGGAGGTTCA 


GCTGTCCTTC 


TTGGTATCGT 


CCTTGGTGGA 


15420 


ATGATGGCTG 


TTGACATGGG 


TGGACCAGTT 


AATAAAGCAG 


CTTATGTCTT 


TGGTACAGGT 


15480 


ACGCTTGCAG CAACTGTTTC TTCAGGTGGT TCTGTAGCCA TGGCAGCAGT TATGGCTGGA 


15540 


GCAATGGTGC 


CACCACTTGC 


AATCTTTGTC 


GCAACTCTTC 


TTTTCAAAGA TAAATTTACT 


15600 


AAGGAAGAAC 


GTAACTCTGG 


TTTGACAAAC 


ATCATCATGG 


GCTTGTCATT 


TATCACTGAG 


15660 


GGAGCGATTC 


CATTTGGTGC 


CGCTGACCCA 


GCTCGTGCGA 


TTCCAAGCTT 


CATCCTTGGT 


15720 


TCAGCAGTAG 


CAGGTGGACT 


CGTTGGTCTT 


ACTGGTATCA 


AACTCATGGC 


GCCACACGGA 


15780 


GGAATCTTCG 


TTATCGCCCT 


TACTTCAAAT 


GCTCTCCTTT 


ACCTCGTTTC 


TGTCTTGGTA 


15840 


GGAGCAATCG 


TAAGTGGTGT 


GGTTTATGGT 


TACCTACGCA 


AACCACAAGC 


ATAAAAAATA 


1590O 


GAAAAATGAA 


AAGATTGGAC 


CGTTTGGTGC 


AGTCTTTTTC 


TCTTCCCCAA 


ATGCCTGTGA 


15960 


AATATGGTAT 


AATAGAAGAA 


TGGCAAACAA 


GAATACAAGT 


ACAACAAGAC 


GGAGACCGTC 


16020 


TAAAGCACAA 


CTGGAAAGAA 


AAGAAGCGAT 


TCAACGAATG 


TTGATTTCGT 


TACGAATTGC 


16080 


GATTTTATTG 


ATTTTCGCAG 


CCTTCAAATT 


AGGGGCTGCA 


GGTATAACCC 


TTTATAATTT 


16140 


AATTCGCTTG 


CTAGTGGGTA 


GCCTAGCTTA 


TCTGGCGATA 


TTCGGCCTAT 


TAATCTATCT 


16200 


CTTCTTTTTC 


AAGTGCATAC 


GAAAACAGGA 


AGGACTCTTA 


TCTGGCTTTT 


TCACCATATT 


16260 


TGCTGGCTTA 


CTCTTGATTT 


TTGAGGCCTA 


CTTGGTTTGG 


AAATATGGTT 


TGGACAAGTC 


16320 


CGTTCTAAAA 


GGGACCATGG 


CTCAGGTTGT 


GACAGATCTG 


ACTGGTTTTC 


GAACGACTAG 


16380 


CTTTGCTGGA 


GGGGGCTTGA 


TCGGGGTCGC 


TCTTTATATT 


CCAACAGCCT 


TTCTCTTTTC 


16440 


AAATATCGGA 


ACTTACTTTA 


TTGGTTCTAT 


CTTGATTTTA 


GTGGGTTCTC 


TCCTAGTCAG 


16500 


CCCTTGGTCT 


GTTTACGATA 


TTGCTGAATT 


TTTCAGTAGA 


GGCTTTGCCA 


AATGGTGGGA 


16560 


AGGGCACGAG 


CGTCGAAAAG 


AGGAACGCTT 


TGTCAAACAA 


GAAGAAAAAG 


CTCGCCAAAA 


16620 


GGCTGAGAAA 


GAGGCTAGAT 


TAGAACAAGA AGAGACTGAA 


AAAGCCTTAC 


TCGATTTGCC 


16680 


TCCTGTTGAT 


ATGGAAACGG 


GTGAAATTCT 


GACAGAGGAA 


GCTGTTCAAA 


ATCTTCCACC 


16740 


TATTCCAGAA 


GAAAAGTGGG 


TGGAACCAGA 


AATCATCCTG 


CCTCAAGCTG 


AACTTAAATT 


16800 


CCCTGAACAG 


GAAGATGACT 


CAGATGACGA 


AGATGTTCAG 


GTCGATTTTT 


CAGCCAAAGA 


16860 


AGCCCTTGAA 


TACAAACTTC 


CAAGCTTACA 


ACTCTTTGCA 


CCAGATAAAC 


CAAAAGATCA 


16920 


GTCTAAAGAG 


AAGAAAATTG 


TCAGAGAAAA TATCAAAATC 


TTAGAAGCAA 


CCTTTGCTAG 


16980 


CTTTGGTATT 


AAGGTAACAG 


TTGAACGGGC 


CGAAATTGGG 


CCATCAGTGA 


CCAAGTATGA 


17040 


AGTCAAGCCG 


GCTGTTGGTG 


TAAGGGTCAA 


CCGCATTTCC 


AATCTATCAG 


ATGACCTCGC 


17100 


TCTAGCCTTG 


GCTGCCAAAG 


ATGTCCGGAT 


TGAAGCACCA 


ATCCCTGGGA 


AATCCCTAAT 


17160 
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CGGAATTGAA GTGCCCAACT CCGATATTGC CACTGTATCT TTCCGAGAAC TATGGGAACA 17220 

ATCGCAAACG AAAGCAGAAA ATTTCTTGGA AATTCCTTTA GGGAAGGCTG TTAATGGAAC 17280 

CGCAAGAGCT TTTGACCTTT CTAAAATGCC CCACTTGCTA GTTGCAGGTT CAACGGGTTC 17340 

AGGGAAGTCA GTAGCAGTTA ACGGCATTAT TGCTAGCATT CTCATGAAGG CGAGACCAGA 17400 

TCAAGTTAAA TTTATGATGG TCGATCCCAA CATGGTTGAG TTATCTGTTT ACAATGATAT - 17460 

TCCCCACCTC TTGATTCCAG TCGTGACCAA TCCACGCAAA GCCAGCAAGG CTCTGCAAAA 17520 

GGTTGTGGAT GAAATGGAAA ACCGTTATGA ACTCTTTGCC AAGGTGGGAG TTCGGAATAT 17580 

TGCAGGTTTT AATGCCAAGG TAGAAGAGTT CAATTCCCAG TCTGAGTACA AGCAAATTCC 17640 

GCTACCATTC ATTGTCGTGA TTGTGGATGA GTTGGCTGAC CTCATGATGG TGGCCAGCAA 17700 

GGAAGTGGAA GATGCTATCA TCCGTCTTGG GCAGAAGGCG CGTGCTGCAG GTATCCACAT 17760 

GATTCTTGCA ACTCAGCGTC CATCTGTTGA TGTCATCTCT GGTTTGATTA AGGCCAATCT 17820 

TCCATCTCGT GTAGCATTTG CGGTTTCATC AGGAACAGAC TCCCGTACGA TTTTGGATGA 17880 

AAATGGAGCA GAAAAACTTC TTGGTCGAGG AGACATGCTC TTTAAACCGA TTGATGAAAA 17940 

TCATCCAGTT CGTCTCCAAG GCTCCTTTAT CTCGGATGAC GATGTTGAGC GCATTGTGAA 18000 

CTTCATCAAG ACTCAGGCAG ATGCAGACTA CGATGAGAGT TTTGATCCAG GTGAGGTTTC 18060 

TGAAAATGAA GGAGAATTTT CGGATGGAGA TGCTGGTGGT GATCCGCTTT TTGAAGAAGC 18120 

TAAGTCTTTG GTTATCGAAA CACAGAAAGC CAGTGCGTCT ATGATTCAGC GTCGTTTATC 18180 

AGTTGGATTT AACCGTGCGA CCCGTCTCAT GCAAGAACTG GAGATAGCAG GTGTCATCGG 18240 

TCCAGCTGAA GGTACCAAAC CTCGAAAAGT GTTACAACAA TAAAAAAATA GCTTCTTTCC 18J00 

AAGTTTGGAG GGAAGCTATT TTAGTGGCTA TTGATTGCTT TTATTTTCTG AAGTTGGCGC 18360 

ATTGGACTGT TTTTCGTTTT CAGTAGCAGG TTTACTTGAA GCAGGAGTAG AAGAGTCCTG 18420 

AGTTGCTGTT TTCTGATCTT CTTTTTTCTC TTCCTTGACG CTAGATTTTG GTGTTTCCTC 18480 

TTGCTGTGTT TTTTCTTGAC TAGTGTTAGT CTCTTTACTT GGACTGGTGT TTTCCTTAGG 18540 

GGATTCCTTT TGGATTTCTT TGACAATGGT TGTCGTCTGG CTTGTCGTAG GTTCTTTTTT 18600 

AATATTTTTG TTATTATCCA AGGCGTT 18627 
(2) INFORMATION FOR SEQ ID NO: 114: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2560 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

TAAAATACCT TACCTTCCTT CTGCACGTTC AGCAGGTAAG TCATTGAAAT TTAAAGATCA 60 

AGATATTACA ATTGAAGAAA CGACTGAAAC AGCTTTTGAA GGAGTTGATA TTGCTCTCTT 120 

TTCAGCAGGT AGTTCTACAT CAGCTAAGTA TGCACCATAC GCAGTAAAAG CTGGCGTGGT 180 

AGTAGTAGAT AATACATCTT ATTTCCGTCA AAATCCAGAT GTTCCTTTGG TTGTTCCAGA 240 

GGTCAATGCT CATGCACTTG ATGCTCACAA CGGAATCATT GCCTGCCCTA ATTGTTCAAC 300 

AATTCAAATG ATGGTGGCTC TTGAGCCGGT TCGCCAAAAA TGGGGCTTGG ACCGTATCAT 360 

TGTTTCAACT TATCAACCCG TTTCAGGTGC TGGTATGCGA GCAATTCTTG AGACACAACG 420 

TGAACTTCGT GAAGTCTTGA ATCATGGTCT GAAACCACGT GATTTGCATG CGGAAATCTT 480 

GCCTTCAGGT GGTGACAAGA AACATTATCC TATCGCCTTT AACGCTCTTC C AC AAATTGA 540 

TGTTTTCACT GATAATGATT ACACGTACGA AGAGATGAAG ATGACCAAGG AAACTAAGAA 600 

AATTATGGAA GATGATAGCA TTGCAGTATC TGCAACATGT GTGCGTATTC CAGTCTTGTC 660 

AGCTCACTCT GAGTCTGTTT ATATCGAAAC AAAAGAAGTG GCTCCAATCG AAGAAGTAAA 720 

AGCACCTATC GCAGCCTTCC CACGTCCTGT TCTTGAAGAT GATGTAGCTC ATCAAATCTA 780 

TCCTCAAGCT ATCAATGCAG TTGGTTCGCG TGATACCTTT GTTGGTCGTA TCCGTAAAGA 840 

CTTGGATGCA GAAAAAGGAA TTCACATGTG G G 1T G TTTCA GATAACCTTC TCAAAGGTGC 900 

TGCTTGGAAC TCAGTTCAGA TTGCTGAAAC TCTTCATGAA CGTGGATTGG TTCGTCCAAC 960 

AGCCGAATTG AAATTTGAAT TAAAATAGTC ATATCGTTTA GGAGTTCAGA TGAACTCCTT 1020 

CTTTGAAATA GAGAGGTGTT TTCGTGTCTT ATCAAGATTT AAAAAAATGT AAAATCATTA 1080 

CAGCCTTTAT TACCCCCTTC CATGAGGATG GTTCCATTAA CTTTGATGCT ATTCCAGCCT 1140 

TGATTGAGCA TTTATTGGCC CATCATACGG ATGGAATTCT TCTCGCAGGA ACGACTGCTG 1200 

AGAGTCCAAC TTTGACCCAC GATGAGGAGT TGGAGTTGTT TGCGGCTGTA CAAAAGGTTvi 1260 

TCAATGGACG CGTTCCTTTG ATTGCGGGTG TAGGTACTAA TGATACGCGT GACTCTATTG 1320 

AGTTTGTCAA AGAAGTAGCG GAATTTGGTG GTTTCGCAGC TGGGCTTGCT ATTGTTCCTT 1380 

ACTACAACAA ACCTTCTCAA GAAGGGATGT ATCAGCACTT TAAGACTATT GCAGATGCTT 1440 

CTGACCTACC AATTATTATC TATAACATTC CAGGGCGTGT AGTTGTCGAA TTGACTCCAG 1500 

AAACCATGCT TCGCTTGGCT GACCATCCAA ATATTATCGG TGTCAAAGAA TGTACTAGCT 1560 

TGGCTAATAT GGCTTACTTG ATTGAGCACA AGCCTGAAGA GTTCTTGATT TATACAGGTG 1620 

AGGATGGAGA TGCTTTCCAT GCCATGAACC TTGGGGCGGA TGGGGTTATT TCTGTTGCCT 1680 



WO 98/18931 



PCT/US97/19588 



831 



CTCATACAAA 


TGGGGATGAA 


ATGCACGAGA 


TGTTTACTGC GATTGCAGAA AGCGATATGA 


1740 


AGAAAGCCGC 


AGCAATTCAG 


CGTAAATTCA 


TTCCTAAGGT TAATGCTCTC TTCTCTTATC 


1800 


CAAGTCCTGC 


TCCAGTTAAG 


GCAATTCTTA 


ACTATATGGG 


ATTTGAAGCT GGACCCACTC 


1860 


GTCTACCTCT 


TGTTCCAGCA 


CCAGAAGAAG 


ATGCCAAACG 


CATTATCAAG GTTGTCGTAG 


1920 


ATGGCGACTA 


CGAAGCAACT 


AAGGCAACTG 


TAACAGGGGT 


CTTAAGACCA GATTACTAAT 


1980 


AAAGACAATA 


AAATCCGGCT 


CTTTGTCAAC 


TGTAGTGGGT 


TGAAGTCAGC TAAGCTCGAG 


2040 


AAAGGACAAA 


TTTTGTCCTT 


TCTTTTTTGA 


TATTCAGAGC 


GATAAAAATC CGTTTTTTGA 


2100 


AGTTTTCAAA 


GTTCCGAAAA 


CCAAAGGCAT 


TGCGCTTGAT 


AAGTTTGATG AGATTATTGG 


2160 


TCGCTTCCAA 


TTTGGCGTTT 


GAATAGGGTA 


GTTGAAGGGT 


GTTGACGATT TTCTTTTTGT 


2220 


CCTTTAGAAA 


GGTTTTAAAG 


ACAGTCTGAA 


AAATAGGATG 


AACCTGCTTC AGATTGTCCT 


2280 


CAATGAGTCC 


GAAAAATTTC 


TCCGGTTCCT 


TATTCTGAAA 


GTGAAACACC AAGAGTTGAT 


2340 


AGAGCTGATA 


GTGATGTTTC 


AAGTTTTGTG 


AATAGCTCAA 


AACCTTGTTT AAAATCTCTT 


2400 


TATTGGTTAA 


GTGCATACGA 


AAACTAGGAC 


GATAAAATCG 


CTTATCACTC AGTTTACGGC 


2460 


TATCCTGTTG 


AATGAGTTTC 


CAGTAGCGCT 


TGATAGCCTT 


GTATTCGGGA TTTTCGATGA 


2520 


AACTGATTCA 


TGATTTGGAC 


ACGCACACGA 


CTCATAGCAC 




2560 


(2) INFORMATION FOR SEQ ID NO: 115: 







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11303 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

TATTGGATTT CCCTTGCAAT CACTTTATGG GACAAGCACC CGGCAGCGCA GAGGAAATCA 60 

ACGCCTTCTC TAGCCTACAT TTTCAAACCA CCTTCCCACG TTTTGCCAAG ATTAAGGTCA 120 

ACGGTAAGGA AGCAGACCCT CTCTATGTCT GGTTACAAGA CCAGAAATCC GGCCCACTAG 180 

GAAAACGAGT CGAATGGAAT TTCGCTAAGT TTCTCATCGG TCGAGATGCG CAAGTCTTTG 240 

AACGCTTTTC TTCAAAAACA GACCCAAAAC AAATTGAAGA GGCGATACAA ACTCTACTAT 300 

AATTCACAAT CTCACTATGA TTAGGTTTCC TTTAACCTGA TGAATAGTGA GATTTTTTGA 360 

TGGGCTTTGA CTTAAATAGA AAAACACCCC ATGATATGAA ACATGAAGTG TTGTAAAGTC 4 20 

TATGTTGTAG GTGCTTATTT CACAATTTCA ATGTGACCAG TGATAACGAA TACCATACAG 480 
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AATCTTCATA TACACTAAAC AAATGACTTT CTAATTATTT CAATTAGTTT TGGCTAGTAA 540 

ATATCATTTC CAACAAACGC CCTCTCAATT CCTTATCCTG ATGATGCAAG ATATTCATTA 600 

AGTCATGAGA GTTTTTCGCA TTGATGAATT GATTTAACAA TCTATCTTTT AATTCATATG 660 

GAAGAGAAGC TGTCTTTAGT AGTCTAAAAA CTTCGTCATT TAAAGATGTC CTTTTATTAT 720 

CTTTCCATTC AAATTTAGCT GTATCATTCT TATTTGGCAA TTCAATTATA GACACATTCG 780 

TTCCTTTAAA ATGAATTCTA TGTTTTCTAT TGCTTGGAAC GATACTAGAA TCTCCTTGTA 840 

ATGCTAACTC TACCATTCCC ATTTCCCAAT CGATTGATAA TCTTGTTTTA TATCTTTGAC 900 

CATTTTGATC TTCAAGCATT TCAAAAGAAT GTTGTTTTCC TGGGAATACA TACCAATCTA 960 

CAACTTCAGC TAAATCAACA CCCATACCTA TCTCAGAACC AACCAAGGGA ATGATTGCAC 1020 

CACTTTTTGC AAACACAGGC GTAGTCGAGA TGTCCCTATA AACACTTAAC TTCACACCAC 1080 

CTGTGTATTT TTTCTCTGAA AAGAAGTCAT ACCATTCACC TTCAGGGAAC CATACATCTA 1140 

CTTTTGCAGA TTGGAATGTC AAATCCATCT TTTCTACAAT GGGAGCCACC ATCAGTTCTG 1200 

TTCCAAAAAA GTATTGGTTT GCAACATTAT AGCTCTCATC ATTCTCTGGA TAGAAATAAT 1260 

AGATTGGACT GATTAATGGG GCACCTTCCT CATGTGTCTG TACATTCATG GTATATAGAT 1320 

AGGGAATCAT CTGATGTCTC AAACGAAGGT ATTTCTTCAT AATCTTAGAT GTTGTTTCTG 1380 

AAAAAAACCA AGGTTCTTTA CTATTAAAAG GACTTCTAGA ACTATGTAAT CGAGTAATCG 1440 

GACTAAAAAC ACCAAACTGT AGCCATCTAG TTTGTAGCTC TTCCTCATAA TCCCCCAACA 1500 

TATGTCCACC GATATCATGA CTCCACCAAC TATAACCGAT ATTAGATGCT GTCGCTGTAA 1560 

AATAGGGTTG AAATCTTAAG GAATTCCAAC TAATAATAGT ATCCCCTGAA AAACCAACAG 1620 

GGTAGCGGTG ACTACCAGGA CCTGCATATC TTGATAAAAT CAAACCACCT TCTGCATTTT 1680 

TACAACTATC CTGATAGTGA TAATGGTTTA AAAGCCAAAG TGGATCTAGC ATACCTTGTG 1740 

TCCCTTGTTG CCAGTCAATC CACCAAAAAT CTACTCCCTG CTTTTCTAGT TCATAATGAA 1800 

CATCTTTAAA GTAGGCTTCC CTAAAAGAGG GATTAAAAAA ATCAAAAATA GCAGGTTCTT 1860 

CTAGTTCTAC ATTTAACCCC AACCGTTTTG CGATTTGAGG ATAAGCTTCT TCATAAGCCC 1920 

GTATCCCATC AGCAGGATGG ACATTTAAGG AGAGTTTTAG CTTTCTATCA TGAAGTTGTT 1980 

GCAATAACTG TTCTGGATTT GCTATTAAGT TTCTATTCCA ACTATATCCT GTCCAGCCAC 2040 

TTCCAAAGCG AGCTGGAATG TCAGTTATAT GCCAATCCAT ATCTAACACA CCGATAGATA 2100 

ATGGAATTTT CTCTGTTTCA AATCTGTCTA TTAAATCCAA GTATTCATCC GACGTATAAG 2160 

GCCAATATCT ACTCCACCAA TTGCCTAAAG CATATCTTGG CAACAAGGGT GTTGAACCAG 2220 

TCAAATGGTA AAAATCTCTG ATTGCTCCTC TATAATCATG CCCATAGGCA AAGAAATACA 2280 
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GGTCAATTTG 


ATTTTCTCTC 


TCAATATAAC 


CAGATTGTTC 


ATCCCAAATA 


AATCCTTGAG 


2340 


AATCATCCAA 


TAAGGCTATA 


CCATTTCGGC 


TAATAATTCC 


ATCTTCTAAC 


GAGATTGCTC 


2400 


CATCTGCCTT 


ATCCAGAGTC 


CGAGCTGTTC 


CTTTTAACGT 


TTCAATAGAT 


TCACCAAAAT 


2460 


ACCAGCGACT 


ACCATATACG 


GCAAAATTTC 


CTTTTAATTC 


TATAAATAAA 


TTTTCGGCGT 


2520 


TAAATTCTCC 


TTTATTAAAG 


TGCAGATGAA 


AATAGTCCGT 


CATAATATCT 


AGTACGTTTG 


2580 


ATGTCTCGAT 


ATAATCTAAC 


GAAATTTGGC 


CAAAATCTCT 


ATTATAGATA 


AGTTGTGTCG 


2640 


TTCTATCCTC 


AAAACTTCCA 


GTTTGAGAGT 


ATTCTAACCT 


TACTAGCTTG 


TCTGTTAATA 


2700 


CAGAGATTCG 


ATAAAACTCT 


CCCTTAAAAA 


TTTTCAATTT 


GTTTTCCTCC 


TTTTATGGTA 


2760 


GCATAAAAAC 


AGAACGCACC 


ATTTTTGATG 


CGTTTTTCAT 


TATTCTGAAT 


GCAATGTTCT 


2820 


ATCTGTTATA 


TCTATGACAA 


ATAATAGTCA 


ATTGAAAAAA 


TGCAGTGGAC 


AAAATATCTT 


2880 


TTAACAAACC 


AAGAGTTTAT 


TAAAGAGTTA 


TCACTTTTCA 


ACTTTTCTAA 


GCTTATGCAG 


2940 


TTCTGAAACA 


AACTACTTTT 


AAACTATTAA 


CTAAGATAGG 


ATTGATAAAT 


AATTTCAAAC 


3000 


TCTTACTAGC 


AATCATACGA 


TATTCAAGCT 


CACGTGCTTT 


TTTCCTTCCT 


GCTTATTTCT 


3060 


TAGAACTGAA 


GAACCCGGAT 


CGGTATATAA 


ATTATCCGGA 


TCAACATAGT 


CATAAGATTC 


3120 


ATAACAGTTG 


CGCTTCATTA 


AGTCATCCCC 


AGAGCAAGAG 


CTTCATCTCG 


TAATTTTTCA 


3180 


ACATCACTAA 


CCGTAGGTCG 


CCATCCTTCA 


ATCATATTTG 


TACTTAAAGC 


ATACCAAACA 


3240 


CTCTTAAAAA 


CGGATCGGTT 


TTCAAAAGCT 


ATTCCCATGA 


TTGTCATCTT 


TTCTTTATCT 


3300 


ATATCTAAGG 


ACATATGCTA 


CCTCCTTTAG 


ATACATTATA 


CCATGTTTCT 


CTGTAGCTTT 


3360 


TAAAAATTTT 


ATTTTGTTTG 


TCATATCTAA 


GTTTTCAGCA 


CGCTTATCCT 


ATTTTATAAG 


3420 


CCTCAAACCC 


AAATATAAAA 


CGCATTCTTT 


TTGCTTTTTT 


ACTATTGTAT 


CGTATTCTAC 


3480 


GATAACATAC 


TTTACTTTAT 


TGTTTTTTTA 


AATAACAGCA 


GTTCCCTGTT 


TATCAACTAT 


3540 


TCGAACTACT 


TTCTATTTTG 


CTTCATACCC 


TACATAGCGA 


AAAAATATGA 


AAAAGCAGAG 


3600 


AAGAATATCT 


TAAAAAGACC 


TCTTCACTGC 


TAA7ATTAAC 


ACTCATTATT 


TAAACTATAT 


3660 


GGATTCTATC 


ATCGAGTATA 


CTTTTTTACT 


TATTAGATAC 


CTTGCTCTTC 


TTTCACCAAT 


3720 


TTTTGATCAT 


ATACACGGAT 


GAATGGAAGA 


TAGACTAGGA 


ATGCTGCAAA 


TGCACATACT 


3780 


AGAGCAACTA 


ATACAGCTCG 


AAGATCTGCT 


GTCCCTAAGA 


AAGCTCCAAT 


CCCTACTGGA 


3840 


GTTGGCCATG 


GAACCTGTGC 


GATAATTGGC 


TTAATAAAGT 


TTAGAGAATT 


CGCTACGTAA 


3900 


TAAATAGTAG 


CAGTAACCAT 


TGGTGCTAAA 


ATAAATGCTA 


TAGCCAAGGC 


TGGATTATAG 


3960 


ATAATAGGTA 


ATCCAAAAAT 


TAATGGTTCA 


TTAATATTAA 


ATAAGGCTGG 


AACTACAGAT 


4020 
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GCTCGTCCTA 


TTGCTTTAAG 


CTGTTCAGAT 


TTAGAGGCAA 


AAGCAATATA 


TAAACATAGT 


4080 


CCTAAAGTTG 


CACCAGAACC 


ACCTGCAATT 


ACAAACATAT 


TAGAAAATTC 


ACCTGCAACA 


4140 


GCGAAGTGCC 


CGCCAGCAGC 


ATTTTCAGCC 


ATGTTAGCAA 


GAGCAATTGG 


ACTAACAAAT 


4200 


GCAAAAACAA 


TGTTCGCACC 


GTGGATACCT 


ACAATCCAAA 


GTAGTTGAGT 


CAATAGATAA 


4260 


ATAATCATTA 


AACCAATCCA 


CGAATTAGTC 


AGATTGGATA 


CAAJtACCAAA 


TGGAATTGCA 


4320 


ATGACTTTAA 


AAATATCTGT 


TCCCATTGCT 


ACAAGAAGAC 


CGTTGATAAA 


GA TAACAAC A 


4380 


AATGCAACAA 


CAAATCCCGG 


AACCAAAGCG 


GTAAATCCAC 


GAGAAACTCC 


TTCTGGAACA 


4440 


GCTTCAGGCA 


TTTTAATAAC 


CCAATTATGT 


TTAACACACA 


TACGATAAAT 


AAGAACAGTC 


4500 

■ mf s* w 


ACAATTGCCA 


TAATGATTGC 


GGTAAAAATC 


CCTGTTGTCC 


CAAAACGTGC 


GACTACATTT 


4560 


CCCATTGCCC 


ATCCATCTGC 


AATTACTGCA 


CCTTCTTTTA 


GACTTGTCAC 


AGTCTTCATC 

• ^ ^m m> w ~imm m m 


4620 


ATTCCACCAT 


CAAAAATGAT 


TTGCGGTACT 


GTCATGACAA 


AAGCCATCAA 


GGCAAGCAAG 


4680 


GCACCATTAA 


GAGGATTCAT 


ATTGAGTTCT 


TCTTCCTCTG 


CAT AAATT TT 


TGTCAATTCA 


4740 


TATGCAAGTG 


ATAGAACGAA 


ATAAAGAGAT 


AGAGAACCCA 


TAGTCGCATA 


GTTTGCAACC 


4800 


ATGTAAAGTG 


ATGTGAATTT 


ATCAAATGAA 


GCAGAGAAAA 


TATCTGCCAC 


AATTGGCCAA 


W W U 


AATGAGAAAG 


CTTGTGGCAA 


AATACTGAAT 


ACCAAAAACA 


TTGATCCTAC 


AATAGTAAAT 


4920 


GGTACAGCAG 


CCATACCTGC 


AGCCGTGATA 


GCACGTACTA 


CTTTAAACTG 


AGCAAGTTTG 


4980 


CCCATTGGTC CCATAACATG 


GTTTTCAAGA 


AAACCAAACA 


ACCCGTTTTG 


TTGATCCATA 


5040 


AATAGACCTC 


CTTAATAAAA 


CATAATAATT 


TTTACTTTCT 


AAAGACTAGT 


TTCAAATACA 


5100 


AATTATACTA 


GATCAGGATT 


ATAAACTAAG 


TGAGTTCTTT 


TCCAATTGGA 


CAAATTGTTG 


5160 

v 4 V W 


ATAAGCCTTA 


TCTGTTCGTT 


TATAAATTTT 


TTTAATTCTT 


CTAATGTCTA 


ACAAACTCAG 


5220 

mm m\» mm V 


AACTAAACCT 


AATAGAAGAA 


CTACAAAAAC 


AAATAAACGT 


GCTACTTGGT 


TATTTTCAAA 


S280 


AATCGGAAAA 


AGATTCTTAA 


ACCAACTTGT 


CCAAGTTAAA 


ACAAGTAATC 


CTATTGAAAT 


5340 


AAGCATTTGT 


ATTCTAACAA 


ACATTAGTGT 


TATTCCCAAC 


TTTTCTTTCC 


TATTTCCATA 


5400 


AAGTTTAAAT 


TGTTCAACAG 


TTGCTAAAAT 


AGAAAATACT 


ATGAGCATAA 


TGGGGAAAAT 


5460 


AATAATAGGC 


GAGGGACTAA 


TAAACTGACT 


CAAAAGCCAA 


TAAATATTCC 


CAAAAAAGAA 


5520 


GAGTGCTATT 


GAATAACGTA 


GAAGAAGATA 


TCGATTGAAA 


AAAGTATTAG 


TTAGAGCCAT 


5580 


CTCTCGACGT 


TGTTGTTCAA 


TCTTTTGTCG 


TTCTTTTTTA 


TCCATATCAT 


TTCCTCCTTA 


5640 


TATAACAACA 


CATATTTAGT 


TAACTTTCTT 


ATAAAGAGCT 


AACATTTCCT 


TTGCTACTTC 


5700 


TAATAATGTC 


ATAGTGGTCA 


TTAAATGATC 


TTGAGCATGT 


ACCATGATAA 


TTTCAATTTT 


5760 


AATTTCCACT 


CCACTTGCGT 


ATTCTTGCAA 


GAGTTTGGTT 


TGTGCATGAT 


GCGCTTCAAG 


5820 
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AATTATCTCA TTTGATTGAT TTAATTTACT TTCTGCATCA TCAAAACTAC CTTCTCTCAT 5880 

TTTTGCAAAT GCTTCATGTA TTTCTGACCT TGCATTTCCC GAATGCAGGA TAATTTCAAA 5940 

TGCTGCAACC TGCAGTTCCT CTTGATTCAT ATAAACCTCC TATTTTATCT TCTCAAATAT 6000 

GTTAATAAAA TCTTCAAAGT TATTGCAAGA TATTAGCTGA TTTTGCAATT CATCATTCTC 6060 

TGTCAGAGAG ACTATCTTTT TAGTCACAGT TGCCAAACCT TCGTTCCCAT ATATTGATGG 6120 

AGATAGAAGA AATACTAGCT GGACATGTGA ACTTTGATTA TCCCAGAGTA ACGAATCTTT 61 BO 

ACAAATTGCA ACCGAAACCT TTCCCTCTGT ACCAAAGGGC TGAATAGGAT GCGGAACTGC 6240 

AATTTTTTCA GAAAAAACAA CTGAACTTAA TTCTTCGCGC TGTTTAATTC CATAAAGTAA 6300 

AGATTGTTCA AACTCATTTG ATTCACCAAC AGATAAACTC TCAACCATCT TTTCAAGTAA 6360 

ATTTACCTTG TCTGATTCAG TACATATTAA AAAGTTTTCT TTACTAAAAT ACTGTCTAAA 6420 

GCCGTTGTTT TCAAATTTGT TAATCTTTGA TGATTGTACA TAACTAGAAA CTTGCATCTA 6480 

ATCCATAGCT TTTCTAATCA TTTCCATCTC ATCACTCTTA AGAAACACAC TAACTTTAAA 6540 

AACTGGGATT TGAAAATATA GATTTGATAA ATCAATAGCT GACACTATAA AATCTATTCC 6600 

TTTAAGTTTT TCTTGATTCA ATTCATAGTA GCCTATTACA TCAACAACTT CTACTCGCTT 6660 

CCCAAACTCC GTTTCCAAAC GATTTCTTAA CATTTGGGCT GCACCAAATC CTGTTGCACA 6720 

AATAGCAAGA ATATTAAACT TAGTACTCTC TTTGCTACGT TCCATAGCAG CTAAAAAGTG 6780 

AAGACTTACA TATGCTACTT CATCATCTGA TATTGTCCAC TCCAAGAACT TGTCCATATT 6840 

TGCAAGAATT TCTCTAGTCA TAAAGAATAT ATCACTATAA TTCTGTTTAA TTTCATCTAC 6900 

CAAAGGGTTA TTTAAGGTAA TCCGGCTTTC TAAACGTACT TGTAGTGTCA TTAGATGAGT 6960 

TATCAATCCT TCAATTAGTT GGAAATCTGA AGAAAAGTTA TACATATCAT CTAATCCTAA 7020 

ATTCTGAAAT GTTTTAAATA AAGATTTTTT TAAAACTTCT TCAGAAATAT TCTTCTGATT 7080 

TTTTTGACAT TGTTGACTCT TAGCTAACAA ATGCAAAGTA ATGTAGTCTA TTTCCTGAAC 7140 

TGGAAATTCC TGATTTGTTA CTTCTCTTAC TTTAGAAAGA ATTCTTTGGG CAACCTTTCT 7200 

CTCTATTGCA TCATCAGTCA TCTGACAGTC TATATTTTTT ATTTCAAATC CGGATTTTAA 7260 

ACGAATCACA GACAATGCTA TGTGAACTAC TAAATTCTGT AGTACAAAAT CAGATAGTTT 7320 

TAGGTTGGCC TCTTGGCATT CATCCAAAAC AATTCTAGCA AATTCTTCTA ATGGAACAGT 7380 

TTGATCAAAA AAGTTAAATT TTACATAGCA ATGTATTGTT TTAAAAAATT GATTCTCTAG 7440 

GAAATAATTT ATGATAAAAC GTCGTTTATC ACGTTCCTCG CCTGAGACAT AAACTCCTTT 7500 

ATTCGCCCTA CTCTCAATGG ACAAATTATA CTCTGATAAC ATCACTCGTA TCTTTCTGAA 7560 
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ATCATGAGAT 


AATGTTGAAC 


GACTAACGTA 


AAGTTCATCA 


GCTAAATCAT 


CAAAAAGAAC 


7620 


TGGAACTTGC 


TCAAATAATA 


ATTTATTTAA 


GATAAATACT 


AAACGATCAT 


CACCTTTTGA 


7680 


AACCGCAGTT 


TTCGTATAGT 


CTTCTTCCAG 


TTCATAAGTT 


TGTCTAAACT 


CCTGGTAAGC 


7740 


GCCTTGATTC 


TCAAAAAATA TTTGATACCC 


TTGACCTTGT 


TTTGAAATCA 


ACCGGACTCC 


7800 


TTGAATAATC 


ATTGTCTTCT 


CAATTAATTT 


CACTACATTA 


CGGACAGTTC 


TATCTGAACA 


7860 


GGATAAATAT 


TCTGCCAGTT 


CTTTGCTTGT 


AACAAAACGT 


TCCTTATTTT 


TTATTAAAAA 


7920 


TTGAAGGATA 


TCTTTCTCTT 


TAATGTTtAA 


CACATTCATT 


CCCTCCTAAA 


ACGTATGTTT 


7980 


TCATATATTG 


AAGCATATTA 


TACACTTAAA 


TCAGTTTATA 


TCAAACTCAA 


AACAATTTAT 


8040 


CTTAACCTAA 


ATATTTATTG 


ACATTTCATG 


TGTTCATCAA 


ATATTCTCAA 


GAATCAAATT 


8100 


AGCCATTTTT 


TCAATTCCCA 


TTGGAATAGG 


AATATAGGCT 


TGAGGAGGTA TTTGTACAAC 


8160 


TGGTTTTCCT 


GCTTTAGAAC 


CAGCCTCTTC 


AAATTGCTTA 


AAGTACATTT 


TTGTTTGAGG 


8220 

#t A W 


ACTGACAAGA 


TACAAATCAA 


AAGCTGCTGC 


TGCGATAGCT 


TTCCCTCCTT 


CAGTAGCACT 


8280 

V w W W 


AATAGCATCA 


ACTACAATAT 


CTTTCCCTTT 


TCCTTTTAGA 


AACTCTGTTG 


* * 4 4 W 4 w 4 


8340 


CATAAGTGAT 


GAAGACATTC 


CTGCTGCACA 


AATAATTAAA 


GCTTTTGCCA 


TAATATTTTC 


8400 


TCCTTTTCTT 


AAATCCAATC 


AAAGCTGTGC 


TAAGTTGGCT 


TATTTGTTAT 


CTATTTTTAT 


8460 


TATAAAATAA 


AGCGTTTCCA 


ATGACAATTC 


CCTCATTTTC 


CTAAATGATA 


TGGAAAAAAA 


8520 


TTATTTATAC 


TTCAATTTAT 


AAAATAAAAT 


TATTCCTGAG 


AGTAGAAATG 


AAACACTATT 


8580 


TGCTAAAATC 


AAAGGCAAGT 


CTCCTATACG 


AATACCATGA 


GCAAGCCACA 


ATGCAATACC 


8640 


AATAACTTGC 


ATAACATACA 


TACCTAGAGC 


AATAGATCCT 


GTGTCCTTTG 


TCTTAACTAC 


8700 


ACGAAAAACT 


TGTGGTAAAA 


ATGCAAATGT 


TGTTAAAATT 


GCTGCAATAC 


TTCCAATCAT 


8760 


ATGTCACCTC 


AATATGCTAA 


ACAAACTGAG 


AATAATCTCA 




ACTATTCTAC 


8820 


TGATTCACCG 


TTAGATGAAA 


TAACTTCCTT 


ATACCAGCCA 


AAAGATTTTT 


TCGGGGAACG 


8880 


ATT AT AAC 11 


CCCTTCCCAT 


TATCATCTTT 


ATCTACATAA 


ATAAAGCCAT 


AACGTTTCCG 


8940 


CATTTCACCG 


GTACCAGCTG 


AAACCAAATC 


AATACATCCC 


CATGGAGTAT 


AACCCATTAA 


9000 


ATCAACACCA 


TCTTCAACTA 


CAGCCTTTTT 


CATTTCACGA 


ATATGGGCAC 


CTAGATATTC 


9060 


AATTCTATAA 


TCATCATGTA 


CCATACCATC 


TGCTGCAACT 


TGATCTATAG 


CTCCAAAACC 


9120 


ATTTTCAACA 


ATAAAGAGTG 


GTAAGTGATA 


GTGGTCTGTA 


AACCAATTTA 


ACGCATAACG 


9180 


CAAACCTTCT 


GGATCAATTT 


GCCACTCCCA 


TTCAGAAGCC 


TTAACATAAT 


TATTTTTCAC 


9240 


TAAATCTTCT 


GTTTCAAGAT 


AATCAAAATA 


AGGATTATTT 


TCACGATGAG 


AGTCGATAGC 


9300 


AAAGGACATA 


TAGTAACTGA 


AACCAATGTA 


ATCTACAGTC 


CCACCAACTA 


AATCTTCTTT 


9360 
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ATCCTGGGCA GTAAAATCAA CTGAAATACC TTTTCGTTCC CAATACTTGA AAATATGCTC 9420 

AGGATATTTA CCTAAAACAT GCACATCAGC AAAATAATAA CCCTTCTGCA TAGCTTTCAT 9480 

TGCCATTAAG ATATCCTTAG GATTGCAAGT AACTGGATAA ATTGGACACA TCGCAATCAT 9540 

ACAACCTATT TGAAAATCTG GATTAATCTC ATGACCAATT TTTACAGCTC GTGOAGAAGC 9600 

AACTAATTCG TAATGTGCTG CTTGATACAT AATTGCTTCT CTATTATCAC CTTCCTCATA 9660 

TACAATACCT GAGTTAGTAA ATGGTGCAAA ATCTTCCTGA TAATTCGCTT GATTATTGAT 9720 

TTCATTGAAA GTCATCCAAT ATTTAACCTT ATCTTTGTAA CGTTTAAATA CGACTTCTGC 9780 

AAAACGAGCA AAGAAATCAA TCAATTTCCT ATTTTTCCAA CCACCATATT CGGTCACTAA 9840 

GTGATAAGGC ATTTCAAAAT GAGATAGAGT GATGACAGGT TCAATACCAT TCTTTAAGCA 9900 

TTCATCAAAA AGATTATCAT AAAACTGTAA TCCTTCTTCA TTCGGCTCTA ACTCATCACC 9960 

TTTTGGAAAG ATACGTGTCC ATGCAATAGA GGTACGGAAG CACTTGAATC CCATTTCAGC 10020 

AAAAAGTGCT ATATCTTCTT TATAACGGTG ATAAAAATCT ATCGCCTCAT GATTTGCATA 10080 

ATATTTACCC TCTAAAACTC CCAAAGTAAT TTCACGAGCT ACTCCATGAC GACCAGCAGT 10140 

CATAACATCA GCAACACTAA TTCCCTTGCC ACCTTCTTGC CATCCACCTT CAAGTTGATC 10200 

AGCAGCAACA GCACCACCCC ATAAAAATCC ATCTTTAAAA GTAGTCATCT TTTTTCCTCC 10260 

TGACTTTGAT ACTCTTATTA TAAACCTTAA ACCAAAAGAT GAAAACGCAT TCTTTTTCCT 10320 

TATTGTTAAG GAAAGAAGTA ATTTTTAATG GAAATAGAAC AATATCTTCT TGTATTCTCG 10380 

TAATGATATC TTTACGATTT TCAATACTTT CAAACTACAA AAACTCTCAC AATAATTCTA 10440 

ATTCCCTGTG TCTATAAACG ACTTATCGCT TTCTGGCATC CCAGAATCAT CTTCTATATA 10500 

ACGTTCAACT TGCATCTGCA AGTGATATTT TTTTCTTAAA TCTAAGATTT TCTGCATTGT 10560 

CTTTGATTGA TAATGTTTAT CTAAAGTTTC TTGATTTATC CACTGATCAA TAAGGAGAAT 10620 

AGTTCCCTCT TTTTCAATTG GTAAAAAATA TTCGTATTTC AACTTACCTT TTTGATTTCT 10680 

AATTTCTTTA ACAAGGCCAC TATCAAGCAT TTCTCTTGCA AACTTTATTG CACTATCTCC 10740 

ATCACCTTTA TAATATACAT GAATAGTCAA TGTCATCTTA TATCCTCCAA AATCATCCTT 10800 

CAATTTTAAA AAAACAAGTT TAGATGAGGA TCTAAACTTG TTTTTTATGA ACTAATTATC 10860 

TAACGTTTCG CCATTACTTT CAATCACTTC TTTATACCAA TAAAATGATT TTTTCTTATA 10920 

GCGATTTATA GTCAATTGAA ACAAGAGCAG GACAAAAGAG CCTCATAAAA GGTATTGCAA 10980 

CTTGGTAATA CCTTTTTGAG GTCCTTTTTC ATATGAGCCC ATGTTTTCTC AATAGGATTG 11040 

TACTCAGGTG AGTAGGGAGG AAGAGGTAAA AGTTTATACC CAAACTCTTC ACACAAGAGT 11100 
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TCTAGCTTCC CCATTCTATG GAATCTTGCA TTATCCATAA TAATAACCCA TGGTGTGGTT 11160 

AATGTTGGTA AGAGAAACTT CTGAAACCAA GCTTCAAAAA AGTCGCTCGT CATCGTCTCT 11220 

TCGTAAGTCA TTGGAGCGAT TAACTCACCA TTTGTTAGAC CTGCAACCAA AGAAATCCTC 11280 

TGATATCTTC TTCCAGATAC TTT 11303 
(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3112 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

CCTTAGATTT CCACTTGCCA GAGGAATTGA TTGCCCAAAC GCCCCTTGAA AAACGTGATG 60 

CCTCCAAACT CCTCATCGTC AACCGTGAGA CAGGAGAAAT GCAAGATAAA CATTTCCACT 120 

CTATTATTGA TATGCTGGAA CCTGGTGATG CCCTTGTCAT GAACGACACC CGAGTTCTCC 180 

CTGCCCGCCT CTATGGTCAA AAAGTGGAGA CAGGAGGTCA TGTGGAACTT CTCCTCCTTA 240 

AGAACACTAG TGGAGACGAG TGGGAAGTTC TGGCTAAACC TGCCAAACGC CTCAAGGTCG 300 

GTACTCGTAT CAGCTTTGGT GATGGCCGCC TCAGCGCTGT CGTTACAGAA GAATTGACCC 360 

ACGGGGGACG CATTGTCCGC TTTGAATACC AAGGAATTTT CCTAGAAGTC TTGGAAAGTC 420 

TGGGAGAAAT GCCTCTGCCA CCTTATATCC ACGAAAAATT AGATGACCGT GAACGTTATC 480 

AAACCGTCTA CGCCAAGGAA AGTGGCTCTG CTGCAGCACC GACTGCTGGT CTTCACTTCA 540 

CCAAAGAACT GCTGGCAGAA ATCCAACCTA AGGGTGTTCA TCTAGTCTAT CTGACTCTCC 600 

ATGTCGGACT CGGAACCTTT AGACCTGTTT CTGTGGATAA TCTGGACGAA CACGAAATGC 660 

ACTCAGAGTT CTATCAACTT TCTGAGGAAG CTGCTGCCAC CCTTCGCTCT GTCAAAAAAA 720 

ATGGTGGTCG TGTCATCGCT GTCGGAACCA CTTCTATCCG CACCTTGGAA ACTATTGGTT 780 

CCAAGTTTGA TGGGCAAATC CAAGCAGATT CTGGTTGGAC CAATATCTTT ATCAAACCTG £40 

GGTATGAGTG GAAGGTCGTG GATGCCTTCT CAACCAACTT CCACCTGCCA AAATCAACTC 900 

TGGTCATGTT GGTTTCTGCC TTTGCAGGCC GTGAATTAGT CTTAGATGCC TACCACCATT 960 

CCATCCAAGA ACACTACCGC TTCTTCAGTT TTGGTGACGC CATGTTTATT TATTGAGAAA 1020 

GAATTTCTCT AAATCTTCTA ATACCAATAA ATCGCTAAGA TATTATTTCA AAGAACATCT 1080 

ACAATTGAAA CTCTAGCTAG CTGTAGAAGA GGCCTAGTAC ATTGAAATTA AAATGCTTCC 1140 

CCCTAGCTTC GAAAATATTG CCATAGATTG CGTTGACTCT CCAAATTGAT TCATCTATAT 1200 
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TTTATTTCAC CTTCCTATAC TTTCTTCGCT GTTTGTAAAT CAAAATGCAA GACACATGAG 1260 

TAGCACCATA TTTGTTACTC TTATCTGTCC TCTCAAGAGA CTATTATGAG TTATTTCAGA 1320 

ATCATTCACT ACTTTGACCC TGACTCTCCT TAGTCTCAAA ATCAAAGACT TATACTCTTC 1380 

AAAAATCTCT TCAAACCGCG TCAACGTCAC CTTGGATTAT ATATGTGatC TGaCTTCGTC 1440 

AGTTCTATCT ACAACCTCAA AGCAGTACTT TGAGCAACCT GCGACTAGTT TTCTAGTTTG 1500 

CTCTTTGATT TTCATTGAGT ATTAAACAAA AAGTGAACAA ATCTGAATTC TAATGTACAG 1560 

AAGACTAGGC TTGTTCACTT TTTTATAGTC GCTATAAGAT GACCTTATCT ATAGCTTTTT 1620 

ATATATAATT ATATATTCAG ACATACTATT ATCAATTTTG TCGCAGGGAG GAATCTGTTA 1680 

ACGCACCCAT TCACCATTAT CATTGACTCT ATAGCCATCT ATACTTGTAT TGACCGCTAA 1740 

CTCACCCGAT GTATTTACAT AATACCATTT ACCACCAACT TGGAACCATT GATTGACTTT 1800 

CATAGAACCG TTGCTGTTGA GGTAGTACCA TGAACTATTA ACTTGTACCC AACCTGTTGC I860 

CATGGAACCA TCAGTATTAT AAAAATACCA CATACCATTT TCTTGTTTCC AGTCTGTTGT 1920 

TGGAGCAACT GCTTTAGCTG GTTCTACTGC TACATCTGTT CCTTGGTTAG ATGTAACAGA 1980 

TACAGGATAC GAAGGAATAG ATGATTGCTC AGGAACAACA ACTTTTTCAG GTTCTCTCGT 2040 

CCCTCTCCTT ATACGTCTTT TTACCATCTC TTTAGTAATT TGACGAGAAG TACTTTCTTC 2100 

AATTGTTCCA TCACGTTCAT CTACAGTATA GATTGTAGTA AGAGTAATTT ACCAATTTCT 2160 

CCTACTTCTT CTACTTCTTG ACTTTTATCA AGAGTTGGGC CATCGAGATA TTCTGTTTCG 2220 

ATTGGAATTT CTTGGACAAG AACTTGGGGC TTGGTTCTTT TTTTAACAAC TCTTGTTTGA 2280 

GAGTCPTTTT TTTGACTTAA AGTACTCTCA GTTACTTGTC CACTCTTTCC ATCTACATTA 2340 

TAAGTTATCG TTGTAACTGT TTTCCCATTC TTTCCTAGAG TAATCTCTTG CTCCTGTCCT 2400 

GCAGAAAGGT CATTGTCTGC TTCATATTTA GTAGCAAATG GAACAAGAAC TTCTTCAACC 2460 

TTGCTTTTAG CTGGAACTTT GATAACTGTA TCCGTGGCTT CTTTTCTATC AACAGTAACC 2520 

TGTTCGGTAA CATAACCAGT CTCTGGATTA ACATCGTAGG TCCTTGTCGT AGTTACATAG 2580 

eCATCCTCTC CATCAATTGT AACAGGATTT TCACTACGGT CTTTTGTTTC ATCTTTTTCA 2640 

TAACGAATTC GCGTACTTGA AATTTTCTTG GTTACTACCT TAGGTTTAGT CGCTACTTTT 2700 

ACAATAATAT CCCCATTGTC AGCGTCATCA TACTCTATTC CCTCTTCTTT ATCTCTAGTA 2760 

TCATCTCTGA CATATTGAAT CCCATCAGCA GCATGAACAA AACTTGTATT CAGATTCCTC 2820 

CTAAAAATAA AGTTAGCCCG ATTACCGCAG AACCAAAAAT CTTTCCCAGT TTACGTATTG 2880 

CATAGCGCTT ATTAGTATTA GATTTTGCCA TTACATCCTA CTTCTAGTAT AGCATCTTTT 2940 
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CTATCAAACG TTAAACAATA TACGTTATAT ATAAAATAGA CTTAGAATGA TATATTGATT 3000 

ATTGAACTAA CACTTTAACT ATATCGTAAT CAATCTCATA TATAAAGGAT TGCAGACATC 3060 

TTATCTAAAT ACATGCGAAT ATATTTAGAT ACAAACATTC CAACTTGATA AT 3112 



(2) INFORMATION FOR SEQ ID NO: 117: 

(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4327 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

CCCAAAAATC TCTTCAAACC ACGTCAGCTT CGCCTTGCCG TAGTATCGTT ACTGACTTCG 60 

TCAGTTCTAT CCACAACCTC AAAACAGTGT TTTGAGCATC ATGCgGCTAG CTTCTTAGTT 120 

TGCTCTTTGA TTTTCATTGA GTATAAAAAC AGATGAGTTT CTGTTTTCTT TTTATGGACT 180 

ATAAATGTTC AGCTGAAACT ACTTTCAAGG ACATTATTAT ATAAAAGAAT TTTTTGAAAC 240 

TAAAATCTAC TATATTACAC TATATTGAAA GCGTTTTAAA AATGAGGTAT AATAAATTTA 300 

CTAACGCTTA TAAAAAGTGA TAGAATCTAT TTTTATGTAT ATTTAAAGAT AGATTGCTGT 360 

AAAAATAGTA GTAGCTATGC GAAATAACAG ATAGAGAGAA GGGATTGAAG CTTAGAAAAG 420 

GGGAATAATA TGATATTTAA GGCATTCAAG ACAAAAAAGC AGAGAAAAAG ACAAGTTGAA 4 80 

CTACTTTTGA CAGTTTTTTT CGACAGTTTT CTGATTGATT TATTTCTTCA CTTATTTGGG 540 

ATTGTCCCCT TTAAGCTGGA TAAGATTCTG ATTGTGAGCT TGATTATATT TCCCATTATT 600 

TCTACAAGTA TTTATGCTTA TGAAAAGCTA TTTGAAAAAG TGTTCGATAA GGATTGAGCA 660 

GGAAGTATGG TGTAAATAGC ATAGGCTGAT GTCCATCATT TGCTTATAAA GAGATATTTT 720 

AGTTTAATTG CAGCGGTGTC CTGGTAGATA AACTAGATTG GCAGGAGTCT GATTGGAGAA 780 

AGGAGAGGGG AAAATTGGCA CCAATTTGAG ATAGTTTGTT TAGTTCATTT TTGTCATTTA 840 

AATGAACTGT AGTAAAAGAA AGTTAATAAA AGACAAACTA AGTGCATTTT CTGGAGTAAA 900 

TGTCTTATTT CAGAAATCGG GATATAGATA TAGAGAGGAT CAGTATGAAT CGGAGTGTTC 960 

AAGAACGTAA GTGTCGTTAT AGCATTAGGA AACTATCGGT AGGAGCGGTT TCTATGATTG 1020 

TAGGAGCAGT GGTATTTGGA ACGTCTCCTG TTTTAGCTCA AGAAGGGGCA AGTGAGCAAC 1080 

CTCTGGCAAA TGAAACTCAA CTTTCGGGGG AGAGCTCAAC CCTAACTGAT ACAGAAAAGA 1140 

CCCAGCCTTC TTCAGAGACT GAACTTTCTG GCAATAAGCA AGAACAAGAA AGGAAAGATA 1200 

AGCAAGAAGA AAAAATTCCA AGAGATTACT ATGCACGAGA TTTGGAAAAT GTCGAAACAG 1260 
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TGATAGAAAA AGAAGATCTT GAAACCAATG CTTCAAATGG TCAGAGAGTT GATTTATCAA 1320 

GTGAACTAGA TAAACTAAAG AAACTTGAAA ACGCAACAGT TCACATGGAG TTTAAGCCAG 1380 

ATGCCAAGGC CCCAGCATTC TATAATCTCT TTTCTGTGTC AAGTGCTACT AAAAAAGATG 1440 

AGTACTTCAC TATGGCAGTT TACAATAATA CTGCTACTCT ACAGCGGCCT GGTTCGGATG 1500 

GGAAACAGTT TTACAATAAT TACAACGATG CACCCTTAAA AGTTAAACCA GGTCAGTGGA 1560 

ATTCTCTGAC TTTCACAGTT GAAAAACCGA CAGCAGAACT ACCTAAAGGC CGAGTGCGCC 1620 

TCTACGTAAA CGGGGTATTA TCTCGAACAA GTCTGAGATC TGGCAATTTC ATTAAAGATA 1680 

TGCCAGATGT AACGCATGTG CAAATCGGAG CAACCAAGCG TGCCAACAAT ACGGTTTGGG 1740 

GGTCAAATCT ACAGATTCGG AATCTCACTG TGTATAATCG TGCTTTAACA CCAGAAGAGG 1800 

TACAAAAACG TAGTCAACTT TTTAAACGCT CAGATTTAGA AAAAAAACTA CCTGAAGGAG I860 

CGGCTTTAAC AGAGAAAACG GACATATTCG AAAGCGGGCG TAACGGTAAC CCAAATAAAG 1920 

ATGGAATCAA GAGTTATCGT ATTCCAGCAC TTCTCAAGAC AGATAAAGGA ACTTTGATCG 1980 

CAGGTGCAGA TGAACGCCGT CTCCATTCGA GTGACTGGGG TGATATCGGT ATGGTCATCA 2040 

GACGTAGTGA AGATAATGGT AAAACTTGGG GTGACCGAGT AACCATTACC AACTTACGTG 2100 

ACAATCCAAA AGCTTCTGAC CCATCGATCG GTTCACCAGT GAATATCGAT ATGGTGTTGC 2160 

TTCAAGATCC TGAAACCAAA CGAATCTTTT CTATCTATGA CATGTTCCCA GAACGGAACG 2220 

GAATCTTTGG AATGTCTTCA CAAAAAGAAG AAGCCTACAA AAAAATCGAT GGAAAAACCT 2280 

ATCAAATCCT CTACCGTGAA GGAGAAAAGG GAGCTTATAC CATTCGAGAA AATGGTACTG 2340 

TCTATACACC AGATGGTAAG GCGACAGACT ATCGCGTTGT TGTAGATCCT GTTAAACCAG 2400 

CCTATAGCGA CAAGGGTGAT CTATACAAGG GTGACCAATT ACTAGGAAAT ATCTACTTCA 2460 

CAACAAACAA AACTTCTCCA TTTAGAATTG CCAAGGATAG CTATCTATGG ATGTCCTACA 2520 

GTGATGACGA CGGGAAGACA TGGTCAGCTC CTCAAGATAT TACTCCGATG GTCAAAGCCG 2580 

ATTGGATGAA ATTCTTGGCT GTAGGTCCTG GAACAGGAAT TGTACTTCGG AATGGGCCTC 2640 

ACAAGGGACG GATTTTGATA CCGGTTTATA CGACTAATAA TGTATCTCAC TTAGATGGCT 270O 

CGCAATCTTC TCGTGTCATC TATTCAGATG ATCATGGAAA AACTTGGCAT GCTGGAGAAG 2760 

CGCTCAACGA TAACCGTCAG GTAGACGGTC AAAAGATCCA CTCTTCTACG ATGAACAATA 2820 

GACGTGCGCA AAATACAGAA TCAACGGTGG TACAACTAAA CAATGGAGAT GTTAAACTCT 2880 

TTATGCGTGG TTTGACTGGA GATCTTCAGG TTGCTACAAG TAAAGACGGA GGAGTGACTT 2940 

GGGAGAAGGA TATCAAACGT TATCCACAGG TTAAAGATGT CTATGTTCAA ATGTCTGCTA 3000 
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TCCATACGAT GCACGAAGCA AAAGAATACA TCATCCTCAG TAATGCAGGT GGACCGAAAC 3060 

GTGAAAATGG GATGGTCCAC TTGGCACGTG TCGAAGAAAA TGGTGAGTTG ACTTGGCTCA 3120 

AACACAATCC AATTCAAAAA GGAGAGTTTG CCTATAATTC GCTCCAAGAA TTAGGAAATG 3180 

GGGAGTATGG CATCTTGTAT GAACATACTG AAAAAGGACA AAATGCCTAT ACCCTATCAT 3240 

TTAGAAAATT TAATTGGGAA TTTTTGAGCA AAAATCTGAT TTCTCCTACC GAAGCGAACT 3300 

AGAGAGATGG GCAAAGGAGA GATGGGCAAA GGAGTTATTG GCTTGGAGTT CGACTCAGAA 3360 

GTATTGGTCA ACAAGGCTCC AACCCTTCAA TTGGCAAATG GTAAAACAGC GACTTTCCTA 3420 

ACCCAGTATG ATAGCAAGAC CTTGTTGTTT GCAGTAGATA AGGAAGATAT CGGACAGGAA 34 80 

ATTATTGGTA TAGCTAAAGG AAGCATCGAA AGTATGCATA ATCTTCCTGT AAATCTAGCA 3540 

GGTGCCAGAG TTCCTGGCGG AGTAAATGGT AGCAAAGCAG CGGTGCATGA AGTTCCAGAA 3600 

TTTACAGGGG GAGTTAATGG TACAGAGCCA GCTGTTCATG AAATCGCAGA GTATAAGGGA 3660 

TCTGATTCGC TTGTAACTCT TACTACAAAA AAAGATTATA CTTACAAAGC TCCTCTTGCT 3720 

CAGCAGGCAC TTCCTGAAAC AGGAAACAAG GAGAGTGACC TCCTAGCTTC ACTAGGACTA 3780 

ACAGCTTTCT TCCTTCGTCT GTTTACGCTA GGGAAAAAGA GAGAACAATA AGAGAAGAAT 3840 

TCTAAACATT TGATTTTGTA AAAATGCCTC TTTCTCAACT GTAGTGGGTT GAAGTCAGCT 3900 

AAGCTCGAGA AAGGACAAAT TTTGTCCTTT CTTTTTTGAT ATTCAGAGCG ATAAAAATCC 3960 

GTTTTTTGAA GTTTTCAAAG TTCCGAAAAC CAAAGGCATT GCGCTTGATA AGTTTGATGA 4020 

GATTATTGGT CGCTTCCAAT TTGGCGTTAG AATAGTGTAG TTGAAGGGCG TTGACGATTT 4080 

TCTCTTTGTC CTTTAGAAAG GTTTTAAAGA CAGTCTGAAA AAGAGGATGA ACCTGCTTTA 4140 

GATTGTCCTC AATGAGTCCG AAAAATTTCT CCGGTTCCTT ATTCTGAAAG TGAAACAGCA 4200 

AGAGTTGATA GAGCTGATAC TGATGTTTCA AGTCTTGTGA ATAGCTCAAA AGCTTGTTTA 4260 

AAATCTCTTT ATTGGTTAAA TGCATACGAA AAGTAGGGCG ATAAAAATGT TTATCGCTGA 4320 

GTTTACG 4327 
(2) INFORMATION FOR SEQ ID NO: 118: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(DJ TOPOLOGY: linear 

{Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

CTCTGGCCCT GCCACTCCAA CGTTTTGTCA GGGTGCTTTT TTCATAAAGG AGTTCTTATG 60 
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TTAGATATCA AACGTATTCG TACAGATTTT GAAGCTGTCG CAGAAAAATT AGCTACACGT 
GGTGTAGATG CTGCTGTCTT GAATGAAATG AAAGAAATCG ATGCTAAACG TCCTAACATC 
TTGGTCAAGG TTGAAACTCT CAAAGCAGAA CGTAACACAG TTTCTGCTGA GATTGCCCAA 
GCTAAGCGCA ACAAGGAAAA TACAGATGAC AAGATTGCTG CCATGCAAAA TCTATCTGCT 
GAGGTTAAAG CCTTGGATGC TGAATTGGCA GAAATCGATG CTAAATTGAC AGAATTTACA 
ACGACTCTTC CAAATATCCC AGCTGACAGC GTTCCTGTTG GGGCTGACGA AGACGACAAT 
GTGGAAGTTC GCCGTTGGGG TACTCCACGC GAGTTTGACT TCGAACCTAA AGCTCACTCG 
GATCTCGGTG AAGACCTTGG TATCCTTGAC TGGGAACGCG GTGCTAAGGT AACAGGCGCT 
CGCTTCCTCT TCTATAAAGG CCTCGGTGCT CGTTTGGAAC GTGCTATCTA CAACTTTATG 
TTGGATGAAC ATGGAAAAGA AGGCTATACT GAAGTCATCA CACCTTACAT AGTCAACCAT 
GATTCTATGT TTGGTACTGG TCAGTATCCA AAATTTAAGG AAGATACTTT TGAACTCAGC 
GATACCAACT TTGTCTTGAT TCCAACTCCT GAAGTTCCTC TGACAAACTA CTACCGTCAT 



GAAATCTTAG 
TCTGAGGCTG 
AAGGTTGAAA 
ACAGCCAACG 
TCTACTGGAG 
GCACAAAACA 
CGTGCCCAAA 
AACGGTTCTG 
GAAGATGGTT 
ATCAAACCAT 
CAGATAAGCA 
CAGGTAATCC 
GATAATGGTT 
GGGCAGACGA 
GACCCAAGTC 
AGGAGCACAG 
ATCGCTATAA 



ACGGCAAAGA 
GTTCTGCCGG 
TGGTCAAATT 
CTGAAAACAT 
ATATGGGCTT 
ATTACCGTGA 



TCTTCCAATC 
TCGTGATACG 
TGCCAAACCA 
TCTTCAAAAA 
CTCAGCTGCG 
AATCTCAAGC 



TACTTCACTG CCATGAGTCC CTCATTCCGT 
CGTGGCTTGA 
GAAGAATCTT 
CTCAACCTTC 
AAGACTTACG 
TGTTCAAACA CAGAAGATTT CCAAGCCCGT 



TCCGTTTGCA CCAATTCCAC 
ACCAAGAATT GGAAAAAATG 
CATACCGTGT CGTTGCTCTC 
ACTTGGAAGT GTGGATTCCA 



TCCGTTACCG 
GACTTGCAGT 
CTGTGACCAT 
AAAAAATAAG 
CCTAGTACAA 
AGAAATGGAA 
GGGAAGGTGA 
GTTAAAACAA 
GGCAGCCTGG 
TAGATTCCTT 
ATAAGGGGAT 



TGATGAAGCA GATGGCAAGG 
TGGACGTACA GTGGCTGCAA 



TGAAACTCCT TCATACCTTG 
TTCTTGAAAA TTACCAAAAT 



CCCAGAAGCA CTTCGTCCAT ACATGGGTGC AGCTGAACTC 



GTTTAGCTAT TTCTAGCTAG 
AGAATAAAAT AGTTAGGCAT 
GTTTCAAAAT TCCCTGAGCC 



ACCTTTTTTC GTAACCAAAT 
ATAATGGTTT CAGCCAATAC 
ATCTTGAGCG AGGTCGCTGT 



GGGCTGAGAA GGCTGGTTGA AAACCTTGTT TTAAAATGTT 
AGAAAAAGAA GGATTGAGAA 



TTCCTCCTAC TCGAACTAGA 
CTTGTCCAAG CAAGGCTAGT 
AGAGATAGAA GGTCAAGAGA 



GCCAAAATCA TGACAATCAA 
GAAGCCAAGA GTAGAGAGAA 
GGGAGTGGAT GTTTCTTTAA 
AAACCAAAAC TCAAGGTCGC 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
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ATAGGCAATT TCGATAATAC CTACCAGAGG ATAGGTCAAG GCAGCCACTG CTATCCCCAC 1860 

ATAGAGAACC GTCCAGCTTG GAGTGGCATG AACCCTCCGC CCTGGACAAG CAAACTTGAT 1920 

GGTAAAACCA GCAATCAAGG TCAAATCCAA GAGAAATGAA AACCACCAAA TCCCTTGTGC I960 

TACCAAAGGA AGATAAGAGA ATACGCGAAA GACATAGGTC GATAAAATCA TCCCAGCCAT 2040 

AGGAAAGGTT GCCATTCCTG ACAAAAGAGG GGGCTTGGTC AATTCTTGCT TGGTTTCTTT 2100 

CCAATTAAAG AGATGCAGAA TTAGAAAGTA AATCCATAAA ACCAAACCAA TCAGACTAAA 2160 

AAGATGGGAT AGAACCGGCA ACGTATCTAA AATAAGATTT CCAGCTCCTG CCAAACCTAG 2220 

CAAACAACCT GAAAATACTA AGGGGAGTTT TTTCATCCTA ACCTCCAATA ATCATGTTAG 2280 

TTTCAGTATA ACATAAAAGC GCTTAAATGA GGATTTAAAA AAACGAGTCC GCTTATTTCA 2340 

GACTTCATTT TACTCAGATA TGAATTAGGC ATAAGGTTGC AATTCTGGAT TAATTGGTGT 2400 

ATTAGCTAAG TTGTTGGCAT AGTTACAGAG GATTGCTAGG CTGACACCAA AAACCACATC 24 60 

CAAGGCATTT TGTTGAGTGT AGCCAGCTTC TAAAAACTCA GACAAGGCTT CATCTCCTAC 2520 

ACGACCCTTG GTATTGATAA CTGCCAAGGT AAACTTAGCT AGGGTATCCA ATTTAGGATC 2580 

TGTTTCAATT GGAGTACGAT TGCGAAGAGC TTGAATCAAG TCATCATTCA TCTGGATTTG 2640 

TTTGATGGAA AAGGCTGTGT GACCTGCGAC ACAGAAGGCA CAACCATTGG TCACGGCTGC 2700 

CGTGATTTGC ACCACTTCAC GCTCAACGGG TGTCAGGCTG TTGCGACGGT GGATAGATGA 2760 

GACAATTTGG TAGGCTTCTA AAACAGTCGG GGCATTGGCC AAGAGACCGA TTAGGTTGGC 2820 

AATATAGCCA TTGTTGTCTT TTTCTACTCT TTCAAGAATT TCTTTCACTT CTGCTGGTGC 2880 

TGACTCTACT GTATGGATAG TAAATGTTGT CATAAGATAC CTCTTTTCTT ATTATTGACA 2940 

CTAATATTAT TGGAAAATCT TATAAAATCC TGATTCCTAA GTTTATCTAA GATAAAGCTT 3000 

TATTCTCTCA TAAGATTTTC GTTGTTATAT TAGTTTATCA CACTTCCAAT CACTTGTATA 3060 

ATATATATTA TATATCAGGC TGATAAAAAT TATTTATAGG CAAAAAAATC ACACGAGCTG 3120 

TGTGATTCCA TTATTTGTCA AAATACTTTT TAGTTTCAGC AATAACGACT GGCGACAAGA 3180 

CCAAGAGGGC AATCAAGTTT GGCAGAGCCA TCAAGGCGTT AACGATATCT GCGATAATCC 3240 

AGACCATATC CAACTCGATA AATCCTCCTA ACAAGACCAT GAGCACAAAA ACCACACGGT 3 300 

AGAGCCAGAT AAAGCGAACC CCAAAGAGGA ACTCAAAACA GCCTTCTCCG TAATAGTTCC 3 3 60 

AACCTAGAAT CGTTGTAAAG GCAAAAAGTA CAAGGAAGAT GGTCAAGAGA GCAGGCCCAA 3420 

AGTGTGAAAA GTTTGTTGAG AAAGCTGACT GAGTCAAGGC AACCCCATTC AAGTCACCGC 3480 

TCCAAACTCC AGTTACCAAG ATGGTCAAAC CAGTTAGAGT A 3521 
(2) INFORMATION FOR SEQ ID NO: 119: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1968 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
<D> TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


119: 






AACCTGGGCA 


AGCAAGCTAA 


AAGCAATGGG 


ACCTGGAATC 


CTAATGGCAA 


CTGCCGCTGT 


60 


TGGAGGTTCC 


CACATTGTAT 


CCTCAACTCA 


AGCTGGCGGT 


TCTTACGGTT 


GGTCTCTACT 


120 


TCTCTTGGTC 


ATCTTAGCCA 


ATGTCTTTAA 


ATATCCATTT 


TTCCGTTTTG 


GTGCTGAATA 


180 


CACAGCTGAT 


ACTGGAAAGA 


CTTTGGTTGA 


AGGTTATCCC 


GAAAAAGGAA 


AACTCTATCT 


240 


CTGGATTTTC 


TTTATCCTCA 


ATGTCTTTTC 


GGCTATGGTC 


AACACGGCTG 


GTGTTGCCAT 


300 


TCTGTGCTCA 


GCTATCATCG 


CCAGTGCCTT 


CCCAATGATT 


GGACTTAGCA 


TTACTCAGTG 


360 


GTCCCTCATT 


CTCGTTGCAA 


TCATTTGGGC 


TATGCTACTC 


TTTGGAGGCT 


ACAAACTTTT 


420 


AGACGGCATG 


GTCAAATGGA 


TTATGTCTGC 


CTTAACCATT 


GCGACTGTTC 


TTGCAGTTAT 


480 


CATTGCGGCG 


GTCAAGCATC 


CAGAATACAG 


TTCTGATTTT 


GTCGAGAAGA 


CACCTTGGCA 


540 


AATGGCAGCT 


CTGCCCTTCA 


TCGTCTCCCT 


CCTAGGATGG 


ATGCCGGCTC 


CTATTGAAAT 


600 


TTCAGCCATC 


AATTCACTTT 


GGTCAGCTGA 


AAAGAGAAAG 


ACCGTCAACT 


TTAACACAGA 


660 


AGACGCTCTG 


TTTGACTTTA 


ACACTGGTTA 


TATTGGAACA 


GCTATCCTAG 


CCGTCTTCTT 


720 


TGTGGCACTG 


GGAGCACTGA 


TTCAGTATCC 


TACAGGGCAG 


GCGGTTGAAG 


CTGCTTCAGC 


780 


CAAATACATC 


TCTCAATTCG 


TGGGCATGTA 


TGCCTCTGTT 


CTTGGCGAAT 


GGTCCCGTTA 


840 


CTTGATTACC 


TTTATTGCCT 


TCCTCTGTAT 


CTTTGGAACA 


GTTATAACTG 


TTATCGATGG 


900 


CTATTCTCGC 


GTTAATCAGG 


AATCTCTCCG 


ACTGCTAATC 


AGTCAAAAAG 


AGGACAATCG 


960 


TAAATCTTTG 


AACATCTGGA 


TGACCATCAC 


TGCTATCATC 


GGTATCGTCA 


TTATCAAGTT 


1020 


CTTCGCTGGT 


CAGGTTTCAA 


CCATGCTCCG 


CTTTGCCATG 


ATTGGCTCTT 


TCCTGACAAC 


1080 


ACCTTTCTTT 


GCTCTTTTGA 


ATTACGCCTT 


GGTAACGCGT 


GAAAACAAAA 


ATCTTCCTTC 


1140 


TTGGCTCAAA 


CACCTTGCCA 


TTGCGGGATT 


GATTTTCCTC 


TTTGCTTCCC 


CATCTTCTTT 


1200 


ATCTACGCAC 


TCGCAATCGG 


AAAAGCAGGG 


TAAGGGACAA 


GCGCGAGATG 


AAGATAAGGT 


1260 


TTCATTTCAA 


GAGAAAATTC 


AGCAAATATT 


TCTATGATAA 


AAAGCATAAG 


AACAAGGTTT 


1320 


TGAAGACCTG 


AACTTATGCT 


TTTTTACGTT 


CTTAAAGACT 


GTTTATACTC 


AAAAAACAGT 


1380 


TGAACAACTT 


CAACCACCTC 


TTATAAGAAC 


TTTATACTAT 


TCGAGAATCT 


CTTCAAACCA 


1440 
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CGTCAGCTCT ATCTGCAACC TCAAAGCTGT GCTTTGAGCA ACCTGCGACT AGCTTCCTAG 



1500 



TTTGCTCTTT GATTTTCATT GAGTATTAAT TCTCCTTTTC CAACTCATAC AAATCTGCGA 



1560 



TAATAGCTGC GACATGTTTG ATATCTTCCA GCATGCCTCG CATTTCAAAG TCAGCCAATA 



1620 



CAGGGAAGCC AAAGCGTTGA CTGTATTGCT TGGCTGTTAG GCAGTATTGG TTATTAAAGT 



1680 



TACGATTTCC TGACCCAACC ACACCAAAAC ACTTACTAGC ATTGTTACCA TAGGCAATAA 



1740 



AATCTCCCAC CGGTGTCGTC AAAATCTCAA CATCTCCGTT ATCCACGCCA TTCCCACCTT 



1800 



CGAGATAGGT CGGCAAAAAA GCGACATAGG GATGGTCCAT TTCATAGAAA TTTTTGCCTT 



1860 



CCTTGACCAA ATCCTTGATA TGAATCTTTT GAACCTCAAT CCCTTTGTAC TGGGACAAGA 



1920 



GATAGTCTTT CAAGCGCGTC ACAAAACTTT CAGTGTTGCC ACTCAAGG 



1968 



(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

CCGCATTTTT TATCACTAGA CTCGAGACAT CTTTTCAGTG GCTCTTGCTC TCTGGTTTAA 60 

TTTTCTTCCT TGCTCAAGGA CTCCTGCTAT TTCTCTTGGT CGTCCGACTC AAACATCAAT 120 

TCGCTGAGAT TTATCCTCAA ATCAATAAAA AGATTCGCTT CTACTATTTA CGGGTTCTCA 180 

CCATTGATTT TCTATTTTTT GTTCTCTTAG CCTTCATTAG TTCTCAGCGT TTTTCATCTC 240 

TTATGCCAAT CATCACTGCT TGCCATTCTA CTTTTTATTA TATGACAGCT GACTACCTAA 300 

GAGAAAACTA TCCAGACTTT TACGACAAAC ACATCTCTTT ATGGGAGTGT CTCTAAAGAA 360 

AAGGAGGTTT TAGCATGAAA AAAATCATCT TCATCAAAAC CATTCAACTC CTTGTCATTG 420 

ATGGAATCAT GCTGGCATTT TTGACATTTA AAAGGGGGCT TACTTGGGAC TGGATTTTGA 480 

TTTATAGCGG TTGGCTCATT TTCTTTCATC CTGTGCTATT GACCTATCTT TCAAACCAAC 540 

TTTGTGACCA CTTTAGTTAA CTCTATTCCC AGATTAGACC GAGATTCTGG CGTTTTGCTT 600 

TACAAATTCT CCTATGGGAT AGCCTGATGA TTCTCTCCTT GGTGTCTTTA AGTGATATTC 660 

CACTTTTCCT TCAGGGAACT CTCCTCATCC TAGGACATCT CATCCCTTCC TATCGCATCT 720 

GCCAAAGCCT GAAAAGAGAC TTCCCCCAAG CATATCAAGA ACCGATTTCT TTTTGGAGTA 780 

TTTTATGATA GATGAGAAAG ACCAAGCCGA CTGGGCTTGG TCTTTCTTAT CTCTTTTTAG 840 

TATCTAGGAT AATGGTAACA GGTCCATTAT TAACCAGCTC AACCTGCATA TCTGCTCCAA 900 
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AGATCCCTGT CTGAACCGGC ACTTCTTGCG CTAATTTTTG ATTGAAAGCA TCATAGAAGT 960 

CTGATGCCAT ATCAGGTTTA GCTGCCCCTG TAAAGgCTGG ACGATTCCCT CTCTTAGTAT 1020 

CCGCAAAGAG GGTAAACTGA GAAATAGAGA GGATTTCTCC TTCAATATCT TTGACAGACA 1080 

GGTTCATCTT GCCTTCTGCG TCTGAAAAAA TCCGCATATT GACCAGTTTT CTCACAGCAT 1140 

AGTCCAAATC TTCCTCTTGG TCCTCTGGTC CAACACCAAC CAGCAATAAA AGTCCCTGAT 1200 

TGATTTTTCC CTGAATCTGG CCTTCTATAC TCACTTGGGC TTTTTTAACC CGTTGGATAA 1260 

TGATTTTCAT AATAGCCTTT CTAGTAAGAG CTAGGACAAC TAGCCGTTGG TCCGTTTGAC 1320 

AGAGTAAACT TCTGGCACAC TCTTAATTTT ATCGACAACC GTGGTCAGTG TAGAGAGGTT 1380 

GGCAATACCG AAGgACACAT GGATATTAGC AAACTTCATA TCCTTGGTTG GTTGGGCATT 1440 

GACCGTTGAA ATATTCTTGG TTGTATTTGA AAGAACTTGC ACTACATCGT TCAACAGTCC 1500 

TGTACGGTTG AGACCGTAGA TATCGATATG GGCCATATAC TCCTTATTTG AGCTAGGGTA 1560 

CTGGTCTTCC CATTCCACAT CAAGGAGACG TTGCTCGTAG TTTTCTTGGG CACGCAGGTT 1620 

CATACAGTCC ACACGGTGAA TAGCCACACC ACGACCCTTG GTAATGTAGC CAACAATATC 1680 

GTCACCAGGC ACGGGGTTAC AACACTTAGC AATCCGCACT AGGAGACCAG AAGCACCTTC 1740 

AATAACCACT CCCCCCTCAT GCTTGACCTT GAGGGTTTCT TTATTTTCAA CCTTGACCTC 1800 

GCCACCTTTG ACAAGCTCCT CTGCCTCAGC TTTGGCCTTG GCACGCTCTT CCTCACGGCG 1860 

TTCCTTTTCA GTCAGACGGT TAAAGACGGT AATCGCACCG ATTTCCCCAA AACCAATGGC 1920 

CGCAAAGAGG GAGTCTTCTG TCTTGTAACT GGTCTTTTGC AGAACTTGAT CCATGTGGCG 1980 

CTTGTCCATA AATTTATTTG CCACATAGCC ATTTTCTTGG AACTGAGCCA TCAGCATCTC 2040 

ACGACCCTTG TTGACAGACA ATTCCTTATC TTGGTTTTTA AAGAACTGCC GAATCTTATT 2100 

GCGCGCCTTG CTAGTCTTGA CCATATTGAG CCAGTCACGG CTAGGTCCAA AGGAGTTCGG 2160 

GTTGGCGATA ATTTCAACCT GATCCCCTCT CTTTAACTTG GTTGTCAGTG GAACCATGCG 2220 

GCCATTGACC TTGGCACCAG TTGCTTTTTC ACCGACCTTG GTATGGATTT CGTAGGCAAA 2280 

ATCAATCGGT CCTGAATCTT TGGGAAGGGA ACGGACAGCT CCATCTGGGG TAAAAACCTA 2340 

AATCTCCTCA GCCAAATAGT TTTCCTTAAC AGAGTCCACA AATTCCTTAG CATCATCAGC 2400 

CTGGTCTTGG AGCTCCATCA TCTCCTTGAT CCAGTTCATT CCAATAGCTG ATTCCTTGCT 2460 

GTTAACTTGC CCCTTTATAC CTTTCTTATA AGCCCAGTCA GCCGCAACCC CGTACTCAGC 2520 

CACCTCGTGC ATTTCCTTGG TTCGAATCTG GAATTCAATC GGCCCTTTTG GTCCATAAAC 2580 

AGTCGTATGG ATAGACTGAT AACCATTGGC CTTGCGGTTG GCGATATAGT CTTTGAAGCG 2640 
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ACCTGGCATC 


GGTTTCCAAA 


ATTCATGCAC 


GTAACCAAGC 


ATGGCATAAA 


CATCACTTTG 


2700 


GGTATCTAAA 


ATACAACGAA TAGCAATCAG 


ATCATAGATT 


TCCTCAAACC 


GTTTTCTCTT 


2760 


GTCCTGCATT 


TTGCGGAAAA 


TTGAGTAAAT 


ATGCTTGGGA 


CGACCATAAA 


TCTTCCCTTT 


2820 


CAAGTGACGT 


TCTGTCGTAT 


ACTCCTCTAA 


TTTTGTGACT 


ACCTCATCCA 


CCAAGGCCTC 


2880 


ACGCTCCCTG 


CGCTTTTCCT 


TCATCATATG 


GGTAATCTTG 


TAAAACTCCG 


TTGGATTGAG 


2940 


ATAACGGAAA 


GACAAGTCTT 


CTAATTCCCA 


TTTGACACTG 


GAAATCCCCA 


AACGATGGGC 


3000 


AAGCGGGGCA 


TAGATTTCCA 


TGGTTTCTTT 


GGAAATACGC 


TCCTGCTTGT 


CTTTTCGAAG 


3060 


ATGTTTCAGG 


GTCCGCATAT 


TGTGCAAGCG 


GTCAGACAGT 


TTGACCAAAA 


TAACGCGGAT 


3120 


GTCCTCAGAC 


ATGGCCATGA 


GCATCTTGCG 


ATGATTTTCC 


GCTAATTGCT 


CCTCGATCGA 


3180 


TTTGTACTCG 


ACCTTGCCAA 


GCTTGGTAAC 


TCCGTCAACA 


ATCATCCCCA 


CATCAGGACC 


3240 


AAACTCTCTT 


TCCAAATCGT 


CCAAAGTCGC 


ATCTGTATCT 


TCCACCACAT 


CATGCAAGAA 


3300 


TCCACAAGCT 


ACTGTTACAG 


CATCCAGCTT 


TAGCTTAGCT 


AAAATACCTG 


CCACTTGGAT 


3360 


AGGGTGAATG 


ATATAAGGCT 


CGCCTGATTT 


GCGATATTGA 


CCACTGTGGC 


ATTCAACAGC 


3420 


ATAGACCAAG 


GCCTTATCGA 


CAAAATGAAC 


ATCCTCTTCC 


GTTAAATATT 


CTTTGGTTAA 


3480 


AGCGACAACT 


TCTTCCCCTG 


TTAAATTCAC 


TTCTTTCGGC 


ATCTCTACTC 


TCCAATTCTT 


3540 


CCTACCATTT 


TATCACTTTT 


TTAAGAATAT 


GAAAACTAGA 


TTGGAACAGA 


ATAAGAAAAA 


3600 


AATAATTCAA 


AATTGCTTGA 


TAATTCTGAA 


TTATTGGTCC 


GTAATATACT 


ACGAAGTTAG 


3660 


ATTTTAAACT 


TAGGTGATAG 


AAGGAGAGAT 


AGAAGAACGG 


AAACCATATT 


GTAACCCAAA 


3720 


GACTTTCTGA 


CTTCCCCAAT 


TCCATTGAAG 


ATACGAAAGA 


TAAACGGTGG 


AACTCGTATC 


3780 


ACATACACTG 


GTACCTTGAC 


TGGATTTTGG 


AATTAATACT 


AAATGAAAAT 


CAAAGAGCAA 


3840 


ACTAGGAAAC 


TAGCCGCAGC 


TTACTCAAAG 


CACCGCTTTG 


AGGTTGCAGA 


TAAAGTTGAC 


3900 


GCGGTTTGAA 


GAGATTTTTG 


AAGAGTATAA 


AAATCCTCAA 


GATACTTTCT 


TCTATCCTTT 


3960 


n\* I 1 1A1 f\J\\j 


CAGAATACCT 


ATGAAAAAAA 


CTGCTATTTC 


TATCTTTGCT 


CTCCTAATGT 


4020 


TAGGAGTTTG 


CTGCCTGTTC 


CTATTCAGCC 


AGCAAAGCTA 


TAAAAAACAG 


TCGTTCAATA 


4080 


CTATGCTAAC 


GACCAGAACC 


TGCCCAGTAG 


GATAACTTAT 


AGTGAATATA 


GCGACAAATG 


4140 


AGAAGCCAAC 


TACGGTAGCA 


CTCTAAACAT 


CACGTCTATC 


AAACAAGCTA 


ATGACGGAGT 


4200 


TTATGCAACC 


TATGAAGGGC 


AATTGACACC 


TTTCCAATAT 


TGATAAATTG 


ATAACCAGCC 


4260 


TGTCTTCATC 


TAGTCATGCT 


GGTTTTTAAG 


TTCATTTTAA 


ATCCTTACCT 


ATTCTCCCTA 


4320 


ACTGTGCTAT 


ACTTAATTTA 


TACTCAATGA 


AAATCAAAGA 


GCAAACTAGA 


AAGCTAGCCG 


4380 


CAGGCTGTTC 


AAAGCACTGC 


TTTGAGGTTG 


CAGATAAAGT 


TGACGCGGTT 


TGAAGAGATT 


4440 
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TTCGAAGAGT ATTAGTACAT TCTTTGAGAT TGGAGCTAGT ATGAAAATCC ATAAAACCGT 4500 

GAATCCTGTT GCCTATGAAA ATACCTATTA TCTAGAAGGC GAAAAGCACC TCATCGTCGT 4560 

CGATCCTGGT AGTCATTGGG AAGCCATTCG TCAGACAATC GAGAAGATCA ACAAACCGAT 4620 

CTGTGCTATT CTCTTGACCC ACGCCCATTA TGACCATATC ATGAGTCTGG ACTTGGTTCG 4 680 

CGAGACGTTT GGCAATCCTC CTGTCTATAT CGCAGAGAGC GAAGCCAGCT GGCTCTACAC 4740 

TCCTGTCGAT AATCTCTCCG GTCTCCCTCG CCACGATGAT ATGGCAGATG TGGTCACAAA 4 800 

ACCTGCAGAA CACACCTTTG TCTTTCACGA AGAATACCAA CTAGAGGAAT TTCGTTTTAA 4860 

GGTTCTACCG ACCCCAGGGC ACTCTATCGG TGGTGTTTCC CTAGTCTTTC CTGATGCTCA 4920 

TCTAGTCTTG ACGGGAGATG CTCTATTCCG CGAAACTATC GGACGGACCG ACCTTCCGAC 4980 

TGGTAGCATG GAGCAACTCC TTCATAGTAT CCAGACCCAA CTCTTCACCC TACCAAACTA 5040 

CGATGTCTAT CCAGGACATG GTCCAGCTAC TACTATCGCT CACGAAAAGG CCTTCAATCC 5100 

CTTTTTCTAG CAAGATGATG ACAATCGAAA TTTAAGTAAA CTATCCAGCA AATCTTTCTA 5160 

TTACAAAAGG CATCCTATCA AGGTTTTCAC ACATGATTGG ATGCCTTT T T TCTGATGACT 5220 

AGATTTTTTG CATTACCAAA TAATCACGCG CTCCTCTGGT GAACGCCACA TTCCGTCTCC 5280 

TTCTTTGACA TCATAGGTTC TAAAGAAATC CTCCAAGTTT GGTACTTGCA CATTGACACG 5340 

GAGTTTGGCT GGTGCGTGCA CATCGACGCT AGCCAAAAGT TTCATAAATT CTGGTCGACC 5400 

TTTCATGCGC CAGATGCGAC CGAAGTTGTA GAAGAACTCT TCTGCTGAGA AGTCTGCTTC 5460 

TCTCTTAGCT GCTTCAAGCG CTGCTGCGAT TCCTCCCAAG TCAGCCACGT TTTCTCATAC 5520 

AGTCAATTTA CCGTTAATGG TTGCTCCATA AGAATCCTGT CCATCAAATT GGTCAATGAC 5580 

TTTTTGTGTT TTCTCCTTCA AGGCAGCATA GTCGCTCTCT GTCCACCAAT CCTTGAGGCT 5640 

ACCATTTTCG TCAAAGGAAG CCCCGTTAGT ATCAAAGGCG TGGGAAATTT CATGGGCAAT 5700 

CACTGCCCCA ATACCACCGT AGTTAGCAGA AGATGACTGA TGCAAGTCAT AGAAAGGCGC 5760 

CTGTAAAATG GCCGCTGGAA AGACAATCAG GTTCTTCTCA GGATTGTAGT AGGCATTGAC 5820 

CATATGAGCA GGCATGCCCC ATTCCTTATA ATCTACAGGC TGGTTCCACT TACTCCAACT 5880 

GTGCTTGATT TCCACACGCG CAAAGGCTAG AGCATTCTCA AAAAGACTGG CAGTTTCATT 5940 

CACTACCTTA TCCTTGTAAC GTGCAGGCAA TTCTTCTGGA TAGCCAATAT AAGGTTTGAT 6000 

CACATTGAGC TTCACGATAG CCTGTTTACA GGTTTCTGGA GTGAGCCAGT CATTCTTAAG 6060 

CAGACGCTCC TTATAAACAT CAATCATGGT TGCCACTTTT TTCTCCACAT CCGCCTTGGC 6120 

TTCTGGAGAG AACTTCTCAC GGGCGTACCA AAGACCCAGG GCTTGCTTGA AAGGTTCTTG 6180 
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TGCTAGATGA 


TAAGCTGCTT 


TGACCTTATC 


TTTTGCCTCT 


GGAACTCCAG 


AAAGGGCACG 


6240 


GCTGTAGGCA 


CCAGACAAAA 


CACGGATATC 


CTCTGTTAAA 


TAGCTGGTTG 


AAAGATTGAC 


6300 


AACACTCAAA 


ATCAAGGTTG 


CTTTAAGGAG 


AGACCAGGCT 


TCCTCACTGT 


AGAATTGCTC 


6360 


TGCTCCTTGC 


CAGAAACGTT 


CCTCGTCTAC 


AATAACCTTG 


TCTGGTAATT 


GCCCAATAAC 


6420 


TGCTTTGAAG 


AAGTCATCCA 


AAGGTAGGGC 


AGGCGCGAAT 


TTCTTGAAAT 


CTTCGTAAGA 


6480 


ATATGGATGA 


TAGAGTTTAG 


CATATTCTGA 


ACTTTCTTCA 


TTAGAGAGCA 


CCACTGCCGC 


6540 


AACTCGGCGG 


TCCAATTCAA 


GTCTTTTTTC 


TAGCAAGTCT 


TCAATTTCTT 


CATCAGAGAA 


6600 


ATCATAAGCC 


TTGAGGAGAT 


TTGCGCTGCT 


TTCTTTCCAA 


AGAGTCAAGA 


GCTCTTCGCG 


6660 


CTGAGGATGT 


TCTTCTGCAT 


AGTAGGTCGT 


ATCTGGCAAG 


ATTGTGCTTG 


GAGCGCTAGC 


6720 


CCATAGAACA 


TTGATTCTAG 


CATCCATAAA 




&CACCAAAAG 


GAAGGAAGTT 


6780 


TGGTTTTCCT 


GCAAGCTCAA 


ACTCTGCTAG 


TTTAGCTGTA 


AAATCCGCAA 


AAGTCTCCAA 


6840 


TTCTTGGAAT 


TCTTTAAGGA 


GTGGTAAGAC 


AGGTGTGATA 


CCGTCAGCTT 


CTCTCTTGTC 


6900 


AAAATCACGA 


ACTAGGCGGT 


GGTATTTGAC 


AAAGTTTTCC 


AAGATAGCAT 


CCTCAGGCAC 


6960 


TTCTTCACCT 


GCTAACCACT TGTCTGTTGT 


CCCCAGCATC 


AGGTCTTCAA 


TTTCCTGGTC 


7020 


TAAATCAACA 


AAACCTCCTG 


TTTGAGACTT 


ATCTGCTGGG 


ATTTCAGCTG 


TCTGTTGCCA 


7080 


TTCTCCATTG 


ATAGCATCAT 


AAAAATCATC 


TTGATAACGT 


GTCATCTTGT 


TCTCGCTTTC 


7140 


ATTTGTATTT 


GCATTTATCT 


TAACAAAAAT 


CG 






7172 



(2) INFORMATION FOR SEQ ID NO: 121: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



lxi> SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

CGGGAAGTTA TGCGATCTAG ACTTCGTTCC TGTACAGCTA CTTTCTCAGG TGGTCTTGTT 60 

GTTTGTATGA GTTTGTTTAG AGAGGATCTT TCTATGTCTT TCTTTCTTAT TTTTGTTTTA 120 

TATGCTTTTC TGATTTCTTA TCTAATTTAT GGTTATTTCA GACTAAAAAG GAAATACCGA 180 

GTAGATGAAT AGCAAGGTTC TAGGTCTTCA GATTGATTTT TAGCACTCTT GATAAAAGAG 240 

TGCTAATTTT TTGAGTTTTT GTCTTGACAT TCTCTTCTAA GGGTGTATAA TAGAATCATG 300 

AGTTAGCACT TGGATGCATT GAGTGCTAAT TGATCAGACA GAGAGGAGTG ATGAGATGGT 360 

TACAGAGCGT CAGCAGGATA TTTTAAATCT GATTATTGAC ATCTTTACCA AAACGCACGA 420 
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ACCTGTCGGA TCAAAAGCCT TGCAAGAGTC TATTAACTCT AGCAGTGCAA CCATTCGTAA 4 80 

TGACATGGCG GAACTAGAAA AACAAGGGTT GCTTGAGAAG GCTCATACTT CAAGTGGTCG 540 

GATGCCAAGT GTTGCTGGTT TTCAGTACTA TGTGAAACAC TCACTGGATT TTGACCGGCT 600 

GGCTGAAAAT GAGGTATATG AGATTGTCAA AGCCTTTGAT CAGGAATTCT TCAAATTGGA 660 

GGATATTCTG CAAGAGGCTG CTAACTTACT AACAGACCTG AGTGGCTGTA CGGTAGTGGC 720 

ACTGGATGTT GAGCCGAGCA GGCAACGTTT GACAGCCTTT GATATCGTTG TTTTGGGGCA 780 

ACATACAGCC TTGGCGGTAT TTACCCTAGA CGAGTCGCGA ACGGTTACTA GTCAGTTTCT 840 

GATTCCAAGG AACTTCTTGC AGGAGGATTT GCTGAAACTG AAGAGCATCA TTCAGGAACG 900 

TTTCCTCGGT CACACCGTTT TAGATATTCA CTACAAGATT CGGACGGAGA TTCCGCAGAT 960 

TATCCAGCGT TACTTTACAA CAACGGATAA TGTCATCGAT CTCTTTGAAC ACATCTTTAA 1020 

GGAAATGTTC AACGAAAACA TTGTGATGGC GGGCAAGGTC CATCTCTTGA ATTTTGCCAA 1080 

TCTAGCAGCC TATCAGTTCT TTGACCAACC GCAAAAGGTG GCCTTGGAGA TTCGTGAGGG 1140 

GTTGCGTGAG GATCAGATGC AAAATGTTCG TGTTGCAGAC GGTCAAGAGT CCTGTTTAGC 1200 

TGACCTAGCG GTAATCAGTA GTAAGTTCCT CATTCCTTAT CGGGGAGTTG GAATTCTAGC 1260 

CATTATCGGT CCAGTTAATC TGGATTACCA ACAGCTAATC AATCAAGTCA ATGTGGTCAA 1320 

CCCTGTTTTG ACCATGAAGT TGACAGATTT TTACCGCTAC CTCAGCAGTA ATCATTACGA 1380 

AGTACATTAA GATTGAAATC ATTAAAGGAG GCGAACATGG CCCAAGATAT AAAAAATGAA 1440 

GAAGTAGAAG AAGTTCAAGA AGAGGAAGTT GTGAAAACAG CTGAAGAAAC AACTCCTGAA 1500 

AAGTCTGAGT TGGACTTGGC AAATGAACGT GCAGATGAGT TCGAAAACAA ATATCTTCGC 1560 

GCTCATGCAG AAATGCAAAA TATCCAACGC CGTCCCAATG AAGAACGTCA AAACTTGCAA 1620 

CGTTATCGTA GCCAGGACTT GGCAAAAGCA ATCTTACCAT CTCTTGACAA CCTTGAGCGT 1680 

GCACTTGCAG TTGAAGGTTT GACAGATGAT GTGAAGAAGG GCTTGGGGAT GGTGCAAGAA 1740 

ACCTTGATTC ACGCTTTGAA AGAAGAAGGA ATTGAAGAAA TCGCAGCAGA TGGCGAATTT 1800 

GACCATAACT ACCATATGGC CATCCAAACT CTCCCAGCAG ACGATGAACA CCCAGTAGAT 1860 

ACCATCGCTC AAGTCTTTCA AAAAGGCTAC AAACTCCATG ACCGCATCCT ACGCCCAGCA 1920 

ATGGTAGTGG TGTATAACTA AGATATAAAG CCCGTAAAAA GCTCGCAGTA AAAATAGGAG 1980 

ATTGACGAAG TGTTCGATGA ACACAAGAAA ATCTATCTTT TTTACTCAGA GCTTAGGGCG 2040 

TGTTCGATTC GGCAATTCTG ACGGTAGCTA AAGCAACTCG TCAGAAAACG GCAATCGCTA 2100 

TGGCGTTTGC CTAGCTTCCT TACTAACTCG TCGTCGAAAT AAAATCGATT TCGACTCCTC 2160 
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GTGTCGCAAT TTACATAATA GAAAACTTGT CCGAAACGAC AATAAACTAT GAAGAAAGAT 2220 

AAAATATGTT TGGCTTTGTA ATAGTGAGCG AAGCGAACCA AACACGATAC TCTTCGCCGT 2280 

GGCGCTATTT GCGCAAATTT TGAGACCTTA GGCTCAAAGT TTAGTCAAAG AGATTGACGA 2340 

AGTCAAGCTC TGACGGCGTC GCCACTGTCG CCACTTAAGA AGAGTATCAA AAAGAAAAAT 2400 

AGAAAATTAA CTAACAAGGA GAAAAACACA TGTCTAAAAT TATCGGTATT GACTTAGGTA 2460 

CAACAAACTC AGCAGTPGCA GTT C TTGAAG GAACTGAAAG CAAAATCATC GCAAACCCAG 2520 

AAGGAAACCG CACAACTCCA TCTGTAGTCT CATTCAAAAA CGGAGAAATC ATCGTTGGTG 2580 

ATGCTGCAAA ACGTCAAGCA GTTACAAACC CAGATACAGT TATCTCTATC AAATCTAAGA 2640 

TGGGAACTTC TGAAAAAGTT TCTGCAAATG GAAAAGAATA CACTCCACAA GAAATCTCAG 2700 

CTATGATCCT TCAATACTTG AAAGGCTACG CTCAAGACTA CCTTGGTGAG AAAGTAACCA 2760 

AAGCTGTTAT CACAGTTCCG GCTTACTTCA ACGACGCTCA ACGTCAAGCA ACAAAAGACG 2820 

CTGGTAAAAT TGCTGGTCTT GAAGTAGAAC GTATTGTTAA CGAACCAACT GCAGCAGCTC 2880 

TTGCTTATGG TTTGGACAAG ACTGACAAAG AAGAAAAAAT CTTGGTATTT GACCTTGGTG 2940 

GTGGTACATT CGACGTCTCT ATCCTTGAAT TGGGTGACGG TGTCTTCGAC GTATTGTCAA 3000 

CTGCAGGGGA CAACAAACTT GGTGGTGACC ACTTTGACCA AAAAATCATT GACCACTTGG 3060 

TAGCAGAATT CAAGAAAGAA AACGGTATCG ACTTGTCTAC TGACAAGATG GCAATGCAAC 3120 

GTTTGAAAGA TGCGGCTGAA AAAGCGAAGA AAGACCTTTC TGGTGTAACT TCAACACAAA 3180 

TCAGCTTGCC ATTTATCACT GCAGGTGAGG CTGGACCTCT TCACTTGGAA ATGACTTTGA 3240 

CTCGTGCGAA ATTTGACGAT TTGACTCGTG ACCTTGTTGA ACGTACAAAA GTTCCAGTTC 3300 

GTCAAGCCCT TTCAGATGCA GGTTTGAGCT TGTCAGAAAT CGACGAAGTT ATCCTTGTTG 3360 

GTGGTTCAAC TCGTATCCCT GCCGTTGTTG AAGCTGTTAA AGCTGAAACT GGTAAAGAAC 3420 

CAAACAAATC AGTAAACCCT GATGAAGTAG TTGCTATGGG TGCGCCTATC CAAGGTGGTG 3480 

TGATTACTGG TGATGTCAAG GACGTTGTCC TTCTTGATGT AACGCCATTG TCACTTGGTA 3540 

TCGAAACAAT GGGTGGAGTA TTTACAAAAC TTATCGATCG CAACACTACA ATCCCAACAT 3600 

CTAAATCACA AGTCTTCTCA ACAGCAGCAG ACAACCAACC AGCCGTTGAT ATCCACGTTC 3660 

TTCAAGGTGA ACGCCCAATG GCAGCAGATA ACAAGACTCT TGGACGCTTC CAATTGACTG 3720 

ATATCCCAGC TGCACCTCGT GGAATTCCTC AAATCGAAGT AACATTTGAC ATCGACAAGA 3780 

ACGGTATCGT GTCTGTTAAG GCCAAAGACC TTGGAACTCA AAAAGAACAA ACTATTGTCA 3840 

TCCAATCGAA CTCAGGTTTG ACTGACGAAG AAATCGACCG CATGATGAAA GATGCAGAAG 3900 

CAAACGCTGA AGCCGATAAG AAACGTAAAG AAGAAGTAGA CCTTCGTAAT GAAGTAGACC 3960 
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AAGCAATCTT TGCGACTGAA AAGACAATCA AGGAAACTGA AGGTAAAGGC TTCGACGCAG 4020 

AACGTGACGC TGCCCAAGCT GCCCTTGATG ACCTTAAGAA AGCTCAAGAA GACAACAACT 4080 

TGGACGACAT GAAAACAAAA CTTGAAGCAT TGAACGAAAA AGCTCAAGCA CTTGCTGTTA 4140 

AACTCTACGA ACAAGCCGCA GCAGCGCAAC AAGCTCAAGA AGGAGCAGAA GGCGCACAAG 4 200 

CAACAGGGAA CGCAGGCGAT GACGTCGTAG ACGGACAGTT TACGGAAAAG TAAGATGAGT 4260 

GTATTGGATG AAGAGTATCT AAAAAATACA CGAAAAGTTT ATAATCATTT TTGTAATCAA 4 320 

GCTGATAACT ATAGAACATC AAAAGATTTT ATTGATAATA TTCCAATAGA ATATTTAGCT 4 380 

AGATATAGAG AATTATATTA GCTGAACATG ATAGTTGTAT CAAAAATGAT GAAGCGGTAA 4440 

GGAATTTTGT TACCTCAGTA TTGTTGTCTG CATTTGTATC GGCGATGGTA CCGTATCTGA 4500 

CGAACGTTCA GCTTATAT 4518 



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 5 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

TGCTATTTTC GATTCCCTTG GGCGTTTTGA TTGCCTTTGC CTTGCAAGTC CATTCGAAGC 60 

CCCTCCATTA TCTGATTAAC ATTTACATCT GGGTTATGCG AGGAACCCCC TTACTCTTGC 120 

AACTGATTTT TATCTATTAT GTGCTCCCAA GTATTGGGAT TCGTTTAGAC CGCCTTCCTG 180 

CAGCTATTAT TGCCTTTGTT CTCAACTATG CAGCTTACTT TGCAGAAATT TTCCGTGGGG 240 

GAATTGACAC TATTCCAAGA GGACAGTATG AGGCCGCCAA GGTCTTGAAG TTTAGCCCTT 300 

TTGACAGAGT CCGCTATATT ATCTTGCCCC AAGTCACCAA GATCGTTCTT CCTAGTGTOT 3 60 

TTAATGAAGT TATGAGTTTG GTCAAGGATA CTTCTTTGGT CTATGCTCTC GGAATTTCAG 420 

ACCTTATCTT GGCTAGTCGA ACAGCTGCTA ACCGCGATGC TAGTCTAGTT CCTATGTTCT 480 

TGGCAGGAGC CATTTATTTG ATTTTGATTG GGATTGTGAC AATTATTTCC AAAAAAGTTG 54 0 

AGAACAAGTA TAGTTATTAT AGATAGGAGG CTGCCATGTT AGAATTACGA AATATCAATA 600 

AAGTCTTTGG AGACAAACAA ATCCTGTCTA ATTTCAGTCT AAGTATTCCT GAAAAGCAAA 660 

TCCTGGCTAr CGTTGGACCT TCTGGTGGAG GTAAGACAAC TCTTTTACGT ATGCTTGCAG 720 

GTCTTGAAAC CATTGATTCA GGGCAAATCT TTTATAATGG ACAACCTTTA GAGCTGGATG 780 
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AATTGCAGAA GCGCAATCTA CTGGGATTTG TCTTCCAAGA TTTTCAACTA TTTCCTCATC 840 

TATCAGTTCT GGAAAATTTG ACTTTATCGC CTGTGAAGAC CATCCGAATC AAGCAGGAAG 900 

AGGCTGAGAA GAAGGCGAGT GGACTCTTGG AACAGTTAGG ACTAGGAGGA CACGCAGAGG 960 

CCTATCCTTT CTCACTATCT GGTGGGCAAA AGCAGCGGGT GGCTTTGGCG CGTGCTATGA 1020 

TGATTGACCC AGAAATCATT GGCTACGATG AACCAACTTC TGCCCTGGAT CCAGAATTAC 1080 

GTTTGGAAGT GGAGAAGCTA ATCTTGCAAA ATAGGGAACT TGGGATGACC CAGATTGTGG 114 0 

TTACCCATGA TTTGCAGTTT GCTGAAAATA TCGCAGATGT ATTATTGAAA GTAGAACCTA 1200 

AATAGGAGGA AAAATGGATG AAAAAATGGA TGCTTGTATT AGTCAGTCTG ATGACTGCTT 1260 

TGTTCTTAGT AGCTTGTGGG AAAAATTCTA GCGAAACTAG TGGAGATAAT TGGTCAAAGT 1320 

ACCAGTCTAA CAAGTCTATT ACTATTGGAT TTGATAGTAC TTTTGTTCCA ATGGGATTTG 1380 

CTCAGAAAGA TGGTTCTTAT GCAGGATTTG ATATTGATTT AGCTACAGCT GTTTTTGAAA 1440 

AATACGGAAT CACGGTAAAT TGGCAACCGA TTGATTGGGA TTTGAAAGAA GCTGAATTGA 1500 

CAAAAGGAAC GATTGATCTG ATTTGGAATG GCTATTCCGC TACAGACGAA CGCCGTGAAA 1560 

AGGTGGCTTT CAGTAACTCA TATATGAAGA ATGAGCAGGT ATTGGTTACG AAGAAATCAT 1620 

CTGGTATCAC GACTGCAAAG GATATGACTG GAAAGACATT AGGAGCTCAA GCTGGTTCAT 1680 

CTGGTTATGC GGACTTTGAA GCAAATCCAG AAATTTTGAA GAATATTGTC GCTAATAAGG 1740 

AAGCGAATCA ATACCAAACC TTTAATGAAG CCTTGATTGA TTTGAAAAAC GATCGAATTG 1800 

ATGGTCTATT GATTGACCGT GTCTATGCAA ACTATTATTT AGAAGCAGAA GGTGTTTTAA 1860 

ACGATTATAA TGTCTTTACA GTTGGACTAG AAACAGAAGC TTTTGCGGTT GGAGCCCGTA 1920 

AGGAAGATAC AAACTTGGTT AAGAAGATAA ATGAAGCTTT TTCTAGTCTT TACAAGGACG 1980 

GCAAGTTCCA AGAAATCAGC CAAAAATGGT TTCCAGAAGA TGTAGCAACC AAAGAAGTAA 2040 

AAGAAGGACA GTAAGATAAA ATAGTGGCTG AAACTGCGTT TTGATTAGCA AAACGTAGTT 2100 

TTTTTTGTAA TCTAGGAAAA CGATAATAGC GATTGAATAT GGATAATTGA ATATGGAATA 2160 

GCCCACTGTG ATTTCTAAAA CATTGTTAAA AATTGATTTG ACTTCCAAAA TTAAAATGTT 2220 

CTGTAATGAA ATACTGATGT AACTGTTTTA GGAACAATAA AACGCATAAT ATCAAGCTTT 2280 

TTGCACCTTA CATTATGCGT TTTTGTGATT TTAAGACTTG TTAGCTGATT TTTTACAATC 2340 

CTGCGAAATC TTTGATTTCT TGTGCTGACA TTGAAGAGTC GCAACGGACG TTGATTTGTC 2400 

CATCTGTAAT ATGAACAAAA CCTGCTACAG TTGGGATTCC ATAGCGTGAG CGGAATGCTT 24 60 

GCAAATCATT GAGTTGGCTT GGTTCTTCAC TATTGATGAA GTAAATGTGA GCTTTGGTTT 2520 

CAGCTACGAC ACCTGACAAT GTACCTGCAA ATTTACGGCA GTAAGGGCAA GTTTTGCGAC 2580 
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CCATAAAGAA GGTTGCAGTT TCTTTTTTAT CAAGAGCTTC TTGCGCACGC ACAACTGTAG 2640 

TGACTTCAAG GTCTTTGATG TTATCTAAAA ATTGTTCCAT GAGATTACCT CGCTTTCATT 2700 

GATAAGTCTA GTATGCCATA AAGTTTCTAA AATTGCTTAG ATTTGATACG AAAAAAGATG 2760 

AGGTTGGTTG GTCTCATCTT TTATAGGTCT TTATTTTACA AATGCATTGA TTTCTGCTTC 2820 

GATGTTAGCA ATCTTAGCTT GTGATTCTTC GTTGGTTTCC CCTACAACTG CAATGTAGAA 2880 

CTTGATTTTT GGTTCTGTAC CTGAAGGGCG AACGGCAATC CATGAACCGT CAGCAAGTGT 2940 

GTATTTCAAC ACATCACTTG GAGGAGTTGT CAAGTTTGTA ACAGTACCGT CAGCAACAGT 3000 

AGCAGTTTGT GCCTTGAAGT CTTCTACGAC AGTGATAGCT GTTGCGTTCC ATTCTGTTGG 3060 

AGCATTGTTG CGGAATTTAG CCATAATCGC TTTGATTTGT TCAGCACCAT CGACACCTGA 3120 

AAGAGTAACA GAGATTGTTT TTTCTGCGTA GTAGCCATAT TCTTTATAGA TTTCTTCGAT 3180 

ACCGTCAGCA AGTGTCAAAC CACGAGAACG GTAGTAGGCA GCAAGTTCAG CAACTACAAG 3240 

AACGGCTTGG ATGGCATCTT TATCACGTAC AAATGGTTTA ATCAAGTAAC CGAAGCTTTC 3300 

TTCAAATCCC ATCATGTAAG TGTGGTTGTG TTTTTCTTCG AATTCTTGGA TTTTTTCAGC 3360 

GATAAATTTG AAACCTGTCA AGACGTTGAA CATAGTTGCG CCGTAGCTTT CAGCAATCTT 3420 

CGTTACCAAG TCAGTTGAAA CGATAGATTT GCAGAGAGCG GCATTTTCAG GAAGAGTTCC 3480 

AGCGTTTTTG TGAGCTTCCA AGATGTATTT ACCCATGATA GCACCGATTT GGTTACCTGA 3S40 

AAGGTTGAGG TAGCTACCAT CTTTTTGAAG AACTTCAACA CCAACACGGT CAGCGTCTGG 3600 

GTCAGTTGCG ACAAGAACAT CTCCACCAAC TTGACGACCA AGTTCTTCAG CAAGGGCAAA 3660 

GGCTGCTTGG CTTTCTGGGT TTGGAGATGT TACAGTTGAA AAGTCTGGGT CAGCAGTTGC 3720 

TTGCGCTTCA ACAACTTGAA CACAGTCAAA TCCTGCTTGG GCAAGAGCAC GACGAGCCAA 3780 

CATTTCACCA GTACCATGAA GTGGTGTGTA GACAATCTTC ATGTCTTTAC CAAATTCTTC 3840 

AATCAAGGCT GGGTTGATGT TTATGTCCTT AACCTCTTTA ACGTATTCTA TGTCAACAGC 3900 

TTCGCCGATA ACTTCAATCA AGCCAGAAGC TTTTTCAGTT TCCACATCAG CAACTTCAAC 3960 

TGCAAATGGG TTTTCGATTG CACGGATATA AGTAGTCAAA GCGTCCGCAT CGTGTGGAGG 4020 

CATTTGTCCA CCGTCTTCAC CGTAAACCTT GTAACCGTTA AATGGAGCAG GGTTGTGGCT 4080 

GGCTGTGACC ATGATACCTG CGAAACAGTT GAGATGACGA ACTGCAAATG ATAGTTCTGG 4140 

AGTCGGACGA AGGCTTTCAA ATACGTAAGA TTTGATGCCG TGTTTAGCAA GAACTGCCGC 4200 

AGATTCAAAG GCAAACTCAG GTGAGAAGTG ACGGCTATCG TAGGCAATTG CTACACCGCG 4260 

TTCTTTCTCG TTTCCACCTT TTGACTCAAT CAAACGAGCC AATCCTTCAG TAGCTTGGCG 4320 
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AACAACGTAG 


ATGTTGATAC 


GGTTTGTACC 


AGCACCAACC 


AAGCCACGCA 


TACCTGCAGT 


4380 


ACCAAATTCA 


AGATTTGTAT 


AGAAGGCATC 


TTCCTTAGTT 


TTTTCGTCCA 


TATTTTCCAA 


4440 


ATCTTGACGA 


AGGTAGTCAC 


GAAGCTCCAC 


AAAATCAACC 


CATTTCTGGT 


AATTTTCTTG 


4500 


GTAAGACATT 


CAAATTCTCC 


TTTATTTTTA 


AAACATTTAA 


TCAGTTTAAT 


TATATCATTT 


4560 


TTTTTAGTTT 


TAGTAAAACC 


TTATCTGCTT 


CGAACATCTC 


TTCAAACCAG 


GTCAGATTGA 


4620 


ATTTTGGGGT 


TATATGATGT 


TGAGGCTAGG 


AAAAATTCAA 


TTTCAGTAAA 


AAAAGTAAGT 


4680 


CTTCTCATAA 


CAAAACATTG 


ATATAGTTAC 


TTAGTTTTAA 


ACAAGCATAT 


TATAATAAAG 


4740 


CTATGGCATA 


TAGTACTGAT 


TTTAAACAGC 


GAGCATTAGA 


TTACATCAAA 


GAGGGGCACA 


4800 


GCCATGTCGA 


GGCAGCCAAG 


TTTTTTGGTG 


TTGGCGTCAG 


AACTCTCTTC 


ACCTGGGAAA 


4860 


AGAAAGACGT 


GAACAAGAAC 


ACATAGAGAG 


GAAAAAGCGA 


GTCGTCAAAA 


ACCGAAAGAT 


4920 


TCCTTTAGAG 


GAATTGAAAG 


CCTTTGTAGA 


GGCTCATCCA 


GATGCVH'TT 


TACGGGAAAT 


4980 


TGCGGCACAT 


TTTGATTGTG 


CTGTTCCTTC 


AGTATGGGCA 


GCTTTAAAGC 


AGATTAAGGT 


5040 


CACTTTAAAA 


AAAGATGACG 


AGCTTTAAGG 


AACAAGACCC 


AGAAAAGTAG 


CCTTATTTCT 


5100 


TAAGAATTTT 


AATAGTTTAA 


AGCACCTAGC 


ACCTGTTTAT 


ATTGATGAAA 


CAGGAATCGA 


5160 


CCGCTATCTC 


TATCGTCCTT 


ATGCAGGGGC 


TCCTAGAGGG 


GAGAAAGTCT 


ATGAAAAGAT 


5220 


TAGCGGACGT 


CGTTTTGAGC 


GAACTTCAAT 


TGTTGCAGGA 


CAAGTAGACG 


GAGAGTTTAT 


5280 


AGCTCCCATG 


ATTTACAAGA 


AAAGCATGAC 


AAGCGATTTC 


TTTGTGGAGT 


GGTTCAAAAC 


5340 


GCAACTCCTA 


CCTGCTTTGA 


AGACACCTCA 


TGTTATTGTC 


ATGGGCAATG 


CTGGTTTTCA 


5400 


TCCCAAGAAC 


ATTTTGGATG 


AACTCTGCAT 


CCAAGATAAA 


CACTTTTTCT 


TACCTCTACC 


5460 


ACCTTATTCA 


CCGGATTTGA 


ATCCTATTGA 


GCAAGCTTGG 


GCTATCTTGA 


AAAAGAAAGT 


5520 


GACGGATGTA 


TTAAGGGAAG 


TTCCAACTAT 


TTTTGAATGT 


TTGGAATGCT 


TTTTTAAAAC 


5580 


TAGATGACTA 


TAACGGTTCT 


AAAGGAACCT 


ATCGAGTAGT 


CATTAAAACT 


AAGGATACTG 


5640 


v. l vj\j i i A/tun 




TACAATCAAA 


CCATTCACCG 


TGTAGCCGAA 


ATCGTTCAGA 


5700 


ATGAAGACTT 


GTATCAGAAT 


GAAGACTTGT 


ATAAGAAAGG 


TTTGAATGTT 


GAACTTGCGC 


5760 


ACCAACAAAT 


TAAGGGATTT 


TTTGAAGCAG 


AGTTTAAAAA 


TCGTATTAAT 


GGAGTTCTTA 


S820 


ATACTAAAAT 


AAAAAATAGT 


ACATTAAATC 


GTGTAAATAA 


AAAAACTATA 


CACCAGAGCA 


5880 


ACAAAAACTC 


CATCATCAAT 


TTGAAGCAGA 


AGCAACGGAA 


GATGCTAAAA 


AACAAGGCGA 


5940 


TATTGTGTTG 


AATGTTGACC 


AGGATTTCAT 


GAGCATATCT 


AAGTCTAATA 


AAAGTGGTTC 


6000 


AGACTGGAAG 


AAAACTTTCA 


CAGTGAGGAT 


AACCAATAGG 


CTACCAAATG 


ACTTGAATAA 


6060 


TGTCTTGAAA 


CAGGTTGATA 


AAGATACTCC 


TAATACCCCA ACTTGGCTAA 


ACTCAGCTGC 


6120 
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TTCTAAAGCT AAAGATGATC ACAGAGTATA TAAACTACTG AAGACTCTTA TACCAGGAGA 6180 

AAATTACCTA TCATGTTAAG GATAATCAGC TAGAAGTAGA AACAGATAAA TACACATATA 6240 

CTGCCGCTAG AAATGGTAGT AAGGAAGTTG GTATTCAAGA GTCAGATATA GCAGCAACTC 6300 

TAAGTGCCGA TGAATATAAT TCTAATCGCC AAACTTTTGA GAGAGAATAC AAATACAAAA 6360 

GCAAATGCCC TTAATAATGG TTGGGCTAGA TCTGGTTCTG AAGAGTTCAA AAAGTTCTCC 6420 

CACTTTGTAG GGGTAGACAA AGGGATTGTG CGAACGAATG TACTGACTGG TAAAAAACTA 6480 

TCTGATAAGA TTAGGAAAGA AGTGGGCTCT GGAGATAGCA AACTAGGAAA AGGCGGCTAT 6540 

TTCTCTACTG GGGATGTTCT ATTAGGAAAA GATGTTGTTT CTTATACCGT ACAAGTATTT 6600 

TCAGAGAATA ATGAAAGAGT AGGAGTAAAC ACTCAAAGTC ACCGTGTTCA GTATAATCTC 6660 

CCAATTCTAG CTGACTTTTC AGTCATCCAA GATACTGTGG AACCATCACG AACCGTTGTT 6720 

GAAAAAATCA TTCCAAAACT AAATATTCCC GAAGAAGAGA AAGGGAAAAT AACCGAAGAA 6780 

ATCAAGAAAA AGAAAAAAAC CTCAGAATTG GCAGAACTAA TCTCAGAAAA TGTGAAAGTT 6840 

CGCTATGTTG ATGAACAAGG GCGTTTGCTA TCATTGAAAA ATGATACTGG AATTGGAGAA 6900 

AAAGAAAGTG ACGGAACCTA CATTACCAAT AAAAAACAAC TGATTGGTAC CAGCTATAAT 6960 

GTCACACATA AAAAACTCAG TAGCATGACT ACTACTGACG GAAAATATTA TACTTTTAAA 7020 

GAAGCAGATA CAAATTCTGC AAGTTTAACT GGGAATATTG TAAGCGAAGG TAGAACAGTG 7080 

ACCTTAGTTT ATAGAGAAAG CGAAGCGCCA ACCACTGCTA CAGTAACAGC CAATTACTAT 7140 

AAAGAAGGTA GGCAAGAGAA GTTGGTAGAG TCTCTTATAA AAGCTGATTT AGCGATAGGT 7200 

TCTGAGTATA CCACAGAATC AAAAACTATT GAAGGGAAAA CAACAACTGA GGACAAAGAA 7260 

CACCGAGTTA TCACAAGCAA AACAACATAC ACCTTGGTAG CAACTCCTGA AAATGCGTAC 7320 

CAGAAGACGG TGCAACAGTT GACTATTACT ACCGTGAGAA TGTTGAGGAA ACAGTGGTTC 7380 

CCAAAACAGC AACCTCTACT GAGACGAAGA CTATAACGCG TATCATTCAT TACGTTGATA 7440 

AAGTTACGAA CCAAAATGTA AAAGAAGATG TTGTTCAACC TGTAACCTTA AGCCGTACAA 7500 

AAACTGAGAA CAAGGTCACG GGAGTTGTAA CCTACGGTGA ATGGACAACA GGAAACTGGG 7560 

ACGAGGTTAT ATCTGGTAAG ATTGACAAGT ACAAAGATCC AGATATTCCA ACAGTTGAAT 7620 

CACAAGAAGT TACGTCAGAC TCTAGTGATA AAGAAATAAC GGTAAGGTAT GACCGTTTAT 7680 

CAACACCAGA AAAACCAATC CCACAACCAA ATCCAGAGCA TCCAAGTGTT CCGACACCAA 7740 

ACCCAGAACT ACCAAATCAA GAGACTCCAA CACCAGATAA ACCAACTCCA GAACCAGGTA 7800 

CTCCAAAAAC TGAAACTCCA GTGAATCCAG ACCCAGAAGT TCCCACTTAT GAGACAGGTA 7860 
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AGAGAGAGGA ATTGCCAAAC ACAGGTACAG AAGCTAATGC TACCTTGGCT AGTGCTGGTA 7920 

TCATGACCTT GTTAGCTGGT CTAGGATTAG GATTTTTCAA GAAAAAAGAA GATGAAAAAT 7980 

AATAGATTTT AGAATCTAGG AACCAGGAAA AGCTCACAGA TGTGGGCTTT TTTCCTGGTT 8040 

TTGAGAACGA GGTCTTTCGT AAAGAATAAA AACGCTTACA AGTCTGTTGA ACTGGGAAAC 8100 

TATGAATCCT ATTTTTTTAA AAATATTTCC AGAAATCAGT TGCGG 8145 



(2) INFORMATION FOR SEQ ID NO: 123: 

ti) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8697 base pairs 
(8> TYPE: nucleic acid 
(C) STRANDEDNESS : double 
{ D ) TOPOLOGY : 1 i near 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 123: 



CGGTACCGGG 


AACGATACTT 


AGTCTAATTT 


TGCACCTTTT 


CCATGTATGG 


TAAAGGTTTT 


60 


TCTTTTTTTA 


AAAAGGAAAA 


CGAGAAGAGG 


AGGTTCTTAT 


GAAACCAAGC 


ATTGCCTTGC 


120 


AAGTTTTACC 


CCTAGTACAG 


GGGATTGATC 


GGATAGCTGT 


TATTGATCAG 


GTCATTGCTT 


160 


ATCTGCAwAC 


TCAACAAGTG 


ACGATGGTAG 


TGACACCATT 


TGAAACGGTC 


TTGGAAGGGG 


240 


AGTTTGATGA 


GCTTATGCGC 


ATTCTAAAAG 


AAGCGCTGGA 


AGTGGCAGGG 


CAGGAGGCAG 


300 


ACAATGTCTT 


TGCCAATGTC 


AAAATAAATG 


TAGGAGAGAT 


TTTAAGTATT 


GATGAGAAAC 


360 


TTGAGAAGTA 


TACTGAGACG 


ACACATTAGT 


CTATTGGGCT 


TTCTCGCAGT 


ATTGTCAATC 


420 


TGGCAGTTAG 


CAGGTTTTCT 


TAAACTTCTC 


CCCAAGTTTA 


TCCTGCCGAC 


ACCTCTTGAA 


480 


ATTCTCCAGC 


cctttuttcg 


TGACAGAGAA 


TTTCTCTGGC 


ACCATAGCTG 


GGCGACCTTG 


540 


AGAGTGGCTT 


TACTGGGGCT 


GATTTTGGGA 


GTTTTGATTG 


CCTGTCTTAT 


GGCTGTGCTC 


600 


ATGGATAGTT 


TGACTTGGCT 


CAATGACCTG 


ATTTACCCTA 


TGATGGTGGT 


CATTCAGACC 


660 


ATTCCGACCA 


TTGCCATAGC 


TCCTATCCTG 


GTCTTGTGGC 


TAGGTTATGG 


GATTTTGCCC 


720 


AAGATTGTCT 


TGATTATCTT 


AACGACAACC 


TTTCCCATCA 


TCGTTAGTAT 


TTTGGACGGT 


780 


TTTAGGCATT 


GCGACAAGGA 


TATGCTGACC 


TTGTTTAGTC 


TGATGCGGGC 


CAAGCCTTGG 


840 


CAAATCCTGT 


GGCATTTTAA 


AATCCCAGTT 


AGCCTGCCTT 


ACTTTTATGC 


AGGTCTGAGG 


900 


GTCAGTGTCT 


CCTACGCCTT 


TATCACAACT 


GTGGTATCTG 


AGTGGTTGGG 


AGGTTTTCAA 


960 


GGTCTTGGTG 


TTTATATGAT 


TCAGTCTAAA 


AAACTGTTTC 


AGTATGATAC 


CATGTTTGCC 


1020 


ATTATTATTC 


TGGTGTCGAT 


TATCAGTCTT 


TTGGGTATGA 


AGCTGGTCGA 


TATCAGTGAA 


1080 


AAATATGTGA 


TTAAATGGAA 


ACGTTCGTAG 


AATTAGAATG 


TTTCTGAAAA 


AGAAAAGAGG 


1140 
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AAATCAAAAT GAAGAAAACA TGGAAAGTGT TTTTAACGCT TGTAACAGCT CTTGTAGCTG 1200 

TTGTGCTTGT GGCCTGTGGT CAAGGAACTG CTTCTAAAGA CAACAAAGAG GCAGAACTTA 1260 

AGAAGGTTGA CTTTATCCTA GACTGGACAC CAAATACCAA CCACACAGGG CTTTATGTTG 1320 

CCAAGGAAAA AGGTTATTTC AAAGAAGCTG GAGTGGATGT TGATTTGAAA TTGCCACCAG 1380 

AAGAAAGTTC TTCTGACTTG GTTATCAACG GAAAGGCACC ATTTGCAGTG TATTTCCAAC 1440 

ACTACATGGC TAAGAAATTG GAAAAAGGAG CAGGAATCAC TGCCGTTGCA GCTATTGTTG 1500 

AACACAATAC ATCAGGAATC ATCTCTCGTA AATCTGATAA TGTAAGCAGT CCAAAAGACT 1560 

TGGTTGGTAA GAAATATGGG ACATGGAATG ACCCAACTGA ACTTGCTATG TTGAAAACCT 1620 

TGGTAGAATC TCAAGGTGGA GACTTTGAGA AGGTTGAAAA AGTACCAAAT AACGACTCAA 1680 

ACTCAATCAC ACCGATTGCC AATGGCGTCT TTGATACTGC TTGGATTTAC TACGGTTGGG 1740 

ATGGTATCCT TGCTAAATCT CAAGGTGTAG ATGCTAACTT CATGTACTTG AAAGACTATG 1800 

TCAAGGAGTT TGACTACTAT TCACCAGTTA TCATCGCAAA CAACGACTAT CTGAAAGATA I860 

ACAAAGAAGA AGCTCGCAAA GTCATCCAAG CCATCAAAAA AGGCTACCAA TATGCCATGG 1920 

AACATCCAGA AGAAGCTGCA GATATTCTCA TCAAGAATGC ACCTGAACTC AAGGAAAAAC 1980 

GTGACTTTGT CATCGAATCT CAAAAATACT TGTCAAAAGA ATACGCAAGC GACAACGAAA 2040 

AATGGGGTCA ATTTGACGCA GCTCGCTGGA ATGCTTTCTA CAAATGGGAT AAAGAAAATG 2100 

GTATCCTTAA AGAAGACTTG ACAGACAAAG CCTTCACCAA CGAATTTGTG AAATAATGAC 2160 

AGAAATTAGA CTAGAGCACG TCAGTTATGC CTATGGTCAG GAGAGCATTT TAGAGGATAT 2220 

CAACCTACAG GTGACTTCAG GCGAAGTGGT TTCCATCCTA GGCCCAAGTG GTGTTGGAAA 2280 

GACCACCCTC TTTAATCTAA TCGCTGGGAT TTTAGAAGTT CAGTCAGGGA GAATTGTCCT 2340 

TGATGGTGAA GAAAATCCCA AGGGGCGCGT GAGTTATATG TTGCAAAAGG ATCTGCTCTT 2400 

GGAGCACAAG ACGGTGCTTG GAAATATCAT TCTGCCCCTC TTGATTCAAA AGGTGGATAA 2460 

CGCAGAAGCT ATTTCCCGAG CGGATAAAAT TCTTGCGACC TTCCAGCTGA CAGCTGTAAG 2520 

AGACAAGTAT CCTCATGAAC TTAGCGGTGG GATGCGCCAG CGTGTAGCCT TACTCCGGAC 2580 

CTACCTTTTT GGGCACAAGC TCTTTCTCTT AGATGAGGCC TTTAGCGCCT TGGATGAGAT 2640 

GACAAAGATG GAACTCCACG CTTGGTATCT TGAGATTCAC AAGCAGTTGC AGCTAACAAC 2700 

CCTGATCATC ACGCATAGTA TTGAGGAGGC CCTCAATCTC AGCGACCGTA TCTATATCTT 2760 

GAAAAATCGC CCTGGGCAGA TTGTTTCAGA AATTAAACTA GATTGGTCTG AAGATGAGGA 2820 

CAAGGAAGTC CAAAAGATTG CCTACAAACG TCAAATTTTG GCGGAATTAG GCTTAGATAA 2880 
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AGGGAGTTGG 


TGAAGATTAT 


CCTTTACCAG 


CGCCCTTTTT 


CTTTTAAAAA 


2940 


Tn an a aaatt 


TCGGTATAAT 


AGTCAAACAA 


GGTCAAGGTT 


TAAAGAGAGA 


GGTGGGTTTG 


3000 




TAAAAATACA 


TCGGATCATA 


TTGAGGCCTA 


CATCAAGGCG 


ATTTTAGATC 


3060 


AATPTflfSTAT 


CGTGGAGTTG 


CAACGGAGTC 


AGTTGGCAGA 


TACCTTTCAG 


GTTGTTCCTA 


3120 


flTr'AflATTAA 


CTACGTGATC 


AAGACACGCT 


TTACGGAAAG 


TAGAGGCTAC 


TTGGTTGAAA 


3180 




TGGCGGAGGC 


TACATTCGTA 


TAGGACGGAT 


TGAGTTTTCT 


AGTCATCATG 


3240 


AAA 1 ui. 1 v-V_Vj 




TACTCGATTG 


GTGAGCGAGT 


CAGTCAAGAA 


ATTTATGAGG 


3300 


AlAl Iv,IV_UA 


GCTTTTGGTT 


GAGCAGGAAT 


TCATGACCAA 


GCAGGAGATG 


AATTTGCTAG 


3360 


ft. ft ff»f» */■■«¥* ft/"*/"" 

AATt. AV* I AuL 


TTTGGATCGC 


GTTTTAGGAG 


AAGAAGCTCC 


AGTTGTTCGA 


GCAAACATGC 


3420 


TACGTCAOAT 


CATACAAGAG 


GTAGATAGAA 


AAGGGAAGTA 


AGATGAACTA 


TTCAAAAGCA 


3460 


nwtvt ft ft TV IV & 1* 


GTATCGAAAG 


TGCCTACATG 


GTTGCTGGAC 


ATTTTGGACC 


TCGTTATCTA 


3540 


GAGTCGTtAjC. 


ACTTGTTGAT 


TGCCATGTCT 


AATCACAGTT 


ATAGTGTAGC 


AGGGGCAACT 


3600 


TTAAATGATT 


ATCCGTATGA 


GATGGACCGT 


TTAGAAGAGG 


TGGCTTTGGA 


ACTGACTGAA 


3660 


ACGGACTATA 


GCCAGGATGA 


AACCTTTACG 


GAATTGCCGT 


TCTCCCGTCG 


TTTGCAGGTT 


3720 


CTTTTTGATG 


AAGCAGAGTA TGTAGCGTCA GTGGTCCATG 


CTAAGGTACT 


AGGGACAGAG 


3780 


CACGTCv_Tt-T 


ATGCGATTTT GCATGATAGC AATGCCTTGG CGACTCGTAT 


CTTGGAGAGG 


3840 




CTTATGAAGA 


CAAGAAAGAT 


CAGCTCAAGA 


TTGCTGCTCT 


TCGTCGAAAT 


3900 


ft m ft ^ ft ft 


GGGCAGGCTG 


GACTCGTGAA 


GATCTCAAGG 


CTTTACGCCA 


ACGCCATCCT 


3960 


ft /"*» IV, T> H/ r, 0'TV> 

AC A\j1 Auv, 1 v» 


ACAAGCAAAA 


TTCTATGGCC 


AATATGATGG 


GCATGCCGCA 


GACTCCTAGT 


4020 




AGGATTATAC 


GCATGATTTG 


ACAGAGCAAG 


CGCGTTCTGG 


CAAGTTAGAA 


4080 




GTCGGGACAA 


GGAAATCTCA 


CGTATGATTC 


AAATCTTGAG 


CCGGAAGACT 


4140 




CTGTCTTGGT 


TGGGGATGCT 


GGTGTCGGGA 


AAACAGCTCT 


GGCGCTTGGT 


4200 


CTTGCCCAGC 


GTATTGCTAC 


TGGTGACGTG 


CCTGCGGAAA 


TGGCTAAGAT 


GCGCGTGTTA 


4 i, OU 


GAACTTGATT 


TGATGAATGT 


CGTTGCAGGG 


ACACGCTTCC 


GTGGTGACTT 


TGAAGAACGC 


4320 


ATGAATAATA 


TCATCAAGGA 


TATTGAAGAA 


GATGGCCAAG 


TCATCCTCTT 


TATCGATGAA 


4380 


CTCCACACCA 


TCATGGGTTC 


TGGTAGCGGG 


ATTGATTCGA 


CTCTGGATGC 


GGCCAATATC 


4440 


ttgaaaccag 


CCTTGGCCCG 


TGGAACTTTG 


AGAACGGTTG 


GTGCCACTAC 


TCAGGAAGAA 


4500 


tatcaaaaac 


ATATCGAAAA 


AGATGCGGCA 


CTTTCTCGTC 


GTTTCGCTAA 


AGTGACGATT 


4560 


GAAGAACCAA 


GTGTGGCAGA 


TAGTATGACT 


ATTTTACAAG 


GTTTGAAGGC 


GACTTATGAG 


4620 


AAACATCACC 


GTGTACAAAT 


CACAGATGAA 


GCGGTTGAAA 


CAGCGGTTAA 


GATGGCTCAT 


4680 
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CGTTATTTAA CCAGTCGTCA CTTGCCAGAC TCTGCTATCG ATCTCTTGGA TGAGGCGGCA 4740 

CCAACAGTGC AAAATAAGGC AAAGCATGTA AAAGCAGACG ATTCAGATTT GAGTCCAGCT 4800 

GACAAGGCCC TGATGGATGG CAAGTGGAAA CAGGCAGCCC AGCTAATCGC AAAAGAAGAG 4860 

GAAGTACCTG TCTACAAAGA CTTGGTGACA GAGTCTGATA TTTTGACCAC CTTGAGTCGC 4 920 

TTGTCAGGAA TCCCAGTTCA AAAACTGACT CAAACGGATG CTAAGAAGTA TTTAAATCTT 4980 

GAAGCAGAAC TCCATAAACG GGTTATCGGT CAAGATCAAG CTGTTTCAAG CATTAGCCGT 5040 

GCCATTCGCC GCAACCAGTC AGGGATTCGC AGTCATAAGC GTCCGATTGG TTCCTTTATG 5100 

TTCCTAGGGC CTACAGGTGT CGGGAAAACT GAATTAGCCA AGGCTCTGGC AGAAGTTCTT 5160 

TTTGACGACG AATCAGCCCT TATCCGCTTT GATATGAGTC AGTATATGGA GAAATTTGCA 5220 

GCTAGTCGTC TCAACGGAGC TCCTCCAGGC TATGTAGGAT ATGAAGAAGG TGGGGAGTTG 5280 

ACAGAGAAGG TTCGCAATAA ACCCTATTCC GTTCTCCTCT TTGATGAGGT AGAGAAGGCC 5340 

CACCCAGATA TCTTTAATGT TCTCTTGCAG GTTCTGGATG ACGGTGTCTT GACAGATAGC 5400 

AAGGGACGCA AGGTCGATTT TTCAAATACC ATTATCATTA TGACATCGAA TCTAGGTGCG 5460 

ACTGCCCTTC GTGATGATAA GACTGTTGGT TTTGGGGCTA AGGATATTCG TTTTGACCAC 5520 

GAAAATATGG AAAAACGCAT GTTTGAAGAA CTGAAAAAAG CTTATAGACC GGAATTCATC 5580 

AACCGTATTG ATGAGAACGT GGTCTTCCAT AGCCTATCTA GTGATCATAT GCAGGAAGTG 5640 

GTGAAGATTA TGGTCAAGCC TTTAGTGCCA AGTTTGACTG AAAAAGGCAT TGACTTGAAA 5700 

TTACAAGCTT CAGCTCTGAA ATTGTTAGCA AATCAAGGAT ATGACCCAGA GATGGGAGCT 5760 

CGCCCACTTC GCAGAACCCT GCAAACAGAA GTGGAGGACA AGTTGGCAGA ACTTCTTCTC 5820 

AAGGGAGATT TAGTGGCAGG CAGCACACTT AAGATTGGTG TCAAAGCAGG CCAGTTAAAA 5880 

TTTGATATTG CATAAAAGAA TAAAAGTATC AGCATCTGAC CATAAGTCAC AGTGGAGTGA 5940 

AATTCAATGA AAATCAAAGA GCAAACTAGG CAGCTACCCG CAGGTTGCTC AAAACACTGG 6000 

TTTGAGGTTG CAGATAGAGC TGACGTGGTT TGAAGAGATT TTCGAAGAGT ATGAAACTAA 6060 

AACCTATAGC TTCTAAACGA TCCGTGGTTT TCATCATTCA ACACAAAATT CATATGTTTA 6120 

TTACCCTCCG TCGTATTTGT CTTAGAGCGT GTGTAGTACA AAAAGAGCAG TCTTATCTGA 6180 

AATTTTTATT CTTTCAAAAG AGACCTGTTT CTTTTTTGCA TGTCAAATCC GTTCTAGCTG 6240 

GTATTTGAAA AATCAAACTA ATATTCAATG AAAATCAAAG AACAAACTAG GAAGCTAGCC 6300 

GCAGGTTGCT CAAAACACTG TTTTGAGGTT GTAGATAGAG CTGACGTGGT TTGAAGAGAT 63 60 

TTTCGAAGAG TATAAGCTGC AAGATGAATG ATTTTCTTGT ATTGACGTTG TTGTTGACAA 6420 
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AAAGTAGCGG 


ATAAATGAAA 


TCCATTCCAT 


TATCATAGAT 


GATAGGCTGG 


TAGGAAATTT 


6480 


TCAAATAGCA 


TACAGGAAAT 


AGATGTATGG 


AGTTCTGGTA 


GTAGAAAGGG 


AGAGAGATGA 


6540 


ACATTTTAGT 


TGCAGATGAC 


GAGGAAATGA 


TTAGAGAAGG 


AATTGCAGCA 


TTTCTGACAG 


6600 


AAGAGGGTTA 


TCATGTCATT 


ATGGCTAAGG 


ATGGACAAGA 


GGTCTTGGAA 


AAATTTCAAG 


6660 


ATCTCCCTAT 


CCATCTCATG 


GTACTGGATT 


TAATGATGCC 


TAGGAAGAGT 


GGTTTTGAAG 


6720 


TGTTAAAAGA 


AATCAATCAA 


AAGCACGATA 


TTCCTGTCAT 


CGTCTTGAGT 


GCTCTGGGAG 


6780 


ATGAAACTAC 


TCAGTCACAG 


GTATTTGATC 


TCTATGCTCA 


TGATCATGTG 


ACAAAACCTT 


6840 


TTTCTTTGGT 


ACTGCTTGTC 


AAGCGTATTA 


AGGCGCTTAT 


CAGACGTTAC 


TACGTCATAG 


6900 


AGGATCTTTG 


GCGATATCAG 


GATGTAACAG 


TGGATTTTAC 


CTCTTACAAA 


GCACATTATA 

• • W ■ Mr Mr • » 4 #1 


6960 

V m* W 


AAAATGAAGA 


AATTGATCTC 


AAACCAAAGG 


AATTACTGGT 


ACTAAAGTGT 


TTGATTCAGC 


7020 


ATAAAAATCA 


AGTTTTAAGT 


AGAGAGCAGA 


TATTGGAAGA 


AATTTCAAAA 


GATGTAGCTG 


7080 

f W 9 w 


ATTTACCTTG 


TGATAGGGTC 


GTTGATGTCT 


ATATTCGTAC 


TCTTCGCAAA 


AAATTAGCTT 


7140 


TAGATTGTAT 


CGTGACTGTG 


AAAAATGTTG 


GGTATAAGAT 


TAGCTTATGA 


TAAAAAATCC 


7200 


TAAATTATTA 


ACCAAGTCTT 


TTTTAAGAAG 


TTTTGCAATT 


CTAGGTGGTG 


TTGGTCTAGT 


7260 


CATTCATATA 


GCTATTTATT 


TGACCTTTCC 


TTTTTATTAT 


ATTCAACTGG 

* ■ m Mi mm m m\ UW 


AGGGGGAAAA 


7320 


GTTTAATGAG 


AGCGCAAGAG 


TGTTTACGGA 


GTATTTAAAG 


ACTAAGACAT 

W m>^m9 m> m W m^mrm V Mt 


CTGATGAAAT 


7380 


TCCAAGCTTA 


CTCCAGTCTT 


ATTCAAAGTC 


CTTGACCATA 


TCTGCTCACC 


TTAAAAGAGA 


7440 


TATTGTAGAT 


AAGCGGCTCC 


CTCTTGTGCA 


TCACTTGGAT 


ATTAAAGATG 

• • mi m* W mm HWfl * Mr 


GAAAGCTATC 


7500 


AAATTATATC 


GTGATGTTAG 


ATATGTCTGT 


TAGTACAGCA 


GATGGTAAAC 


AGGTAACCGT 


7560 

r mr V V 


GCAATTTGTT 


CACGGGGTGG 


ATGTCTACAA 


AGAAGCAAAG 


AATATTTTGC 


TTTTGTATCT 


7620 


CCCATATACA TTTTTGGTTA 


CAATTGCTTT 


TTCCTTTGTT 


TTTTCTTATT 

• ™ * m. mm* mi m m Mt Mt 


TTTATACTAA 


7680 


ACGCTTGCTC 


AATCCTCTTT 


TTTACATTTC 


AGAAGTGACT 


AGTAAAATGC 


AAGATTTGGA 


7740 


TGACAATATT 


CGTTTTGATG 


AAAGTAGGAA 


AGATGAAGTT 


GGTGAAGTTG 


GAAAACAGAT 


7800 


TAATGGTATG 


TATGAGCACT 


TGTTGAAGGT 


TATTTATGAG 


TTGGAAAGTC 


GTAATGAGCA 


7860 


AATTGTAAAA 


TTGCAAAATC 


AAAAGGTTTC 


CTTTGTCCGC 


GGAGCATCAC 


ATGAGTTGAA 


7920 


AACCCCTTTA 


GCCAGTCTTA 


GAATTATCCT 


AGAGAATATG 


CAGCATAATA 


TTGGAGATTA 


7980 


CAAAGATCAT 


CCAAAATATA 


TTGCAAAGAG 


TATAAATAAG 


ATTGACCAGA 


TGAGCCACTT 


8040 


ATTAGAAGAA 


GTACTGGAGT 


CTTCTAAATT 


CCAAGAGTGG 


ACAGAGTGTC 


GTGAGACCTT 


8100 


GACTGTTAAG 


CCAGTTTTAG 


TAGATATTTT 


ATCACGTTAT 


CAAGAATTAG 


CTCATTCAAT 


8160 


AGGTGTTACA 


ATTGAAAATC 


AATTGACAGA 


TGCTACCAGG 


GTCGTCATGA 


GTCTTAGGCC 


B220 
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ATTGGATAAG GTTTTGACAA ACCTGATTAG TAATGCAATT AAATATTCAG ATAAAAATGG 8280 

GCGTGTAATC ATATCCGAGC AAGATGGCTA TCTCTCTATC AAAAATACAT GTGCGCCTCT 8340 

AAGTGACCAA GAACTAGAAC ATTTATTTGA TATATTCTAT CATTCTCAAA TCGTGACAGA 8400 

TAAGGATGAA AGTTCCGGTT TCGGTCTTTA CATTGTGAAT AATATTTTAG AAAGCTATCA 84 60 

AATGGATTAT AGTTTTCTCC CTTATGAACA CGGTATGGAA TTTAAGATTA GCTTGTAGAC 8520 

AGATTAGTTT TTTATTAAAG TTCATATAGG GTTAACATAA GTGTGTTATT CTTTGTGTAG 8580 

ATAAAAGAAA GGATACTAAT ATGGTATTAG CGATTATTTT AGTAACATTC TTTATTCGAT 8640 

TGATTTTTTT AAAGCGTTCG ATAGAGAATG AGAAACGAAT CCTTAGCAAT GGCGGGG 8697 



(2) INFORMATION FOR SEQ ID NO: 124: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

AACCATACAT ACGGCAAGGC AAAGCTGACG CGGTTTGAAG AGATTTTCGA AGAGTATTAG 60 

TTGCCTTTAA AGGCATCCAC CATCGTTTGA AATTCTTCAT TTGAGAGAGT AATCCCTTTG 120 

CCCATTTTAG TATGGTCTGG ACTCCAAGCA CGAATATCAA ACTTTGCAGG GGCACCATTA 180 

AAGCTCACAC GGTTAATTTC CTTGGTCCAA CCTTTTTCGT TTTCAGAAAG AGTCAACAAG 240 

TGCTCTTCGA TTTCAAATGT AAATTCTGCC ATTTTCTTCT CCTTTTTTAG TTTCATTAGT 300 

TTATTCGTAA AATCTTGTAG ATTTTAGGAA AATTTTATAT AATATTGATA TAAAAGAAGG 360 

GAGGCCAATA TGAGACATAA ATTCCAGCAA GTTCTAAATA AAATACATGA TTTTTTAAAT 420 

GGATATGACC AACCTGACCA GACTGAAACC AACTCCCTTA CAGCCACTAT TGAAGAGGCT 480 

ATCCAGAAAC AAACCGCTGT TCACCTTATC TTGTCTGAGA CAAGCTTTAC AGGTGACATC 540 

ATCAAATATG ATCAGCAAGG CCAGCAAATT ATCGTGAAAA ATTTTTCCAA AAATGTGACC 600 

CGGATTATCC GTATAAGCGA TATTCAACGC CTGCGATTTG TCCCCTCAAC TGTCCAAACA 660 

GCCCAAAAAA ATAGATTTAA GAAAGAGTGA GATGTAGTTG CTTCATCCCA CTCTTTTTTC 720 

TTAGCGAATT TGTTCAAAAT GTAAATGAAC TGCGATATGA TCTCCATAAC CACTTCTTTC 780 

CAAGTCACGT TGTAAACGAT AGGAAATGTA GTGTTCTGCA ATGGTAATGT AACCTGCGCC 840 

CAATAAACGA TGTTCAACCA TAGATTGAAT CATACTGATA GTCGCACGTT CCACCTTGGC 900 
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TTCTTGTAAA 


TCCAAAACTA 


CCTTCTTAGT 


GACTTGAGCA 


AGATTTTGAC 


GCAAATCATC 


960 


TGTCAAAACA 


TAAACAGTTT 


GGGCTGCCTT 


CAAGATGGCT 


TGGTAAATCT 


TATCTGGATT 


1020 


AAATTCAGCA 


ATTTCGCCAT 


TACGTTTGAT 


TACTTGCATA 


GGTTTCTCCT 


TTATTCTTTG 


1080 


TTTTCTTTGA 


TTTCTGCCAG 


CATTTTTTCT 


TCTTCTACTG 


TCAGTTGATA 


ATGTTCAAGT 


1140 


AAATCCGGTC 


TGCGCTCGTA 


GGTTTTCTTT 


AAACTCTCGT 


ACAATCGCCA 


CTGACGAATC 


1200 


TTTTCATGGT 


GGCCACTCAT 


CAATACATCT 


GGCACGACCA 


TGCCTCGATA 


ATCATAGGGA 


1260 


CGTGTGTACT 


GAGGATATTC 


TAAAAGACCT 


GAAGAAAAAC 


TATCATCTTG 


GTGGCTAGAC 


1320 


TCCTTGCCAA 


TCACTTCTGG 


AATCAGGCGA 


ACTGTAGCAT 


CAATCATGGT 


CATAGCTGCC 


1380 


AATTCTCCAC 


CAGTGAGGAC 


ATAGTCACCT 


AGGGAAATCT 


CATCTGTTAC 


CAAGCTCTTA 


1440 


ATGCGCTCAT 


CATAACCCTC 


ATAGTGCCCA 


CAGATAAAGA 


TTAGCTCTTC 


CTCTTG AG C P 


1500 

X JW 


AAATCTTCAG 


CATAAGCCTG 


ATCAAACTGC 


TTTCCAGCAG 


GATCAAGCAG 


AATAACGCGC 


X J wv 


GGATTTTTCT 


TTTCAATAGC 


ATCAAAGGAA 


TCGAAAATAG 


GTTGTCCTCT 


GAGCAACATG 


X O A v 


CCCTGACCGC 


CTCCGTAGGG 


CTCATCATCT 


ACATGACGGG 


CCTTTTCAGC 




1 fiflO 

4«OV 


AAATTATGAT 


ACTGGATATC 


CAAGAGCCCT 


TTTTCTCGAG 


CCTTTCCAAC 


GATTGACTGC 


1740 


TCCAGTGGAG 


AAAACATCTC 


TGGAAAGAGG 


GTTAAAATAT 


CAATCTTCAT 




1800 


TTCTAAGATT 


TCCACATCGA 


CCCGTTTACT 


TGGAATATCA 


ACATTGAGAA 


CCACTGGTGG 


I860 

• www 


GATATAAGGT 


AAAAGCAAAT 


CACGTTTGCC 

X* * * » X*^»X* 


TTTTCGTTTG 


ACCACCCAGA 

r\ w ^»*» w ^ nun 


CATC ATT AG C 


1920 


ACCTGGTTGC 


AGGATTTCCT 


TGATGGTTCC 


AACCAAGCTA 


TCACCCTCAT 


AG ACTT C C AA 


1980 


ACCGATAATC 


TCGTGATAGT 


AAAATTCACC 


ATCGTCTAGG 


TCATTCAAAT 


CTTCCTCAGC 


2040 


GACCTTGAGA 


CTGTATCCCT 


TGTACTTTTC 


GATAGTATTG 


ATATGGTACA 


TATCTTTCAA 


2100 


TTTAATAATG 


TCAAAGTTCT 


TCTGTTTACG 


GTGGCTAGCG 


ATGGTCACTC 


TTTCGACAAA 

ft ft ft WO^j 


2160 

4» A V V 


CTGATCTTTT 


TCATCAAACA 


AAACCAGCTC 


AGCTCCTTTT 


TTAAACCGTT 


CTTCTGCAAA 


2220 


ATCCGTCACA 


GACAAGACTC 


GCATCTCCCC 


CTGTAATCCC 


TGCGTATTAA 


CGATTTTCCC 


2280 


AACATTAAAG 


TAGTTCATCT 


TGTCTCCTGT 


AATCTCCTTT 


TTTCCATCTT 


ATTCTAACAA 


2340 


TTCTCGAATA 


ATAGCCGCAA 


TTTTTTCCGA 


TTCTGACCAT 


TGTAAATAAT 


GGTGATTCCC 


2400 


TCCTAAAATG 


AGTTTAGTAT 


TGGAAGTCCA 


ATATTCTGAT 


TCTCTGTACT 


CTTTTTCTCT 


2460 


ATAAGGCTGA 


CAAAAAACAA 


ATACAGGAAT 


ATGAGCTTCT 


ATAGATACAT 


CCTCAAAATC 


2520 


TTCCTCAGTA 


ATCTCTCCAG 


ATATCTGAAA 


TTCTGGATCT 


TGATTTTCCA 


ACTCTAACCC 


2580 


TTTTTCTTGC 


ATTAATTCCC 


AGATTTTTTT 


ATTCGTTTCA 


GGACTAAATG 


TTGCTTGAGT 


2640 


TAAGTTCTTA 


AAATAAAGTT 


CAGGACCACA 


CTCGTCAATC 


AGCCTCATCT 


GCTCTTCCAT 


2700 
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TTCTGGATAA GGATTTTCTG AAAAATCAGC AAACATGACT TTTTTAGTTG TCGGTTCAAT 2760 

TGCTACTAAA GTCTGACGCT TAATTGGTTT CTCGAGTAAT TTGCAAGCTA AAATTCCACT 2820 

CCAACTATGT GCACAAAGTA TATATTCAGA AATTCCTAAT TCTTCAAGTA CTTCATAAAC 2880 

CGCATCTGCA AGATTATCTA GATTTTTTCC AGCTTGGTCA TGAATCGGAC TCCTACCTGT 2940 

GTTCGGAAAA TCAATTGTCA AATAACCAAT TGTAGGAGGA GGTTTTTCAA GTATAAGTGA 3000 

AAAATTTTCA TAACTTGGTA GCAAACCTGC TCCGTTTAAA CAAACTAGCA CTTTCTTTTG 3060 

CTTTTGATAA GTAACAGAGA GGCTACCAAT TTCTGTAGAT ACTTCAAACC TCTTCATAAA 3120 

GAAATCCACT GATTCTATAT AATGAATTAT TAAAAATCCT TATCCTTTAT TTTATCACGT 3180 

TCCAAGGATT TTCTCAAGTT GGAGGAAGGG GACAATATCT CTACTTTCCC TTCAATAATC 3240 

CTTCCAAATT ATGTTTATGT TGGTAATTAA TGGCTGCGGT TTTGTCTTTC TCAAAGACAG 3300 

TCTTGGTAAG GTCAATATGA TTAATAGCTA CGATTGCGAC GGTGTAGTAA ATGATATCAG 3360 

CCAGTTCTCT GGCAAGTTCC TCGTTCGAAT CCTATCCCTT CTTTTCGACC AGAGCGCCTA 3420 

TTCAAAACCT CGACTACTTC TCCGACTTCC TCCACTAACT TCATAAAGAG ACCTTCATCA 3480 

GTCCGAGACT GCTGTTAATG TTCGATTAAG TAGTCTTGGA ATTGCCTAAA CGTTCAATCT 3540 

TTTATAGTAT ATTGAAACTA GAATAGTACA CCTTTACTTC TAAAACATTG TTAGAAATCG 3600 

ATTTGACTGT CCTGATCGAT TTGTCCTGTT CTTGTTTCAT TTTACTATAT CTTCTATTCC 3660 

ACACAAAAAA GCGAGACATC CGTCCCGCCC TTCTTATTTT TCGTCAATAA CGATTCTTAC 3720 

TTTTTTGTAT TCAGTTGGGA CAGAGTAGAC AATCGTTCTT ATCGCACAAA TAGTGCGACC 3780 

CTTACGACCG ATTACACGAC CCACATCGCT TTGATCAAGA TTCAAATGAT ATTCCAAAAA 3840 

TTCTGGTGTA TCCTCAATCT TGATAGTTAA GGCATCTGGT TGTGAAATTA AGGGTTTCAC 3900 

AATCGCAATA ATGAGATTTT CAATCCTATC CATCTGTCAA CCTACTTTAA ACTTATTTTC 3960 

AAAATTTAGA ATCGTGGAAT TTTTTCAATA CGCCTTCTTT TGAAAGGATG TTACGTACTG 4020 

TGTCTGAAGG TTGAGCTCCA TTAGCCAACC ATGCAAGAAC GCGGTCTTCT TTCAAAGTTA 4080 

CTTGGTTTTC AGCAACAAGT GGGTTGTAAG TTCCAACTGT TTCGATGAAA CGTCCGTCAC 4140 

GTGGTGAACG TGAATCTGCT ACGTTGATAC GGTAGAAACG TTTTTTCTTA GAACCCATAC 4200 

GAGTCAAACG GATTTTAACT GCCATTTTTA AAGTCTCATT TCTTTAATTT TTTATTTCGG 4260 

TGAAATAGCT GAGCTATTTA GCACATGTTC TATTATAGCA GATTTCTGGC ATGTGTC 4317 

(2) INFORMATION FOR SEC; ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4881 base pairs 
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{Bl TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xil SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

AATTTATTTG ACTGGAAATT GTAGAGGCTT CTCGAAATTT CTTGAATGGT TAAAATAAGG 60 

ACAAGAGAAA ACATGGATAT CTATATCCTT GTGCCAAAAA AACCACTGCC CTCCCCAGAC 120 

CAACCTGAGG AAAGCAGTGA TTCTTATTTT AGGAGTTAGG AATGAATACA CGAAATCAAT 180 

TTAGCTGATT ATTTTTTGTT TTTCAAGAAT TCATCGTATT GTTTTTGCAT TTCGTTCAAT 240 

ACTTTTTCGT AGGCACCTTC AGATTTCAAT TTTTCCATCA ATTCTGGAAT CGCTTTATCT 300 

GGGTCTACAG TACCAGTGTT GATAGCTGTA TCAAATTGTT GCATTGTGTT AGCAATAGCT 360 

GAGATTTCAG ATTTCACATT GTCAGTATTG AAGATAAATC CAAGCGCTGG AGATTCTTTA 420 

GCTTCTGCCA ATTCTTTCTT AGAATTTTCG ATTTGTTGGT CTGTAACGTT TTCGTTGATG 4 80 

TAAAGGATCC AGTTGTTACC AGTGTTCCAT CCACCCATGT GAGTGTTTCC TTTGTAGCCA 540 

TCAAGAACGC GAACACGGTT TTCTTTACCT TCAATTTTTT CCCAGTTCTT GCCTTCTGGA 600 

CCGTAAACAA GACCGTTCAA GAGTTCTGGG TTCGTATTCA AGAGGTTCAA GATTTCCATT 660 

GATTTTTCTT TGTTCTTAGA GTTGTTTGAG ATGACAAAGT TAGCAACTTG TGTTGTTTGG 720 

TTTTTCTTGA TGAAGTTAGT AATTGGTTTG ATTTGGATAT CTTTGTTGGC AACACGTCAA 780 

AGCAAGCTGT TACCGTAGTC AGCTGGTCCT ACTGTTTCTT CACGAACGAA CCAAGTATCT 840 

TGTTGAAGGT CAAAGGAAGT ATCCCTTGTT GCGACGTCTT TTGGAATGTA GCCAGCTTCA 900 

TAGAATTTGT GAAGAGTCTT CAAGTGTTCT TTGAAACGAG GCACTTCGTA ACGGTTTACA 960 

ACTTTAGTAG TATCGCCTTC AAGGTCGATA ACGAATGGAA GACCGTTTGC TACTGGGTAG 1020 

TCAAAATTAT CAGATGGGAT GAAAACTTTA CCAATAGCAA ATGGTACTAC GTCTGGAGCT 1080 

TTTTCTTTGA TT T G TTTCAA GACTGGCTCA AGACTTTCGT AAGAAGTAAC ACCTGAAATA 1140 

TCGATACCAT ATTTAGCAAG GAGAGTTCCG TTGAAGGCAA AGTTTTGAGA TGATGCAACG 1200 

TTGGCTGCAA CTGGAACAGC GTAAATCTTA CCATTTACAG TATTACCCTT GATGTAAGCT 1260 

GGGTCAAGTG CTTTGTAAAG GTCTTTACCT TCTTTTTTGT ACAATTCTGT CAAGTCAGCG 1320 

TAAGCACCTT TTTGAGCATT TACAATATAG TTATCTGCAA AGGCAATATC ATAGTTTTCA 1380 

CCAGATGATG TGATAACTGA CATTTTCTTA CCATAGTCAC CCCAGCCAAG GTATTGGATA 1440 

TCCAATTTGG CACCAACTTT TTCTTCAATG ATTTTGTTGG CATTTGCTAA CAATTCATCC 1500 

AAGTTGTCTG GTTTGTCACC GATTTGGTAC ATTTTGATAA CAGGTTTGTC ACCTGAATCA 1560 
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GC AG CTTTTT TGCTGTTACC TGTCAAATTT CCACAAGCAG CAAGACCTGC AGCCAGAGCG 1620 

ACTACACTAG CAGATGCAAA AGCATATTTT TTCCAGTTTT TCATGATAAA AACTCCTTTT 1680 

TTTATTTTTA AACTTATAAA CAATGTAATC ATCTTATACT CAATAAAAAT CAAAGAGCAA 1740 

ACTAGAAAAC TAGCCGCAGG CTGCTCAAAG CACTGCTTTG AGGTTGTAGA TAAGACTGAC 1800 

GAAGTCAGTT ACATATATCT ACGGCAAGGC GACGTTGACG CGGTTTGAAT TTGATTTTCG I860 

AAGAGTATTA ACTTCACACA AGGGAAGTTG GGAACTGAGA AATGTTATTT CTCAATAAGC 1920 

ACTATTCTTT CACACCACCG ATAGTCAAAC CTTTTACAAA GTAGCGTTGG AAAAATGGAT 1980 

ACAAAATCGC GATTGGAAGG GTTGCAACCA CAACCATGGC CATACGACCT GTTTCTTTCG 2040 

GTACAGCAAC TCCCAGTTGA CCAATCAAGC CGACCGCTTT GGCAATGTAG TCCATATTTT 2100 

GTTGGATTTG CATGACCAAA TATTGCAATG GATACAAGTT GTCACTCTTG ATGTAAAGAA 2160 

GGGCGTTGAA CCAGTCATTC CAGAAACCAA GAGCTGTTAA GAGCGTGATG GTTGCGATAC 2220 

CTGGTAGTGA CAATGGCAAA CAGATTTGGA AGAAAATCCG GGCCTCACTG GCACCATCGA 2280 

TACGAGCCGA TTCTAGAATG GCTTCTGGAA TGGTCTTCTT GAAGAAGGAA CGCATCAAGA 2340 

TGATGTTAAA TGGTGAGAGA AGCATTGGAA CAATCAAGGC CCAAACAGTG TCACCAAGCT 2400 

GAAGTACACG GGTCACCATG ATATAACCTG GTACCAAACC AGCGTTGAAC AACATACTGA 2460 

GAAGGACGAA GATGGTAAAG AATCTGCGAT ACTTAAAGGT TGTCCGTGAA ATAGCGTACG 2520 

CATAGGTTGT TGTGATAAAG ACATTTGTCA ATGTCCCAAC TACGGTTACA AAGACAGAGA 2580 

TGAAGAGGGC TTGTAGGATT TTATCCTTAA ACTGTGCCAA AAACTCAAAA CCGTCTAAGC 2640 

CAAATTGGGA TGGGAAGAAG CTATAGCCGT ATTGGAGGAG GCTTTTCTCG TCTGTCACTG 2700 

AAATAATGAT AACGAATACA AAAGGTAGGA TACAAGAGAG GGCAATCAAA CCCGAAATGA 2760 

TACTGAAGAA GATATCTGCT TTCTTACTGA AGGAGTGAAT GCCGACATTA TCAATTTTTT 2820 

CTTTTTTAAT TTTCTTTTTT GCCATATTCT CCTCCTTTCT AGAACAAAGC TGAGTTTGGA 2880 

TCGACTCGTC TTGCAAGCAA GTTTGATAGG ATAACCAGAA TCAAACCAAC AACGGATTGG 2940 

TAAAGACCGG CTGCTGCAGC CATACCGATA TCTGCTGTCT GAGTCAAACC ATTAAAGACA 3000 

TATACGTCCA AAACGTTGGT TACATTGTAA AGCTGACCAG CATTGTGTGG GATTTGATAG 3060 

AAGAGACCGA AGTCTGCGCG GAAGATATTT CCGACTGCAA GGATCGTCAA TACAGTTACA 3120 

AGCGGAGTCA ACTGAGGAAT GGTTACGTTG CGAATACGTT GCCACTTGCT AGCTCCGTCC 3180 

ACTGTCGCTG CTTCGTAGTA GGTTGGATCA ATTCCCATGA TCGTCGCATA GTACATGACA 3240 

CTGCTATATC CAAAGCCTTT CCAAATACCT AGGAAAAGTA GGAGATAGGG CCAGATGCCC 3300 
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AGGTCAGCGT 


AGAAATTGAC 


TTCTTTGAGA 


CCAAGACTTT 


CCAATAGATG 


ATTGAACACC 


3360 


CCTTTATCAA 


TATTTAGGAA 


GGCATCTGTA 


AAGAAACTGA 


TGATAACCCA 


AGACAAGAAG 


3420 


TAAGGGAACA 


ACATAGAAGT 


TTGAAAAATC 


TTCACCATTC 


TCTTAGAACG 


GAGCTCGCTG 


3480 


ACGATAATGG 


CAATCCCTAC 


AGATACAACT 


AAACCTAGAA 


AGATAAAGCC 


AAGATTGTAG 


3540 


AGGACAGTAT 


TTCOTGTGAT 


AATAAAGGCG 


TCTCTTGAAC 


TAAATAAGAA 


TCTAAAATTA 


3600 


TCGAGTCCGA 


CCCATTTACT 


ATTTATGATA 


CTATCTATGA 


AACCATTACT 


GGTCATGTGG 


3660 


TAGTCTTTGA 


AGGCAACCAC 


GTTCCCAAAT 


ACTGGAATGT 


AAAAGAATAG 


AATCAACCAG 


3720 


AGTGCCCCTG 


GCAAAACCAT 


CAAGAGAAAG 


ATCCAGTTGT 


CTCTCAATGT 


TTTTGAAAAC 


3780 


TTTTTCATAA 


TTTCCTCCCT 


TTTTATTTTG 


ATATCCATCT 


AAAAATTCTT 


TTTTAGACTT 


3840 


TTGATAACGA 


TT AC ATT ATT 


AGTATACTCC 


TATTTGCAGG 


TTAGGTTAAA 


CTCCTAATTA 


3900 


TAGAAAAAAC 


TCCACAAATT 


ATGTAGCAGA 


TTTAAAACTT 


TATCACCACT 


ATCAAACAAA 


3960 


TGTCCTAAAT 


CAATTGTTTA 


TTTTATCTCT 


ATTAGCCCAG 


TGATGGCGTC 


ACTCTGTTAT 


4020 


AAGCATCCAA 


CAACGGGGTA 


TACTGAAAAA 


TCTCCAGACT 


AGGGAACTCA 


GCGATAGTTC 


4080 


CTAATCTGGA 


GATTTTTAAT 


ATGTTATTAG 


GCGTTTGCTT 


TCAACTTAGC 


AATAACCTCT 


4140 


TTAAGATTAT 


CAATCAACTC 


TGCTGCAGTA 


TGCTCAGAGC 


CTTTTTCATC 


TGCCAAGAAC 


4200 


AAAACTGCTT 


TTTGAAGTTC 


TTTTTGAGAG 


TTTTCAAGGA 


CATCCTTATC 


TACTGTTTCA 


4260 


AGGTTTGAGT 


CTTTAAGAAG 


TTTACTTAAT 


TCCTTGGCTA 


ATTTCTTGAG 


TTTGATTTGC 


4320 


AGACTCATCT 


TCTCCTGCTG 


TTTCTTTGCC 


CGCTGTTTGT 


CCTCCATCCT 


TAGTTGCTGA 


4380 


CTGGCTTTCC 


TTAATGGACT 


CTAGGGAAGC 


AATGGCATCT 


TTGACTGTTT 


GCAAGATATC 


4440 


ACGTAAACCT 


TGCTCTGTCA 


AACTATCATC 


TGCAAAAGCT 


TTATTACCCT 


CTCCCAAAAC 


4500 


CAGACGTGCT 


GAATCTGTGG 


TAGGATTCGA 


TACACCTGTC 


AATGATCTCA 


AAAGATTTTC 


4560 


TAAGGTTTGA 


GTCTGCTTAC 


TAATACTAGA 


CTAAAATCAA 


AAAGTATTAT 


ATAACAGTGA 


4620 


TATGAAATCA 


ACTAAAGAAG 


AAATCCAAAC 


CATCAAAACA 


CTTTTAAAAG 


ACTCTCGTAC 


4680 


AGCTAAATAT 


CATAAACGCC 


TTCAAATCGT 


TCTATTTTGT 


CTGATGGGCA 


AATCTTATAA 


4740 


AGAGATTATA 


GAACTTTTAT 


AGTAGTTTGA 


AATAAGATGT 


GAACATCTCT 


ATCAGGAAAG 


4800 


TCAAATTAAT 


TTATAGAAAT 


ATTTTAGCAG 


CCAAGGTGTA 


CTGTTATAGA 


TTCAATACAC 


4860 


TATACTTGGT 


GGTTTAGCTC 


G 








4881 


(2) INFORMATION FOR SEQ 10 NO: 126: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH ; 13121 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


126: 






AGGATCCCCG 


GAAAAGGAGA 


CTAAAAATGA 


AGAAAAAATT 


TCTAGCATTT 


TTGCTAATTT 


60 


TATTCCCAAT 


TTTCTCATTA 


CGTATTGCCA 


AAGCAGAAAC 


GATTAAGATT 


GTTTCTGATA 


120 


CCGCCTATGC 


ACCTTTTGAG 


TTTAAAGATT 


CAGATCAAAC 


TTATAAAGGA 


ATTGATGTTG 


180 


ACATTATTAA 


CAAAGTCGCT 


GAGATTAAAG 


GCTGGAACAT 


TCAGATGTCC 


TATCCTGGAT 


240 


TTGACGCAGC 


AGTCAATGCG 


GTTCAAGCTG 


GGCAAGCCGA 


CGCTATCATG 


GCAGGGATGA 


300 


CAAAGACTAA 


AGAACGTGAA 


AAAGTCTTCA 


CCATGTCTGA 


TACTTACTAT 


GATACAAAAG 


360 


TTGTCATTGC 


TACTACAAAG 


TCACACAAAA 


TTAGCAAGTA 


CGACCAATTA 


ACTGGCAAAA 


420 


CCGTTGGTGT 


TAAAAACGGA 


ACTGCCGCTC 


AACGTTTCCT 


TGAAACAATC 


AAAGATAAAT 


480 


ACGGCTTTAC 


TATTAAAACA 


TTTGACACTG 


GTGATTTAAT 


GAACAACAGC 


TTGAGTGCTG 


540 


GTGCCATCGA 


TGCCATGATG 


GATGACAAAC 


CTGTTATCGA 


ATATGCCATT 


AACCAAGGTC 


600 


AAGACCTCCA 


TATTGAAATG 


GATGGTGAAG 


CTGTAGGAAG 


TTTTGCTTTC 


GGTGTGAAAA 


660 


AAGGAAGTAA 


ATACGAGCAC 


CTGGTTACTG 


AATTTAACCA 


AGCCTTGTCT 


GAAATGAAAA 


720 


AAGATGGTAG 


TCTTGATAAA 


ATTATCAAGA 


AATGGACTGC 


TTCATCATCT 


TCAGCAGTGC 


780 


CAACTACAAC 


TACTCTCGCA 


GGATTAAAAG 


CTATTCCTGT 


TAAGCCTAAA 


TATATCATTG 


840 


CCAGCGATTC 


TTCTTTTGCC 


CCTTTTGTTT 


TCCAAAATTC 


AAGCAACCAA 


TACACTGGTA 


900 


TTGATATGGA 


ATTGATTAAG 


GCAATCGCTA 


AAGACCAAGG 


TTTTGAAATT 


GAAATCACCA 


960 


ACCCTGGTTT 


TGATGCTGCT 


ATCAGTGCTG 


TCCAAGCTGG 


TCAACCCGAT 


GGTATCATCG 


1020 


CTGGTATGTC 


TGTCACAGAT 


GCTCGTAAGG 


CAACTTTTGA 


CTTCTCAGAA 


TCATACTACA 


1080 


CTGCTAATAC 


CATTCTTGGT 


GTCAAAGAAT 


CAAGCAATAT 


TGCTTCTTAT 


GAAGATCTAA 


1140 


AAGGAAAGAC 


AGTCGGTGTT 


AAAAACGGAA 


CTGCTTCTCA 


AACCTTCCTA 


ACAGAAAATC 


1200 


AAAGCAAATA CGGCTACAAA 

} 


ATCAAAACCT 


TTGCTGATGG 


TTCTTCAATG 


TATGACAGTT 


1260 


TAAACACTGG 


TGCCATTGAT 


GCCGTTATGG 


ATGATGAACC 


TGTTCTCAAA 


TATTCTATCA 


1320 


GCCAAGGTCA 


AAAATTGAAA 


ACTCCAATCT 


CTGGAACTCC 


AATCGGTCAA 


ACAGCCTTTG 


1380 


CCGTTAAAAA 


AGGAGCAAAT 


CCAGAACTGA 


TTGAAATGTT 


CAACAACGGA 


CTTGCAAACC 


1440 


TTAAAGCAAA 


CGGTGAATTC 


CAAAAGATTC 


TTGACAAATA 


CCTACCTAGC 


GAATCTTCAA 


1500 


CTGCTTCAAC 


AAGTACTGTT 


GACGAAACAA 


CGCTCTGGGG 


CTTGCTTCAA 


AACAACTACA 


1560 
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AACAACTCCT TAGCGGTCTT GGTATCACTC TTGCTCTAGC TCTTATCTCA TTTGCTATTG 1620 

CCATTGTCAT CGGAATTATC TTCGGTATGT TTAGCGTTAG CCCATACAAA TCTCTTCGCG 1680 

TCATCTCTGA GATTTTCGTT GACGTTATTC GTGGTATTCC ATTGATGATT CTTGCAGCCT 1740 

TCATCTTCTG GGGAATTCCA AACTTCATCG AGTCTATCAC AGGCCAACAA AGCCCAATTA 1800 

ACGACTTTGT AGCTGGAACC ATTGCCCTCT CACTCAATGC GGCTGCTTAT ATCGCTGAAA 1860 

TCGTTCGTGG TGGTATTCAG GCCGTTCCAG TTGGCCAAAT GGAAGCCACC CGAAGCTTGG 1920 

GTATCTCTTA TGGAAAAACC ATGCGTAAGA TTATCTTGCC ACAAGCAACT AAATTGATGT 1980 

TGCCAAACTT TGTCAACCAA TTCGTTATCG CTCTTAAAGA TACAACTATC GTATCTGCTA 2040 

TCGGTTTGGT TGAACTCTTC CAAACTGGTA AGATTATCAT TGCTCGTAAC TACCAAAGTT 2100 

TCAAGATGTA TGCAATCCTT GCTATCTTCT ATCTTGTAAT TATCACACTT TTGACTAGAC 2160 

TAGCGAAACC CTTAGAAAAG AGGATTCGTT AATGGCAAAA TTAAAAATTG ATGTAAATGA 2220 

TTTACACAAG CACTATGGAA AAAATGAACT CCTAAAAGGA ATTACGACTA ACTTCTATGA 2280 

AGGAGATGTT GTTTGTATCA TCGGTCCTTC AGGTTCTGGT AAGTCAACTT TCCTCCGTAG 2340 

CCTCAATCTT TTAGAAGAAG TCACTAGCGG TCACATCACT GTGAACGGCT ATGATTTAAC 2400 

TGAAAAAACA ACCAATGTTG ACCACGTCCG TGAAAATATC GGCATGGTAT TCCAACACTT 2460 

CAACCTCTTC CCTCATATGT CTGTATTGGA CAACATCACC TTTGCTCCTA TTGAGCACAA 2520 

GTTGATGACT AAGGAAGAAG CTGAGGAATT GGGAATGGAG TTGCTTGAAA AGGTTGGACT 2580 

AGCAGATAAA GCTAATGCCA ATCCAGATAG CCTATCAGGT GGTCAAAAAC AACGTGTGGC 2640 

CATCGCTCGT GGCCTAGCAA TGAATCCAGA CATCATGCTC TTCGATGAAC CAACTTCTCC 2700 

CCTTGACCCT GAGATGGTTG GAGACGTACT TAACGTTATG AAGGAATTGG CTGAGCAAGG 2760 

CATGACCATG ATTATCGTAA CCCATGAGAT GGGATTTGCT CGTCAGCTTG CCAACCGCGT 2820 

TATCTTTACT GCAGATGGCG AGTTCCTTGA AGACGGAACA CCTGACCAAA TCTTTGATAA 2880 

CCCACAACAC CCTCGTCTGA AAGAGTTCTT AGATAAGGTC TTAAACGTCT AAACTCAAAC 2940 

TGTAAGGATT TCCTTGCAGT TTTTCTACCT CGTATTGGAA TTTTTGATTT TTCGGAAAAT 3000 

TATGTTAGAA TTAAGTTTAT GAAATGAGGT TTCCTCATAC CTAGCAAGAC TAGGAATAAA 3060 

AATAGAAATT AGGTAGCTAG ATGTCATCTA AGGTTATTGT TACAATTTTC GGTGCGAGTG 3120 

GAGACCTGGC TAAACGCAAG CTCTACCCTT CCCTTTTTAG ACTATATCAA TCCGGCAATC 3180 

TTTCCAAGCA CTTTGCCGTT ATTGGAACTG CCCGTAGACC TTGGAGTAAG GAATATTTTG 3240 

AATCTGTAGT TGTCGAGTCC ATCCTTGATT TGGCAGATAG TACCGAGCAA GCCCAAGAAT 3300 

TTGCTAGCCA CTTCTACTAT CAAAGCCATG ATGTCAATGA TTCGGAACAT TATATTGCTT 3360 
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TCCGTCAATT ACAAGCTGAG CTTAATGAAA AATACCAAGC TGAACACAAT AAGCTCTTCT 3420 
TCTTGTCTAT GGCACCTCAG TTCTTTGGAA CCATTGCCAA ACACCTCAAA TCTGAAAACA 3480 
TTGTCGATGG CAAAGGTTTT GAGCGCTTGA TCGTTGAAAA ACCATTTGGT ACAGATTACG 3540 
CAACTGCAAG CAAGTTGAAT GACGAACTCC TAGCAACATT TGACCAAGAA CAAATTTTCC 3600 

GTATCGACCA TTATCTTGGT AAGGAAATGA TCCAAAGCAT CTTTGCAGTT CGCTTTGCAA 3660 

ACTTGATTTT TGAAAACGTT TGGAACAAGG ATTTTATCGA CAATGTTCAA ATTACCTTTG 3720 

CGGAGCGCTT GGGTGTAGAA GAACGTGGTG GCTACTATGA CCAATCCGGT GCCCTCCGTG 3780 

ACATGGTCCA AAACCACACT CTACAACTTC TTTCGCTCCT CGCCATGGAC AAACCAGCAA 3840 

GCTTCACAAA AGACGAGATT CGTGCTGAAA AGATTAAGGT CTTTAAAAAC CTCTATCATC 3900 

CAACTGATGA AGAACTCAAA GAACACTTTA TCCGTCGGCA ATACCGCTCT GGTAAGATTG 3960 

ATGGCATGAA ATACATCTCT TATCGTAGCG ACCCAAATGT GAATCCAGAA TCAACAACTG 4020 

AAACCTTTAC ATCTGGTGCC TTCTTTGTAG ACAGCGATCG ATTCCGTGGT GTTCCTTTCT 4080 

TTTTCCGTAC AGGTAAACGA CTGACTGAAA AAGGAACTCA TGTCAACATC GTCTTTAAAC 4140 

AAATGGATTC TATCTTTGGA GAACCACTTG CTCCAAATAT TTTGACCATC TATATTCAAC 4200 

CAACAGAAGG CTTCTCTCTT AGCCTAAATG GCAAGCAAGT AGGAGAAGAA TTTAACTTGG 4260 

CTCCTAACTC ACTTGATTAC CGTACAGATG CGACTGCAAC TGGTGCTTCT CCAGAACCAT 4 320 

ACGAAAAATT GATTTATGAT GTCCTAAATA ACAACTCAAC TAACTTTAGC CACTGGGATG 4380 

AAGTTTGTGC GTCATGGAAG TTGATTGACC GTATTCAAAA GCTCTGGGCT GAAAATGGTG 4440 

CCCCACTTCA TGACTATAAA GCTGGAAGCA TGGGACCTCA AGCCAGCTTT GACCTACTTG 4500 

AAAAATTCGG TGCCAAATGG ACTTGCCAAC CAGATATCAC CTATCGTCAA GATGGTCGCT 4560 

TAGAATAAAA AAATTTCCTG CAAGTTTATG CcTTGCAGGA TTTTTGCTTC TGATTACATT 4620 

AAACCTTCCA AGAGACCTTT CATAAAGTTT TCTGAGTTAA ACTCTCCAAT ATCATCGATT 4680 

TTTTCACCAA AACCAATCAA TTTTACAGGA ATATTGAGTT CTTCACGAAT GGCTAGAACC 4740 

ACACCTCCTC GAGCAGTTCC ATCAATCTTA GTCAAAACAA TTCCCGTTAA AGGTGTGATT 4800 

TTCGAAAATT CTTTGGCCTG TACTAGGGCA TTTTGACCTG TTGATGCATC AAGTGCCAAG 4860 

AAGGTTTCAT GTGGTGCTTC TGGCACAACA CGTTTGATAA TACGACCAAT CTTTTCCAAC 4920 

TCAGCCATAA GGTTATCCTT ATTTTGCAGA CGACCAGCAG TATCAATCAT GAGAATATCG 4980 

ATACCTTCAG TCACGGCACG TTCCATACCA TCAAAGACCA CGCTGGCTGG ATCAGCTTTT 5040 

TCAGGTCCAG TTACTACTGG AACATCTACT CGTCGGCCCC ATTCAGCTAG CTGAGCTACT 5100 
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GCACCCGCAC GGAAGGTATC TGCTGCAACC AGCATGACCT TCTTACCAGC TTGTTTGTAG 5160 
CGGTGGGCTA GTTTTCCGAT AGAAGTTGTT TTCCCAACAC CATTCACACC AACAAAGAGC 5220 

ATAACTGTCA AGTTATCTTG GAAGTGGATG CTTTCATCGT AGCTACCATC CTTTTCATAA 5280 

AGCTCAACCA ATTTCTCAAT GATGACACGA CGAAGTACAT CAGGTTTCTT GGCATTTTCA 5340 

AGCTTGGCTT CGTAACGTAG TTCCTCCGTT AAGTTAGAAG CG7CCTTGGAC ACCAACATCA 5400 

CTCATAATCA GCAGTTCTTC CAGTTCCTCG AAAAATTCTT CGTCAACAGA GCGGAAGTTA 54 60 

GCAAAGAAGG CATTCAAGCG GGCACCGAAA CCTGTGCGAG TTTTCTTAAG ACTGCGGTCA 5520 

TATTTTTCCT GAACAGTTTC TTCTGTTTGA GGAGCTTCTG GTTCAAGCAC TTCAGAATTA 5580 

TTTTCTTCTA CAGTTCCTTC GTGCTCAAGC TTCTCTTCCT CTGGTAATTC TTCTGAGTTT 5640 

GGTAATTCTT CTATTTCTTC TTGAGAAACC CCTACAGCTG GCTCTGAATC CTGACTTTCT 5700 

TCAACTGTGT CTTGGATTTC CTCTTCTTGG AACACAGCTT GTTCAACAAT TTCAACCTCT 5760 

GCTTCTTCCT GAGAAACTTC CTCAACTTCT GTGAAGGTAG GATCAACATC TTCAGACAAA 5820 

TCAAGATTTT CCAGAGCTTC TTTTACAACT TCTTCGATTT TAGGTTCTTC TTTTTTTCCG 5880 

AATAGACGGT CAAACAATCC CATATCTTAG TTCTCCTTTA CCACATATTC TTCGATAGCC 5940 

CAGGCGACAG CTTCCTCATC GTTGGTCATC GGCGTCACTA CATTTGCGGC TGCCTTTACT 6000 

TCAGGAACAG CGTTTTGCAT AGCAACACCA AGACCTGCCC ATTCAATCAT AGAGAGGTCA 6060 

TTGGCCTCGT CACCACAAGC CATCACTTGA CTTTGGTCGA TTCCAAGATG GCTGATTAGT 6120 

TTTCCCAAAC CTGTTGCTTT ATGAACATTC TTTGGTGACC ATTCTAGCAA CATTTCACGT 6180 

GATTTAAAGA TTTCATATTG GTCAAACAAT TCTGGAGAAA TCTTCTGAAT GCCTGCATCC 6240 

AAGGGTTCTT GAGCAAAGGC AGTCACGCAT TTGTTGTAGG TCATTTGACT AGATAAGTCT 6300 

TCAAAGTCCA CTGGAACAAA GGTCAAAGCT GGATTGAATT TGGCATAAAG ACTTTCTTGG 6360 

TCCGATTGGA TTTGATAAAC TGTTCCTTCT GAGATGGCAT CAAGAGGCAG TGATAATTTC 6420 

TCTGTTTCTT CATACAAACG TGCCACATCA TCATATGAAA AGACTGTTTT ATCAAGGATT 6480 

TCTCCTGTAT TTTTCTGAAC TAATCCACCA TTAAAAGTAA TGGTATACTC ATCTTCCTCA 6540 

CCGTCAGTCC CTAACTCATG GAGAAAGAAA TCCATGGCTT TTAAGGGACG ACCAGTTGTC 6600 

AATACGACCT TGATACCACG ATCACGCGCA gCTTGCAAGG TTTCCTTGGT ACGATCCGTC 6660 

AGCCTTTTAT CAGTAGTCAG CAAGGTCCCG TCCAAGTCCA ATGCAATCAA TTTTATATCT 6720 

GCCATTATAA GCCCTCCATA TAAGCTATAA CCGACCGTTC CTTATGGTGA CCAATCACAG 6780 

TCTTTGCTAA TTCTAAAATT TCAGGTCGTG CATTTTCAGG AGCTACAGGA TGTCCCACAA 6840 

CCTGCATCAT ATGTAAGTCA TTAAGATTGT CTCCAAAAGC CATGACCTGA TCCATTGTGA 6900 
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TACCAAGTTT 


TTTAACTAAT 


TCAACAATGG 


CCACTCCCTT 


ATCCACATAG 


TCCAGAACAA 


6960 


TATCAATGGA 


TTCAAAGCCA 


GTTGTCATGG 


CCTTAACACC 


AGGAACGTTT 


TCGTTTACCC 


7020 


AAGCCTCCCC 


ATCTTCCAGC 


GTTTCTTCTG 


TGAAGTTGGT 


TGTAAATTTG 


AAAATGTCAT 


7080 


CTGTGATATC 


TTCCAAACTC 


GCTACTTTTT 


GGATATTTTC 


ATTATAGTGC 


TGACTCACTT 


7140 


TCAAATAGGT 


CTCATCAACC 


GTATCTAGAA 


CATATGAACC 


CTTCTTACCC 


GTCAAGAGCA 


7200 


GTTTATTGAT 


ATCTACATAA 


GGTGAAGTTT 


TCAGCTTTTC 


AAAAGTTGCC 


AGATAAAAGT 


7260 


CACCAGACAT 


AGTCGCTTCA 


TACAAGTCCT 


GACCTTGATA 


CTCTACCAAA 


CTGCCATTTT 


7320 


CCGCGATGAA 


AATAATGTCA 


TCACGAACAC 


CAGCAAATAA 


TTTTTCTAGA 


GACAGAAATC 


7380 


CCCGACCCGA 


AGCTACCGCA 


AAGTAAATCC 


CTTTTTCCTT 


GTAGGAAACC 


AAGAGAGACT 


7440 


TGAGACGATC 


CATATCAAAG 


CGTCCATTCC 


CATCTAGGAA 


GGTTCCGTCC 


ATATCCGTTG 


7500 


CTACTAGTTT 


AATTGTCATC 


CTTCAATACT 


TTCTAAATCT 


TTTAACTTAA 


CTGAAACAAT 


7560 


CTTTGAAACA 


CCCCATTCTT 


GCATGGTCAC 


TCCATAGATG 


GAATCAGCCG 


CTGCCATGCT 


7620 


TCCCTTACGG 


TGGGTTACCA 


CGATGAACTG 


GCTGTCCTTG 


TCAAAGCGGT 


TGAGGTAATC 


7680 


CCCAAAACGT 


TTAACATTGG 


CTTCATCCAG 


CGCAGCTTCC 


ACCTCATCCA 


AGATAACAAA 


7740 


TGGAATAGTC 


TTGACACGAA 


TAATGGAGAA 


GAGCAAGGCA 


AGAGCCGATA 


GGGCTTTTTC 


7800 


ACCACCACTC 


ATGAGATTAA 


GAGACTGGAT 


TTTCTTGCCT 


GGTGGTTGGA 


CAGAAATTTC 


7860 


AACCCCAGCT 


GTCAGCAAGT 


CTCCTTCAGT 


CAAAATGAGG 


TCAGCCTGAC 


CTCCACCAAA 


7920 


CATCTGCTTG 


AAGGTCACTT 


TAAAGGACTC 


ACGAATGACC 


TCAAAGGTTG 


ATTTAAAGCG 


7900 


TTCCTTGACC 


TCATCATTCA 


TCTCTGTAAT 


GGTCTCAAGG 


AGCAGGTTTT 


TCGCAGACAA 


804 0 


AATATCATCA 


CGTTGGCTAT 


TTAGGAAATC 


CAGACGGTTG 


TGAACTTCTT 


CGTACTGTTC 


8100 


AATAGCGTCT 


AAATTGACAG 


GACCCAGTGA 


GCGTATACCC 


TTCTCTAAAT 


CCTTAACTTC 


8160 


TTGCTCTCCC 


AGATTGAGAT 


TTTCCAACTC 


ATGCGCCTTT 


TCTAAAGCTT 


CTGTGTAGCT 


8220 


GATCTGGTAC 


TGGTCTGTTA 


ATTGACTTTG 


TAGATGGCGC 


AAGCGCTCGC 


TAACCTTTTC 


8280 


TTTCTTGGCT 


TCAGCACGAG 


TTTGCTTGCG 


AATCCACTCT 


TCATTCTGCT 


GGCGAGCCTG 


8340 


ATCCAAATGA 


CTAGCAATAT 


CATCCAGTTG 


ACCCTCAATA 


TCATCCAACT 


CAAACTGCTT 


6400 


GCGAATCAAA 


CCTTGTTGGA 


GATTTGTTTT 


TTGAGTTTTG 


GATTCTTCCG 


CCTGTTGACT 


8460 


GAGCAATTCT 


GTATCAACCT 


TCTCAAGATT 


ATCAATCTTT 


TCTTGAAGAA 


GGCGCTGGAT 


8520 


TTCCTCTTGT 


TCAAAATCAA 


GATTGTCCAA 


TTCCTTGCCT 


AAGCGTTCAA 


TATCAGCAAC 


8580 


TTCATAACGT 


TTTTGCCCTT 


GCAGTTCTGT 


CTTAAGCAAA 


CGAGCTTGCG 


CTAGCTCTTC 


8640 
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CTGCAAGTTT 


TGATAGCGTT 


CTTGGATGGC 




GACTTAATCT 


CTTCAATCTC 


8700 


AGCTTCCAGA 


TTTTGCTTGT 


CACTGGAGAT 


TGCAGCAAGA 


CGCTCTTGGC 


AGTTTTCCTT 


8760 


ATCCGCTTGC 


CAATCTCCCT 


CGGAAAGACG 


ATCTATTTCC 


TCTTCTTGGA 


GTTTCCAAAG 


8820 


AGTTTCCAGT 


TCTTCAACTT 


GCTGACTAGT 


TTGCTGATAA 


GCGAGGAACA 


AGCCTTGCTC 


8880 


CTGAATACGT 


GCCTGCTCTC 


CTTGAGATTT 


AATAGCTTCT 


AATGACTCGG 


TCAATCTGGC 


8940 


CATCTCATCT 


TGCAAGGTCT 


TCAAAGTCGC 


CTCTTCTGAA 


CCCAAGCTTG 


CTTCTTCTTC 


9000 


AGCAATTTCT 


TTTTGTAATT 


GCTCCAGTTC 


TGGCTTGATA 


AAAATGCTGT 


TATTCTGGCG 


9060 


ATTGGCACCA 


CCTGCATAAG 


AACCACCTGT 


GCGCAACTCT 


GTCCCATCCA 


ATGTCACCAT 


9120 


ACGAACCTGA 


TAACGAACTT 


GGCGAGCTGC 


TGCACGCGCA 


TGTTCTACGG 


TATCAAAGAT 


91 80 

710V 


AGCCGTCGTA 


GCTAGCAAGT 


TCTTGAAAAT 


GGCTTCCAGT 


CTAGTATCAA 


AAGTCACCAA 


9240 


CTCATCTGCC 


ATCCCAAGGA 


AACCTGGGCT 


TACAGCGATA 


GCATCTTGGT 


TCTGACTAGA 


9100 


AATCGTACGC 


GCCTTGATAG 


TGGTCAAAGG 


AAGAAAGGTT 


GCACGACCGG 


i X* A \J * * WWW 


7 J w U 


TTTAAGGAAG 


TCAATAGCCT 


TGGTTGCCGA 


CTCTTCATCT 


TCTACGATGA 


TATGCTGGCT 

* *» * WW 4 WWW 4 


9420 

J m \t 


ACTTGCCCCT 


AAGGCAATCT 


CTAGGGCAGT 


TTGATAATAA 


ACATCAAACG 


TCAGATGCTC 


Q4 AO 


ACTGACTCCA 


CCAATAATCC 


CACCTAGGCG 


ATCTTTTTCT 


TGGAGAACAC 


TCTTAACACC 


9S40 


TGCATAAAAG 


TTACTATGAT 


TTCTCAGGAT 


ATTTTCCAAA 


CTTTGAGCTC 


TGGCCTGCTT 


9600 


GTTTTTGAGA 


TTATCCAGAC 


GCTCAAAGAG 


TTGGCTTTGT 


TGAGCTTGAT 


AGGAAGTTTT 


9660 


CTGCTCCTCT 


TGCTCCTTGG 


CAATAGCTTG 


GTAGTCAGCC 




GAACCTGCTC 

*^*W»V^ W A WW 4 W 


9720 


CTTGGCAGTT 


TCAAGCTCTT 


CCTTTTGCTG 


ACTAGCCTTC 


TCTTTAGCTA 


TAGCTAATTG 


9780 


CTCTTTCAGC 


TTTTCTAGTT 


GATCTGCTTG 


TTTTTGAGAA 


AGCTGACGAC 


TATTTTCCAA 


9ftd^ 


CTCATTCTCA 


ATACGGGTCA 


ACTGGTTTGA 


GACATCCGCT 


TCTTCTTGTA 


AAAGAGCTAC 


9900 


AAAGCGTTCA 


CGTAAGAGCT 


CAATCATCTG 


ATCAGGATCG 


TCTGAGAAAG 


CCAGCAATTC 


9960 


AGCTTCTAAA 


CGATTGAGTT 


TTTGATTATT 


TTGGACTAGA 


TTTCCCTCTA 


ACAGAGCTAA 


10020 


AGAGCTTTCT 


TTATCAGACT 


TTTCTTTGCT 


GAGTGAATTT 


CTCTTATCCT 


CCAAAGCAGC 


10080 


CAAACGGGCT 


TGTGCCTCCT 


GTTGATTCAA 


GGCCACTTGC 


TCGGACTCCA 


GTTTCGATAG 


10140 


GGCTAATTTT 


CTTTCTAAAT 


CACTAATCAG 


ACTAGTCAAG 


TCCATCAAAC 


TGCCTTGGTC 


10200 


TTTGGCCATT 


TCAGCCTGTA 


AATCTTGGCG 


TTGCTTTTTA 


AGAGTTTGAT 


TTTCTTCTTC 


10260 


TAATTTTTCA 


CGCTTTTGGT 


AATAACTCAT 


CAAGAGTTCT 


TGAACCTGAG 


TCAACTCTTC 


10320 


TTCTGTCGAC 


TCTAGTTCAG 


CCTTATTTTC 


CTTGATTTGA 


GCAACCAGAA 


CATCTAAATA 


10380 


AATAGCCTTA 


CGTTGTCCTT 


CCAAGTCTAA 


AAACTTACGG 


GCATTCTCAG 


CTTGCTTCTC 


10440 
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AAGAGGCTTG 


ATTTGATTAT 


CCAACTCGTA 


GATAATGTCC 


TCTAAGCGCT 


CCAGATTATC 


10500 


CTGAGTTTGC 


TGCAGTTTAC 


TCTCGGTTTC 


TTTTCTGCGA 


GTCTTGTATT 


TTAAAACTCC 


10560 


AGCAGCTTCT 


TCAAAAATAG 


CTCGTCGTTC 


CTCAGGCTTG 


GAATTAAAAA 


TCTCCTCAAC 


10620 


CTTCCCTTGG 


GAAATAATAG 


AGAAGGAATC 


TCGTCCCAAT 


CCAGTATCCA 


AGAAGAGGTC 


10680 


ATGAATATCA 


CGCAGACGGA 


CTTTCTTGCC 


GTCAATCTTG 


TATTCGCTAT 


CTCCACTACG 


10740 


ATAGACATGG 


CGTTCCACCC 


TGATTTCTTG 


ACCTGCATCC 


TTGATAAATC 


CGTCATGATT 


10800 


ATCCAGAGTC 


ACAACTACAG 


AAGCATAATT 


GAGCGGTTTG 


CGACTTTCGG 


TTCCAGCAAA 


10860 


GATGATATCC 


GGCATCTTGC 


CCCCACGGAG 


ACTCTTGACA 


CTAGACTCCC 


CCAAAGCCCA 


10920 


ACGCAGACTT 


TCTGTAATAT 


TGGACTTTCC 


AGATCCATTG 


GGTCCAACAA 


CTGCCGTCAC 


10980 


ACCTTGGTCA 


AAAACGACCT 


TGGTCTTATC 


AGCAAAAGAC 


TTGAACCCCT 


GAATTTCGAT 


11040 


TTCCTTTAAA 


TACATGAATC 


CAGCCCCTTC 


TCAACGCCAT 


TTTTGGCAGC 


TTCCTGCTCT 


11100 


GCTAATTTCT 


TAGAACGACC 


TTGGCCTTGA 


CCGATGCTCT 


TACCTTCAAC 


AAGAACTTCT 


11160 


ACATCAAAAA 


CCTTATCGTG 


AGCAGGCCCT 


GTTTCAGAAA 


TCACCTGATA 


ACGAATACCC 


11220 


ACATCACCAT 


TGACCTGAAG 


CAACTCTTGG 


AGATGGGTTT 


TATAGTCTGT 


AATCATCTCA 


11280 


AACTCGCCTG 


CTTCAACCTT 


AGGAATCATG 


ACTTGATAGA 


TAAATTCCTT 


GACCTTGGCC 


11340 


ACATCCTTAT 


CCAAAAGAAG 


GGCACCAAGA 


AAGGCTTCAA 


AGGCATCACC 


AAGAATGGTG 


11400 


TCACGATTGC 


GACCACCTGA 


TTTTTCTTCC 


CCTTTACCCA 


ACTTGATAAA 


CTGGTCAAAC 


11460 


TGGCAATCAC 


GCGCAAAACC 


AGCTAAACTC 


TCCTCACGGA 


CAATCATAGC 


ACGGAGTTTT 


11520 


GATAGGTCAC 


CTTCAGGCTT 


TTTAGGATAT 


TTTTTATATA 


GATATTCTGA 


AATCAATAAC 


11580 


TGTAGAACAG 


CGTCTCCTAA 


AAATTCCAAG 


CGTTCATTCT 


GTGAAATTTT 


TAAGAGGCGG 


11640 


TGCTCATTGG 


CATAACTCGT 


ATGAGTAAAG 


GCAGTTTCCA 


GTAACTTTTT 


GTCTGCAAAT 


11700 


TCGATTGCAA 


AATGATTCTT 


TAGTACAGTT 


TGTAATTCTT 


TCATACCAAC 


CTCTTTCTAA 


11760 


CTGATAATAG 


TCCTTTTTAT 


TATATCAAAA 


AAAGCCCCCT 


GAGTCACTCT 


AAAACGGGAC 


11820 


TGGAAAGCAT 


TTGGGAATTC 


TTTAGACAGA 


GATTCTCAGT 


TTTAGCGGCA 


AATTTGGGTC 


11880 


AGGATAAAGA 


AAAAAGCCCT 


ATTAAAGGCT 


TTTTAGGATG 


TTTACATCCA 


CCCTGAGGGA 


11940 


ATCGAACCCC 


CATCTCAAGA 


ACCGGAATCT 


TACGTGATAT 


CCATTACACT 


AAGGGTGGAA 


12000 


ACTTGTTTTA 


TTATAACAGA 


AATTTGCTCT 


AATAACAAGT 


TTTTTGGTCA 


AAGACCCCGT 


12060 


CTTAGTGGGA 


AGCATCCCCA 


TTCCAGATGG 


AGTTTTTCAC 


GATCACATAA 


TCAACGTGTT 


12120 


TAAGGTCAGC 


AACCTGACGT 


CCACCTGCAT 


AAGAAATAGC 


ACTTTGAAGG 


TCTTGTTCCA 


12180 
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TCTCAGTTAA AGTGTCTTGC AGATGACCTT TAGCAGGAAG CAAGATACGT TTGCCTTCCA 
CATTTTTGTA AGCACCTTTT TGATATTGTG AGGCTGAACC ATAATATTCT TTGAACTGTT 
CACCATCGAC TTCAATCGTT TTCCCTGGAC TTTCAATGTG TCCTGCAAAG AGGGAACCAA 
TCATGATCAT GCTAGCACCG AAGCGGATAG ACTTAGCAAT ATCACCGTGA GTACGAATTC 
CTCCATCAGC GATAATCGGT TTACGCGCAG CCTTGGCACA CCAGCGTAGA GCAGCCAACT 
GCCAACCACC TGTACCAAAA CCAGTCTTAA CCTTGGTGAT ACAAACCTTA CCAGGACCGA 
TTCCGACCTT AGTAGCATCC GCACCAGCAT TTTCCAATTC ACGCACAGCT TCTGGTGTTC 
CCACATTTCC AGCAATGACA AAGGTATCTG GCAATTCTTT CTTGATGTGT TGAATCATAG 
AAATCACGCT ATCCGCATGA CCATGAGCAA TATCAATAGT GATATACTCA GGAGTATCAG 
CCTTGAGCTG GCTAACAAAA TCATACTCAT AATCCTTAAC ACCGACAGAG ATAGAAGCAA 
TGAGCCCTTG ATTGTGCATT CGTTTAATAA AAGGAATGCG TCCTGCCTCA TCAAAACGGT 
GCATAATGTA GAAGTAACCA CCTTTAGCCA GTTGCTCTGC TACATTTTCA TCCAAAATCG 
TCTGCATATT CGCTGGCACA ACAGGTAGTT TAAAGGTGTG ATTTCCTAAA CTGACACTTG 
TATCCGCTTC TGCACGGCTT TTAATGACAC ATTTATTTGG AATCAATTGA ATATCTTCGT 
AATCAAAAAT TGGAAATTCA TTTAACATAT CGATGTCTCG TTTCTTTTGT AATGACCTAC 
CTATGCTCTT GCATCACTAC GCCTTTTCCG ACGTTTCCTG G 
(2) INFORMATION FOR SEQ ID NO: 127: 

(X) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



12240 

12300 

12360 

12420 

12460 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13121 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 
CCGAATGCAA TGTTTACGGT TGAACTTGAA AATGGACATC AGATTTTAGC AACAGTTTCT 
GGTAAAATTC GTAAAAACTA TATTCGTATT TTACCGGGAG ATCGTCTTAC TGTCGAAATG 
AGTCCATATG ACTTGACACG TGGACGTATC ACTTACCGCT TTAAATAATC GAAAAACTTG 
GAGGGATAAG AAATGAAAGT AAGACCATCG GTCAAACCAA TTTGCGAATA CTGTAAAGTT 
ATTCGTCGTA ATGGTCGTGT TATGGTAATT TGCCCAGCAA ATCCAAAACA CAAACAACCT 
CAAGGATAAG ATAGAAAGGA GAAAACATGG CTCGTATTGC TGGAGTTGAT ATTCCAAATG 
ACAAACGCGT AGTAATCTCA TTGACTTATG TTTATGGTAT CGGACTTGCA ACATCTAAGA 
AAATTTTGGC TGCTGCTGGA ATCTCAGAAG ATGTTCGTGT ACGTGATCTT ACATCAGATC 



60 
120 
180 
240 
300 
360 
420 
480 
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AAGAAGATGC TATCCGTCCT GAAGTGGATG CAATCAAAGT TGAAGGTGAC CTTCGTCGTG 540 

AAGTAAACTT GAACATCAAA CGTTTGATGG AAATCGGTTC ATACCCTCCT ATCCGTCACC 600 

GTCGTGGACT TCCTGTCCGT GGACAAAACA CTAAAAACAA CGCCCGCACT CGTAAAGGTA 660 

AAGCTGTTGC GATTGCTGGT AAGAAAAAAT AATATAGGAG GTAAAAGTCT TGGCTAAACC 720 

AACACGTAAA CGTCGTGTGA AAAAGAATAT CGAATCTGGT ATTGCTCATA TTCACGCTAC 780 

ATTTAATAAC ACTATTGTTA TGATTACTGA TGTGCATGGT AATGCAATTG CTTGGTCATC 840 

AGCTGGTGCT CTTGGTTTCA AAGGTTCTCG TAAATCTACA CCATTCGCTG CTCAAATGGC 900 

TTCTGAAGCT GCTGCTAAAT CTGCACAAGA ACACGGTCTT AAATCAGTTG AAGTTACTGT 960 

AAAAGGTCCA GGTTCTGGTC GTGAGTCAGC TATTCGTGCG CTTGCTGCCG CTGGTCTTGA 1020 

AGTAACAGCA ATTCGTGATG TGACTCCAGT GCCACACAAT GGTGCTCGTC CTCCAAAACG 1080 

TCGCCGTGTA TAATCATCGC ATTACACTGC TTTTCGTTTA AGAGGGAGTA ACTAAATGAT 1140 

CGAGTTTGAA AAACCAAATA TAACAAAAAT TGATGAAAAT AAAGATTATG GCAAGTTTGT 1200 

AATCGAACCA CTTGAACGTG GCTACGGTAC AACTCTTGGT AACTCTCTTC GTCGTGTACT 1260 

TCTAGCTTCT CTACCAGGAG CAGCTGTGAC ATCTATCAAC ATTGATGGTG TGTTACATGA 1320 

GTTTGACACA GTTCCAGGTG TTCGTGAAGA CGTGATCCAA ATCATTCTGA ACATTAAAGG 1380 

AATTGCAGTG AAATCGTACG TTGAAGACGA AAAAATCATC GAACTGGATG TTGAAGGTCC 1440 

TGCTGAAGTA ACAGCTGGTG ACATTTTGAC AGATAGCGAT ATTGAAATTG TAAATCCAGA 1500 

TCATTATCTC TTTACAATCG GTGAAGGTTC TTCTCTAAAA GCGACTATGA CTGTTAACAG 1560 

TGGTCGTGGA TATGTACCTG CTGATGAAAA TAAAAAGGAT AATGCACCAG TTGGAACACT 1620 

TGCTGTAGAT TCTATTTATA CACCAGTTAC AAAAGTCAAC TATCAAGTGG AACCTGCTCG 1680 

TGTAGGTAGC AATGATGCTT TCGACAAATT AACCCTTGAA ATCTTGACAA ATGGAACAAT 1740 

TATTCCAGAA GATGCTTTAG GCCTTTCAGC ACGTATTTTG ACAGAACATC TTGATTTGTT 1800 

TACAAATCTT ACTGAGATTG CTAAGTCAAC TGAAGTGATG AAAGAAGCTG ATACTGAATC I860 

TGACGACCGT ATTTTAGATC GTACGATTGA GGAACTGGAC TTGTCTGTGC GTTCATACAA 1920 

CTGTTTAAAA CGTGCCGGTA TCAATACTGT CCATGATTTG ACAGAAAAAT CTGAAGCAGA 1980 

GATGATGAAA GTACGAAATC TTGGACGCAA GAGTTTGGAA GAAGTGAAAC TCAAACTCAT 2040 

TGATTTGGGT CTTGGATTAA AAGATAAATA AAGGAGGAAT ACATGGCTTA CCGTAAACTA 2100 

GGACGCACTA GCTCACAACG TAAAGCAATG CTTCGCGATT TGACAACTGA CCTTTTGATC 2160 

AACGAATCAA TCGTGACAAC TGAAGCTCGT GCTAAAGAAA TCCGTAAAAC TGTTGAAAAA 2220 
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ATGATTACTC 


TAGGTAAACG 


TGGTGATTTG 


CATGCACGTC 


GTCAAGCAGC 


TGCTTTCGTA 


2280 


CGTAATGAAA 


TCGCATCTGA 


AAACTATGAT 


GAAGCAACTG 


ATAAGTACAC 


TTCTACTACA 


2340 


GCACTTCAAA 


AATTGTTCTC 


AGAAATCGCA 


CCTCGTTATG 


CTGAACGTAA 


CGGTGGATAC 


2400 


ACTCGTATCC 


TTAAAACTGA 


ATCACGTCGT 


GGTGATGCAG 


CGCCAATGGC GATCATCGAA 


2460 


TTAGTATAAA 


ATCATCAATT 


TTGTTGAGTG 


TTATGATGAT 


GGAGTCTTGT 


GCTCTTAGTC 


2520 


TAGCTCTGGT 


CTACCGCTAG 


GATTTCGGTC 


CTAGCGGGAA 


CACTCATCAT 


AAGTTGGGAT 


2580 


AGTAGACGCT 


TGTTTACGAA 


ATTGTTTTTT 


TCTTAAGAAC 


AACTTCGTAA GCAGGCGTTT 


2640 


TTGAGTATTT 


TCGTTAGAAT 


TATGCTATAC 


TATTTGAAAA 


GAATCCTGTT 


TAATGTTAAG 


2700 


GTTTCTTATT 


TTAAGAAGAA 


TTGGAGTTTA 


CTTATGAAAG CCATTATAAC 


TGTTGTTGGT 


2760 


AAAGATAAAT 


CTGGAATTGT 


TGCAGGTGTT 


TCTGGTAAAA 


TTGCAGAATT 


AGGATTGAAT 


2B20 

mm 9 m V 


ATTGACGATA 


TCTCTCAAAC 


TGTCTTGGAT 


GAATATTTTA 


CGATGATGGC 


TGTTGTATCT 


2880 

^ W W w 


AGTGATGAAA 


AGCAAGATTT 


TACCTATCTT 


CGTAATGAAT 


TTGAAGCTTT 


TGGGCAAACT 


294 0 


TTGAATGTAA 


AAATCAATAT 


TCAGAGTGCA 


GCGATTTTCG 


AAGCTATGTA 


TAATATCTAG 


3000 


GAGGTCATCA 


TGGATATTAG 


ACAAGTTACT 


GAAACCATCG 


CCATGATTGA 


GGAGCAAAAC 


3060 


TTCGATATTA 


GAACCATTAC 


CATGGGGATT 


TCTCTTTTGG 


ACTGTATCGA 


TCCAGATATC 


3120 

J X v 


AATCGTGCTG 


CGGAGAAAAT 


CTATCAAAAA 


ATTACGACAA 


AGGCGGCTAA 


TTTAGTAGCT 


3180 


GTTGGTGATG 


AAATTGCGGC 


TGAGTTGGGA 


ATTCCTATCG 


TTAATAAGCG 


TGTATCGGTG 




ACACCTATTT 


CTCTGATTGG 


GGCAGCGACA 


GATGCGACGG 


ACTACGTGGT 


TCTGGCAAAA 


33Q0 


GCGCTTGATA 


AGGCTGCGAA 


AGAGATTGGT 


GTGGACTTTA 


TTGGTGGTTT 


TTCTGCCTTA 


3360 


GTACAAAAAG 


GTTATCAAAA 


GGGACATGAG 


ATTCTCATCA 


ATTCCATTCC 


TCGCGCTTTG 


3420 


GCTGAGACGG 


ATAAGGTCTG 


CTCGTCAGTC 


AATATCGGCT 


CAACCAAGTC 


TGGTATTAAT 


3480 


ATGACGGCTG 


TGGCAGATAT 


GGGACGAATT 


ATCAAGGAAA 


CAGCAAATCT 


TTCAGATATG 


3540 


GGAGTGGCCA 


AGTTGGTTGT 


ATTCGCTAAT 


GCTGTTGAGG 


ACAATCCATT 


TATGGCGGGT 


3600 


GCCTTTCATG 


GTGTTGGGGA 


AGCAGATGTT 


ATCATCAATG 


TCGGAGTTTC 


TGGTCCTGGT 


3660 


GTTGTGAAAC 


GTGCTTTGGA 


AAAAGTTCGT 


GGACAGAGCT 


TTGATGTAGT 


AGCCGAAACA 


3720 


GTTAAGAAAA CTGCCTTTAA 


AATCACTCGT 


ATCGGTCAAT 


TGGTTGGTCA 


AATGGCCAGT 


3780 


GAGAGACTGG 


GTGTGGAGTT 


TGGTATTGTG 


GACTTGAGTT 


TGGCACCAAC 


CCCTGCGGTT 


3840 


GGAGACTCTG 


TGGCACGTGT 


CCTTGAGGAA 


ATGGGGCTAG 


AAACAGTTGG 


CACGCATGGA 


3900 


ACGACGGCTG 


CCTTGGCCCT 


CTTGAACGAC 


CAAGTTAAAA 


AGGGTGGAGT 


GATGGCCTGC 


3960 


AACCAAGTCG 


GTGGTTTATC 


TGGTGCCTTT 


ATCCCTGTTT 


CTGAGGATGA 


AGGAATGATT 


4020 
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GCTGCAGTGC 


AAAATGGCTC 


TCTTAATTTA 


GAAAAACTAG 


AAGCTATGAC 


GGCTATCTGT 


4080 


TCTGTTGGAT 


TGGATATGAT 


TGCCATCCCA 


GAAGATACGC 


CTGCTGAAAC 


TATTGCGGCT 


4140 


ATGATTGCCG 


ATGAAGCAGC 


AATCGGTGTT 


ATCAACATGA 


AAACAACAGC 


TGTTCGTATC 


4200 


ATTCCCAAAG 


GAAAAGAAGG 


CGATATGATT 


GAGTTTGGTG 


GTCTATTAGG 


AACTGCACCC 


4260 


GTTATGAAGG 


TTAATGGGGC 


TTCGTCTGTC 


GACTTCATCT 


CTCGCGGTGG 


ACAAATCCCA 


4320 


GCACCAATTC 


ATAGTTTTAA 


AAATTAAGAA 


AATAGGAGAA 


ATTTTAAGTT 


CTATTTAAGA 


4380 


TTAGACGTGT 


ATACTATAAT 


CATTAAATAA 


AGACCTCCTA 


ATATTATTTG 


AAACAGATAA 


4440 


CACTGAATTA 


GTTTGAATTT 


GATTTTCATC 


TAATATCTTT 


ATTTAATGAA 


CTCCTAAACT 


4500 


TTTrCATAAT 


AATCTCCTTC 


AAAAGTCGCC 


TGTATGGGTG 


GCTTTTATTT 


TATCATTCAT 


4560 


GATATAATAG 


AAGCAAACGG 


AGGACGGAAA 


ATGGTAAAAG 


TACGATTGTA 


TTTGGTACGT 


4620 


CATGGCAAGA 


CCATGTTTAA 


CACGATTGGT 


CGCGCGCAAG 


GTTGGAGCGA 


TACTCCCTTA 


4680 


ACTGCTGAAG 


GTGAACGAGG 


GATTCAAGAG 


TTAGGAATCG 


GTTTGCGAGA 


ATCTGATCTA 


4740 


CAGTTTGAGC 


GTGCTTATTC 


GAGTGATTCT 


GGTCGTACCA 


TTCAGACCAT 


GGGAATTATC 


4800 


CTTGAAGAAC 


TTGGCTTGCA 


GGGGGAAATC 


CCTTATCGCA 


TGGACAAGCG 


TATCAGAGAA 


4860 


TGGTGTTTCC 


GTAGTTTTGA 


TGGAGCCTAT 


GATGGCGATC 


TTTTCATGGG 


CATTATTCCT 


4920 


CGTATCTTTA 


ATGTGGACCA 


CGTTCACCAA 


TTGTCTTATG 


CTGAACTGGC 


TGAGGGCTTC 


4980 


GTAGAGCTCG 


ATACAGCTGG 


TTGGGCTGAA 


GGCTGGGAAA 


AACTCAGTGG 


CCGAATCAAG 


5040 


GAAGGCTTTG 


AAATGATTGC 


AAAAGAAATG 


GAAGATCAAG 


GTGGAGGTAA 


CGCCCTTGTT 


5100 


GTCAGCCATG 


GAATGACTAT 


TGGAACCATT 


GTTTATCTGA 


TTAATGGCAT 


GCATCCGCAT 


5160 


GGTCTGGATA 


ATGGTAGCGT 


GACAATCCTT 


GAATATGAGC 


ACGGCCAGTT 


TAGGGTTGAA 


5220 


GTTGTCGGTG 


ACCGTAGTTA 


CCGAGAGCTA 


GGACGTGAGA 


AGATGGAAGA 


AGGCTCTATT 


5280 


TAATCAGTCT 


AGACTTGCTT 


CCCATGAGCT 


AGGGATTTGA 


TAAGAATATC 


AAGATAAGAA 


5340 


AAAACAGCCG 


AGGGCACTCC 


TTTCGGCTGT 


TTTTGATGTG 


GAAAACTAAA 


GTGTAATGCT 


5400 


ATTGCTTTTA 


GAGATTTTCA 


TAAACAAGAG 


CAAGGAACCT 


ACTGTTAGAA 


CAGTCAGGAT 


5460 


AGTTGACAAG 


GTTGCGGCTA 


CACCGTAATT 


TCCTCTGAGA 


ACCTCTGTAT 


AAATAGCTAC 


5520 


AGTCATTGTT 


CTTGTTTTGA 


CATTGTAGAG 


GAGGATAGAA 


GTAGAGAGTT 


TTGAAATCAT 


5580 


TGTGACTCAA 


GATAAGATGG 


CTCCAGAAAT 


GATACCAGAT 


AGCATCATTG 


GAGTTGTAAT 


5640 


CTTAGCAAAG 


GTATTGAGAC 


GACTACTTCC 


TAAGCTTTCA 


GCAGCTTCTT 


CAATACTTGG 


5700 


TGCTATTTGT 


TGTAAGCTAG 


CAACAGATGA 


GCGAATAGTA 


TAAGGTAATC 


TTCTGGCAGA 


5760 
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TAGAGACATA ATCAAGATGA AAGCAGTCCC TGTAATCATA AGAAATCCAC TTCCAAATAG 582 0 

ACCAGTATTG AAGGAAGAAA TGAAGGCAAT CCCTAGAACG GTTCCTGGTA CAATATAAGG 5880 

TACCATACTG AGGCTGTCAA TTAAGTTTGT AAACAAATTC CGTTTTCTAA CGGCTAGGTA 5940 

GGAGATAAAT GTCCCAAATA GAACAACTAG AACTAAGGCA ATCAAACGGA TACGAATGCT 6000 

ATTGAAAATA GCAGATCCCA TACGATGGAA AGCTACCTTG TAACTGTTTG GAGAATAACC 6060 

TTTAACAGAT ACCATACCTG ATGTTTTTAG GAAAGAGGTA TAAATTAAGT AGATTTGAGG 6120 

TAAAACAGAG ATAAAGATAA TTCCGTAGAC TGTTGCATAA ATGGCAGCCA TTTTTCCTTT 6180 

TGTAGTTTTT TTAGGCTCAA TTGGATGGAG CAGATTCATG CTGAAACTGT AGCGGTTTGC 6240 

AATGTGTTTT TGGATAAGGA AAATTGCCAA GGCAATGATA ATCGCCATAA TTGCAAAAGC 6300 

AGAATTTCCT CCAACCTCGC TAATAAATTG GGTATAAATC AGGACAGGGA AAGTCCGATA 6360 

CCCTTCGCCA ATCAACATAG GCGTTCCAAA GTCTGAGAAT GCTCTCATAA ATACAAGCAA 6420 

GGAGCTGCTA GTAAGGTTGG AACTAGGAGA GGTAAAACAA CCGTTACGAT AGGTTTAAAT 6480 

CCGAAGGACC CCATGCTTTC AGCTGCTTCA AGTAGAGAAT TGTCAATACT GTTCATTGTT 6540 

CCAGCAACAT ATAGAAATAC CAGTGGGAAT AGTTGCAGTG TAAAGACAAG TACAATTCCT 6600 

TTGAATCAAT AAATATCGAT AGCTGGAAGA TAAAGGGCAT TTGTCAAAAA TTTAGTGATG 6660 

ACCTCATTTC GTCCTAGCAA GAGAACCCAG GAGTAGGCTC CTACGAAAGG AGCTGACATG 6720 

GAAGCAATGA TAATCAATAT TTGTAGAAAT TTCTTCCCCT TGAAGTCATA CATAGAGAAG 6780 

AGATAAGCTA ATAGGGTTCC TACAACTAAG GAAGTGATAG TAGCGGTAAT GGAAACCTTG 6840 

AAACTGTTGA CTAGTGTCTC AGAGTAGTAG GCTTTACTAA AGAAAGTGAC AAAATTAGCT 6900 

AGTGAGAATT GTCCTTCATG TATAAGTGCT TGCTTGAGCA CCGTAACGAT AGGATAAACG 6960 

AGAAAGATAG GATAGGTAAG AAAGAGGAAG AAAGAGGAAA CTGTCCAAAT ATTTAGTTTT 7020 

TTACGTTCCA TGGTTGACTC CTTTTATCAG GTTTTGGCAA CCATCTGCAG AAAAGATGTT 7080 

TAATTTTTGC GTATTGATTC GTAGACGAAT ACGATTGCCT TTTTGTAGAT CTTCTTCAAA 7140 

AGTTGATTCT TCACTAACTT GAATTTTTGA GGCAAAACCT GTCTCAATGA AATAATCCGT 7200 

ATTTAGTCCA AGATAGACGC TATCTCTAAT AGTTCCTTCA ATATCTCCAG ATTCATCTTT 7260 

GATAAACTCT TCGGGACGAA TGCTTACATG AATAGCTTGC TCCTCAACCT GATCAAGAGC 7320 

TGGCATTCGA AGGGCATAGC CATCTGAAAA GACGATATAA GCGCCGTCGC TCCGTTTTTC 7380 

AAGATTGGCA GGGATAATAT TTGTGCGTCC GATAAAGGTT GCCACAAACT CATTAGCTGG 7440 

TTTATGATAG AGTTCTTTTG GTCGGCCGAT TTGTTGGATC ACCCCATCTT TCATAACAGC 7500 

AATTTGGTCT GAAATAGCCA TGGCTTCTTC TTGGTCGTGG GTTACATAAA CAGTTGTAAT 7560 
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TCCCACTTCG 


TGTTGGATTT 


CTCGGATGGC 


TTGACGCATA 


TCCAAGCGAA 


GTTTGGCCTC 


7620 


CAGATTACTA 


AGTGGCTCGT 


CCATGAGGAG 


AACACTTGGA 


TTAACCGCTA 


AGGCGCATGC 


7680 


CAAGGTGACA 


CGTTGTTGTT 


GTCCACCACT 


GAGTTTATCG 


GGCTTTCGAT 


CCGCATATTG 


7740 


AGCAATTTGC 


ATGAGTTCAA 


GATACTTGTT 


GGTCTGTTGA 


ATCAATTCTT 


CTTTTGGAAC 


7800 


CTTCTTTTGC 


ATAAGACCAA 


AAGCAACGTT 


GTCTCGGACA 


GTCAAATGTG 


GGAAAATAGC 


7860 


GTAGTTTTGG 


AAAACCATCC 


CGATATTGCG 


TTTGCTGGGT 


TCCATATTAT 


TGATTTTTGT 


7920 


ATCATCGAAG 


TAAAATTCTC 


CACCTTCGAT 


ACTGTTGAAA 


CCTGCAATCA 


TACGAAGAAG 


7980 


GGTCGTTTTC 


CCACATCCTG 


AAGCTCCAAG 


AACGGTAAAG 


AGACTTCCTT 


TTGGAATTGT 


8040 


AATGTTCAAA 


TTCTCAATAA 


CAGGGACATC 


GTGGTAGATT 


TTTTTGGCGT 


TAATAATTTT 


8100 


GATCTCACTC 


ATAGTGAACC 


TCTTTTACTG 


TTTAGATTGG 


ATATCTGTAA 


AGACTTCGTT 


8160 


GTATTTCTTA 


ACGATATCTG 


ATTTATTCTT 


GATGACATAA 


TCATAATCTT 


CAGTGAGTGT 


8220 


TTTGATTTTG 


TCAATTGGTT 


TCATGTTTTC 


GCTTGTTTTA 


GCATTTTTAC 


GAACAGGACG 


8280 


GTTACTAGTG 


GTTGTACCAA 


GTGTATCTTG 


TACTTCTTGA 


GAGATAATAA 


AATCGATAAA 


8340 


TTTCTTGGCA 


TTTTCCATAT 


TTTTAGATTT 


TTTAACGATA 


CCAGCACTAG 


CAGGTAGGAA 


8400 


GACGGTTCCT 


TCTTTTGGAT 


AGACTACCTT 


AATGTTAGCT 


CCGTCATTTA 


AGAGTTTAAC 


8460 


TGCTGGATCT 


TCATAAGAGA 


GACCAACAGC 


CATTTCTCCA 


TCAGCGACTA 


CTTTATAGAC 


8520 


ACTAGATGAA 


CTTGAACCGA 


TTTTACCATC 


AATAAGTGTC 


AAAAGATCTT 


TTACATAAGA 


8580 


CCAAGCCTTA 


TCATCTTTGT 


AACCACCTTG 


ACCTTGTAGC 


ATATTTGTTA 


ATTGAGCAAA 


8640 


GGCGCTAGAA 


GAGTTTGCTG 


GGTCAGCAGT 


TGCGATTTTT 


CCTTTTAGTT 


CAGGTTTGAA 


8700 


AAGATCGTTA 


TATCCTTCGA 


TGTTCATGCC 


TTTAGTTAAA 


TCAGGGTTGA 


CGATTAAAAC 


8760 


ACTACCATCT 


AGTGTATAAG 


GAGTAGAGTA 


CCCAGTTGTG 


TTTTGATATT 


CTTTGATAAC 


8820 


ATTATCATTT 


TCTTTTGAAG 


TATAGTTTTC 


AAAGAGTTCT 


CCGTGGGTAG 


TATATTGTGT 


8880 


ATAAGAACCA 


CCAAAGATAA 


CATCAGCTAC 


AGGAACTTCT 


TTTTCTGACT 


CTAGTTTTTT 


8940 


GAAAAGTTCT 


CCAGTACCAG 


CTTGAATCAG 


TTCTACTTTG 


ATACCATATT 


TTTCTTCAAA 


9000 


GGCAGGAATA 


GTTGCTCCAA 


TTAAGCCCTC 


TGAGTTTGGT 


GAATAAACGA 


CTAGCGAACC 


9060 


GCCGTCTCCT 


TTATCACATG 


AACTGTCATC 


GGCAGATTCA 


TTAGAAGAAC 


AAGCAGCATA 


9120 


ATACATCCAT 


TTCTTTTTCA 


TGATGGATAC 


CTCCGTTGTG 


TTATTTAAGT 


TTATTTTAAA 


9180 


ACAATGTAAG 


CGTTTTTAAA 


ACATACAATT 


CTATTCTATA 


GTGTATTGAA 


TCTATAACAG 


9240 


TACACTTTGA 


CTGCTAAAAT 


ATTTCTATAA 


ATTAATTTGA 


CTTTCCTGAT 


AGAGATGTTC 


9300 
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ACATCTTATT TCAATTCACT ATATTAGAGT AAAATTCTCT ACAAAAAGAA GAATAGCCTA 9360 

TTTTACTATT CTTCTGAGTG ATTTCAATTC CTTTGGGGAA ATATGGAGAT ACTTTTTAAA 9420 

TCCTGACAAA TGGTTGTTTC TTTTTCTAAA TCGGTGATAC TGTATCGGAG AATGCGCGTG 9480 

AGGTCACAAA GGCTGCGATA GAGCTTCTAT GGAGAATTTC TTTTTGGAGA GATTTTTTAA 9540 

AGGAATGAGA CATCCGCTAC CTCCTTGGAA GGTTTTTG 9573 

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13440 base pairs 
(8) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

CGGGCTGTTG TGACGATTCT TATTTCTATC TGTGTTATCT TTTTGGGAAC TATTTTGGCT 60 

GTTGTCTTGG CTTTTGGGCA ACGTTCAAAG TTTAAACCGC TTGTTTGGTT GGCCAACTTG 120 

TACGTTTGGA TTTTCCGTGG GACACCGATG ATGGTTCAAA TTATGATTGC CTTTCCTCTT 180 

ATGCATATCA ATGCTCCGAC TATTCAGATT GGAATTTTAG GTGTTGATTT TTCGCCTCTG 240 

ATTCCAGGGA TTTTGATTAT CTCTATGAAT AGTGGTGCTT ATGTTTCGGA GACTGTTCGT 300 

GCCGGAATCA ATGCGGTTCC AAAAGGTCAG CTAGAAGCGG CTTATTCGCT AGGGATTCGT 360 

CCTAAAAATG CGATGCGTTA TGTGATTTTG CCACAAGCAG TCAAAAATAT CTTGCCAGCA 420 

TTGGGGAACG AATTTATCAC CATTATCAAG GACAGCTCCC TCTTATCAGC TATTGGGGTC 4 80 

ATGGAGTTGT GGAATGGGGC TACAACAGTT TCTACAACAA CCTATCTACC TTTAACACCA 540 

CTTTTATTTG CAGCATTTTA CTACTTGATT ATGACCTCTA TTCTGACAGT AGCCTTGAAA 600 

GCTTTTGAAA AACATATGGC ACAAGGAGAT AAGAAATAAT GACAGAAACC TTGATAAAAA 660 

TTGAAAATTT ACATAAATCC TTTGGAAAGA ATCAAGTATT GAAGGGCATC AACCTCGAGA 720 

TTAAAAGAGG AGAAGTTGTC GTTATCATCG GTCCTTCAGG GAGCGGGAAA TCTACCTTGC 780 

TTCGCTCTAT GAATTTGTTG GAAGAAGCAA CCAAGGGGAA GGTTATCTTT GAGGGAGTCG 840 

ATATTACGGA CAAGAAGAAT GACCTGTTTG CCATGCGTGA GAAGATGGGC ATGGTTTTTC 900 

AACAATTCAA TCTCTTTCCT AATATGACTG TGATGGAAAA TATCACCTTG TCCCCTATCA 960 

AGACCAAAGG TGACAGTAAG GCCGTTGCAG AGAAAAGAGC TCAGGAACTT TTGGAAAAAG 1020 

TTGGTTTGCC AGATAAGGCA GACGCTTATC CACAGAGTTT GTCAGGTGGC CAGCAACAGC 1080 

GGATTGCCAT CGCGCGTGGG TTGGCTATGG AACCAGATGT TTTGCTCTTT GACGAGCCAA 1140 
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CTTCAGCCCT AGATCCTGAG ATGGTTGGAG AAGTTCTGGC TGTTATGCAA GATCTAGCCA 1200 

AGTCAGGAAT GACCATGGTT ATCGTAACAC ATGAGATGGG ATTTGCCCGT GAGGTGGCAG 1260 

ATCGTGTCAT CTTTATGGCA GACGGTGTGG TTGTTGAAGA CGGAACACCT GAGCAGATTT 1320 

TTGAACAAAC CCAAGGACAA AGGACTAAAG ACTTCTTGAG TAAGGTTTTA TAAGTTACCT 1380 

TTGTTTAGCT ATTTGTAGCC AGCTTTAAAC GTTAAAGAGA AGATTAGTGA AAAGCTCAAC 1440 

CAGAGCTTTT TCTTATAGTT TAAAGCTATA GGATTGCCTA GGAAAGAAGT GTTAGAGCTA 1500 

CATTGTATTT TTTGGTATAA TTAAAGATAT TTGTAAGAAA AGAGAAGTGA T AT G AC AC AG 1560 

ATTATTGATG GGAAAGCTTT AGCGGCCAAA TTGCAGGGGC AGTTGGCTGA AAACACTGCA 1620 

AAATTAAAGG AAGAAACAGG TCTAGTGCCT GGTTTGGTAG TGATTTTGGT TGGGGACAAT 1680 

CCAGCCAGCC AAGTCTACGT TCGCAACAAG GAGAGGTCAG CCCTTGCGGC TGGTTTCCGT 174 0 

AGCGAAGTAG TACGGGTTCC AGAGACCATT ACTCAAGAGG AATTGTTAGA CCTGATTGCT 1800 

AAATACAATC AGGATCCAGC TTGGCATGGG ATTTTGGTTC AGTTGCCATT ACCAAAACAC 1860 

ATTGATGAAG AGGCGGTTCT ATTGGCTATT GACCCAGAAA AGGATGTGGA TCGTTTCCAT 1920 

CCTCTAAACA TGGGGCGTCT TTGGTCTGGT CATCCAGTCA TGATTCCTTC GACACCGGCA 1980 

GGAATTATGG AAATGTTCCA TGAATATGGG ATTCACTTGG AAGGTAAAAA TCCAGTCGTC 2040 

ATCGGTCGAT CCAATATTGT CGGAAAACCT ATGGCCCAGC TTCTTTTGGC AAAGAATGCA 2100 

ACAGTAACCT TGACTC ACT C ACGTACTCAT AATCTTTCCA AGGTGGCTGC AAAAGCAGAT 2160 

A1TCTGGTTG TTGCAATCGG TCGTGCCAAG TTTGTGACTG CTGACTTTGT CAAACCAGGT 2220 

GCGGTAGTCA TTGACGTTGG GATGAACCGC GATGAAAATG GTAAGCTCTG TGGGGATGTT 2280 

GATTATGAGG CGGTTGCCCC ACTTGCTAGC CACATTACGC CAGTCCCTGG AGGTGTCGGT 2340 

* 

CCTATGACCA TTACTATGCT GATGGAGCAA ACCTATCAGG CACCACTTAG GACATTGGAT 2400 

AGAAAATAAG ATAAAAATTT TCTGAGGAAA GTGTATTTTC TATAGCTATA TCTAAAATGA 2460 

TAGAAATGAA TATTAAATTT TAGAAATAAG TTTATAAAAG GAGGTTTGCG CCTCCTTTTT 2520 

GTTGTATAAT GGAGTGAGGT GATTAGATGA TTTTAAAAAT TTATAATGGG GAATATAGTT 2580 

TACAATGGGA TGGAATATAC TACTTAGCAC TAATTGATTA TCCAAATATT CAAGAGTGGG 2640 

AATTAGAAAA AATTGCTAAA TTTATAGCTT ACGAAAAACT TCATAAACGT CAAACAAGTA 2700 

TTGAGTGTGC TGATTCTTGT TTAAAAAAAG AAATTTTAGA TTACATCTGT CAGCATCCCT 2760 

TTCTGCCACC ATTTACTCCT ACAGATAAAA GAGTAGCCTC GACTTATGAC CTACATAAGA 2820 

GGTTAGTGAC TTCAGACTAC TGTAGTCATA CTACGACTAT AGATGCAGCG ATTTCTATTT 2880 
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TTAAAACTGG 


TCGTCTTTTA 


TCTGCTGTGA 


AAGCCTTTGG 


GCGAGATGCT 






TTTTGGATAG 


TCGAAATGCT 


GCATCTGATC 


CGATACATTA 


TTTTGACTAT 


GTCATGTTAG 

VJ X V»r\ X VJ x X r\ 


JUUU 


GGTGGTCAAA 


TACAAGTTCT 


GGTTATCGAT 


TGGCGATGGA 


GCGTTTATTA 


VJ\J X V>Wiu ^ 1 Vf 




CTTCAGAGAA 


AGAATTACAA 


GACAAGTTTA 

^ II »»»*TA AAA *» 


TTCCTGGAGT 


AAGTTTTCAT 


X X In! v.lntn 




CAGATTTGAT 


TAAAGTTCCT 


GGTTATATTT 

^r^' * A «% A ** A A 4 


TTGATGGTTA CL'ATGCTGTA 


A A A ATT A Ann 


lion 
J 1 o u 


ACATGCTTAA 


TTTATTAAGT 

■ LA *» A A ■ A 


GAGTTGTATA 


TTTGCATTAT 


TCCAACTCAT 




1 "5 j* ft 
JZ4U 


AATTTGAAAA 


TATTATTCCA 


ACCAAAATAC 


AAGATAGGGT 


GTATTATCTT 




J JOU 


GAGAAGACTT 


AGAAGAGTGG 


ACTAAGAAAG 


TCTATCAAGT 


TGTTTTAAAA 


PA ATP AP AT A 




AAGGATAGTT 


GAGGAAAAAA 


CGATGAAAGT 


GATTGATCAA 


ACCTTACTAG 


A & A A A/~<T/^> a T> 




TATTGAACGT 


TfTTGT A C A A 
X V. 1 1 w X #VV- nn 


f!Tf*AT A A Ann 


AGACTACGGT 


CGTCTGCTGT 


1 VjV_ rTt»GTGG 


3480 




T A lYmTGG'm 

X A X V9w X Vnl X vt 


pp ATP A TP AT 
V_ V^/\ 1 X V»/\ A 


GGCTGCTTTA 


GCAGCTCTAA 


AAAGCGGTGC 


3540 


AGGATTGnTA 


AfV*f?TTnnAA 

/\V.VavX X V7Vjn/\ 


nnnAPAnnnA 


AAATATCCCT 


GCTCTACACA 


GCCATTTGCC 


~l r f\ f\ 

3600 


TGAGGCTATG 


\_ XX 1 


x v»^./vr\v»A i v. /\ 


GTAATTGTTA 


CAAGAGCAAT 


TGGAGAAGGC 


3660 


AG A A fi TTGTP 


X X VJVr X UuVlvl. 


v_ X vjvj 1X1 av»Vj 


AGACGATACG 


TTTGGAGAAA 


ATCTTGTAAA 


3720 


A f* AGGTPTTT 


GPTAGf*TTAA 


AAA An A ATT 1 A 
/VW\rt\lnrt 1 V-/\ 


GATTTTGATT 


GTAGATGGAG 


GGGCCTTAAC 


3780 


v* x v* v* x x vj\-_ x 


AGGAeAACTT 


X V» X X\» 1 1 X V_ V_ 


ATCTAACCAG 


CTTATCTTAA 


L It» t. t-CACCA 


3840 


AAAAGAATGG 


GAAAAAPmT 


fPflCTlTTnf* 

X uu lAl 1 VJV- 


TATTGAAAAG 


CAAAACGAAG 


GTAv_AACATC 


i a a a 
3 900 


X *VVJ A VJV. V»\* X W 


A PTTPTTTP P 


r*TCAAnnAAn 


/V\ 1111 Wjw 1 A 


uAunnnuvi I \> 


CAGGTACTCG 


^ a c n 


TATTTGGCAA 


GTTGGPCAGT 


V. 4 VM\ X 1 /» x in 


CCAGTTAAAG 


GTTGGCGGTC 


v. LT ATC AGGC 


4020 


GACTGGTGGT 

A vv A W A 


ATGGGTGATA 

«fc X VnnJ X VIA X A 


^/\w X VlMVf X uv 


AATGATTGCA 


GGATTTGCAG 


VJV.V.AA1 1 TLG 


A AO A 

4UUU 


ACAGGCCAGT 


\» a v* a nLuimVf 


GTGTGGPAGT 

U X VJ x VJV_f V^*»0 X 


AGCAACCCAT 


CTTCATTCAG 


LLnlAuLv.v.n 


4140 


AGAACTATCT 


CAAGAAAATT 


n IVfl UU X V— X X 


GCCGACGGAA 


ATTAGTAATT 


OIL! AA 


>l 1 A A 

4200 


AGTAATGAAA 


AGATATGTCT 


AAAATAGTTA 


GACAAAAAAT 


GTTGATAATT 


TGTATCATTA 


4260 


TTCTTAATTC 


ACAAAAAACG 


AACGTTTAGT 


ATTCTTCTTG 


CTAAGAAACT 


AAATTTGTTC 


4320 


GTTTTTTTAC 


TCTTGTAAAT 


CTATTTTTCT 


TAGAGTTGAT 


TTGGTTTACA 


TCCGTACTTA 


4380 


AATTGATTTG 


TTAGAGCTCT 


ACTTTTATTA 


AAAAAATTCA 


ATTTCAAGGA 


TAAATAAGCA 


4440 


GTATTCTAAA 


GGTACTTTTA 


GATGAAATAA 


AAGCCTTTAC 


ATGGTATAAT 


AGAGGTAGCT 


4500 


CTTTAATGGA 


GGTGTTTGAG 


TGGAAAATCT 


GAAGAAAATG 


GCAGGTATCA 


CGGCTGCTGA 


4560 


ATTTATCAAG 


GATGGGATGG 


TTGTAGGGCT 


AGGAACAGGT 


TCTACTGCCT 


ATTATTTTGT 


4620 


CGAAGAAATC 


GGTCGTCGAA 


TCAAGGAAGA 


AGGCTTGCAG 


ATTACAGCTG 


TGACGACTTC 


4680 



WO 98/18931 



PCT/US97/19588 



885 

TAGTGTGACC AGTAAACAGG CTGAAGGGCT CAATATCCCG CTCAAGTCTA TTGACCAAGT 4740 

AGACTTTGTC GATGTGACAG TCGACGGGGC GGATGAAGTG GATAGTCACT TTAATGGAAT 4 800 

CAAAGGCGGT GGTGGTGCCC TTCTCATGGA AAAGGTCGTC GCAACACCAT CAAAAGAATA 4860 

CATTTGGGTG GTGGATGAAA GCAAGCTGGT CGAAAAACTA GGTGCTTTTA AATTGCCAGT 4 920 

AGAAGTGGTT CAGTATGGTG CAGAGCAGGT CTTTCGTCAT TTTGAACGAG CTGGCTACAA 4 980 

ACCAAGTTTC CGTGAAAAAG ACGGCCAACG TTTTGTGACC GATATGCAGA ATTTTATCAT S040 

TGACCTCGCC TTGGATGTCA TTGAAAATCC AATTGCTTTT GGACAAGAAT TGGACCATGT 5100 

CGTTGGTGTT GTGGAGCATG GTTTATTCAA CCAAATGGTG GATAAGGTAA TCGTTGCTGG 5160 

ACGAGATGGA GTTCAGATTT CAACTTCAAA AAAAGGAAAA TAGAAGGGGG CATAAGATGT 5220 

CTAAATTTAA TCGTATTCAT TTGGTGGTAC TGGATTCTGT AGGAATCGGT GCAGCACCAG 52 80 

ATGCTAATAA CTTTGTCAAT GCAGGGGTTC CAGATGGAGC TTCTGACACA CTGGGACACA 5340 

TTTCAAAAAC AGTTGGTTTG AATGTCCCAA ACATGGCTAA AATAGGTCTT GGAAATATTC 5400 

CTCGTGAAAC TCCTCTTAAG ACTGTAGCAG CTGAAAGCAA TCCAACTGGA TATGCAACAA 5460 

AATTAGAGGA AGTATCTCTT GGTAAGGATA CTATGACTGG ACACTGGGAA ATCATGGGAC 5520 

TCAACATTAC TGAGCCTTTC GATACTTTCT GGAACGGATT CCCAGAAGAA ATCCTGACAA 5580 

AAATCGAAGA ATTCTCAGGA CGCAAGGTTA TTCGTGAAGC CAACAAACCT TATTCAGGAA 5640 

CGGCTGTTAT CTATGATTTT GG ACCACGTC AGATGGAAAC TGGAGAGTTG ATTATCTATA 5700 

CTTCAGCTGA CCCTGTTTTG CAGATTGCTG CCCACGAAGA CATTATTCCT TTGGATGAAT 5760 

TGTACCGTAT CTGTGAATAC GCTCGTTCGA TTACCCTTGA GCGTCCTGCC CTTCTTGGTC 5820 

GCATCATTGC TCGCCCTTAT GTAGGTGAAC CAGGTAACTT CACTCGTACG GCAAACCGTC 5880 

GTGACTTGGC TGTATCTCCA TTTTTCCCAA CTGTTTTGGA TAAATTGAAT GAGGCTGGTA 5940 

TCGATACTTA TGCTGTGGGT AAAATCAACG ATATCTTTAA CGGTGCTGGT ATCAACCATG 6000 

ACATGGGTCA CAACAAGTCA AATAGTCATG GAATTGATAC ACTATTGAAG ACTATGGGAC 6060 

TTGCTGAGTT TGAAAAAGGA TTCTCATTCA CAAACCTAGT TGACTTTGAT GCCCTTTACG 6120 

GCCATCGTCG TAATGCTCAC GGTTACCGTG ATTGCTTGCA TGAGTTTGAT GAACGCTTAC 6180 

CTGAAATTAT CGCAGCTATG AGAGAGAATG ACCTTCTCTT GATTACTGCG GACCATGGAA 6240 

ATGACCCAAC GTATGCAGGA ACGGATCACA CTCGGGAATA TATTCCATTG TTGGCCTATA 6300 

GCCCTGCCTT TAAAGGAAAT GGTCTCATTC CAGTAGGACA TTTTGCAGAT ATTTCAGCGA 6360 

CTGTTGCCGA TAACTTTGGT GTGGAAACTG CTATGATTGG GGAAAGTTTC TTAGATAAAT 6420 
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TGGTATAAGA TGACGCGCTA TGCTTTGCTG GTGAGAGGTA TCAATGTTGG TGGTAAGAAT 6480 

AAGGTCCTCA TGGCGGAGCT TCGTCAAGAA TTGACAAACT TGGGACTGGA AAAGGTTGAG 6540 

AGCTACATCA ATAGTGGCAA TATTTTCTTT ACTTCGATAG ATTCCAAAGC CCAATTGGTT 6600 

GAAAAGCTAG AGACTTTCTT TGCAGTCCAT TATCCATTTA TTCAGAGCTT TTCTTTACTG 6660 

AGTCTAGAGG ACTTTGAGGC GGAACTTGAA AATCTACCAG CTTGGTGGAG CAGAGACTTG 6720 

GCACGAAAAG ATTTTCTCTT TTACACTGAG GGTTTGGATG TGGACCAAGT CATCGCGACA 6780 

GTTGAAAGTT TAGAGCTGAA AGATGAAGTG CTTTATTTTG GAAAACTTGG GATTTTCTGG 6840 

GGGAAATTTT CTGAAGAATC CTATTCTAAG ACTGCCTATC ATAAGTACTT GCTGAAGGTG 6900 

CCTTTCTACC GCCACATTAC TATTCGTAAT GCTAAAACCT TTGACAAAAT TGGTCAAATG 6960 

CTAAAAAAAT AATAAAGGAG ACACACAATG ACATTTTTAA ACAAAATCCA TGAAACTGCT 7020 

ACTTTCCTGA AAGAAAAGGG AATTGCAGCC CCTGAGTTCG GTCTAATCCT TGGATCAGGA 7080 

CTTGGAGAAT TGGCAGAAGA AATCGAAAAT CCAGTTGTAG TAGACTATGC TGAGATTCCA 7140 

AACTGGGGCC GTTCAACAGT AGTCGGTCAT GCTGGTAAAT TGGTATATGG TGAACTGGCA 7200 

GGTCGCAAGG TCTTGGCTCT TCAAGGGCCT TTCCATTTCT ATGAAGGGAA TCCTCTGGAA 7260 

GTGGTGACTT TCCCAGTTCG TGTGATGAAA GTTCTTGGAT GTGAAGCTGT TATTGTAACC 7320 

AATGCAGCTG GCGGTATCGG ATTTGGTCCT GGTACCTTGA TGCCTATCTC AGACCATATC 7380 

AACATGACGG GGCAAAATCC ATTGATGGGT GAAAACTTGG ATGACTTTGG CCCACGTTTC 7440 

CCAGATATCT CTAGGGCCTA CACACCAGAA TACCGTGCCA CTGCCCATGA AGTGGCTAAA 7500 

AAACTTAATA TCAAGCTTGA TGAAGGTGTC TATATCGGAG TTACTGCTCC GACTTATGAA 7560 

ACACCAGCAG AAATTCGTTC CTATAAGACA CTGGGAGCAG ATGCAGTTGG TATGTCTACG 7620 

GTTCCTGAAG TTATCGTGGC AGCCCACTCT GGCTTGAAAG TTCTGGGAAT TTCATGTATC 7680 

ACTAACTTTG CGGCCGGTTT CCAAGAAGAA CTCAATCACG AAGAACTTGT AGAAGTGACT 7740 

GAACGTGTTA AAGGTGATTT CAAAGGCTTG CTTAAAGCGA TTCTTGCTGA ATTGTAAGAA 7800 

AAAAGATTTA AAAGGGGGAG TGCCTCTGTT TTTTCAGGAT TGACTGCCTA TCCGGATTAA 7860 

AGAAGAAACA GAGGAATACT ATGAGCTTCT TCCTGCTCTT ATAACTGAAA GAAGCGGAAG 7920 

AATAGGTATG TCTGATCTGA TAGCCAGCAT TGTGAAAGAC AAGATTCTAG GATACTAGCA 7980 

TTAGCTTCCT AGCCAAGCAG ACTAGTATGA TAAGGAGAGA TGAGAATGAA TTGACTTTCT 8040 

GAATTTCTCA GTCTTATCAT ATATAGCACA ATGAGATTTC GCTTGAGTCT CCTTGTAAAT B100 

AAACGAAAAG AAAGATAAGA AATAATGAAA ATTGGTCAAC GAATTATGCG CTTTGGCATA 8160 

AAAAATTAAG TATCGGAGTT GTATCTGTTG TAGTCGGCTT TGATTTCTAG CTCCAGCTGG 8220 
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AATTTCAGCC AATGAAGTAA AGCAAGATGT AACATCTGAA GTGGTAATAG GTGTGCTAGA 8280 

TTCTAAGGAG GAATTGAAAG AGTCAGAAAA TGATGCTCCA AAACTAGAAA CTCCTCTTAG 8340 

AGAGGAGCCA AGACTAGCTC CTCAAACGCT TCCGGAAGCA AGTGAAGTTC TTGAAAACAA 8400 

AAGGGAAGAG TCAAAAGTAG AGATAACATA ACCAGCTCAA GCGGATGATA TCCGCAAGGT 8460 

TGTTGGGGAA TTAGCCAAGG ATATAAGTAT TACTAAGTTG TATATGACAG GTCATTCTCT 8520 

TGGATGTTAC CTAGCTCAGA TTGCAGCGGT TGAAGCTTAC CAAAAATATC CTGATTTTTA 8580 

TAACCATGTA TTGAGGAAAG TGACAACTTT CAGTGCTCCT AAAGTGATTA CTTCCAGAAC 8640 

TGTTTGGAAT GCTAAGAATG GTTTCTGGGA TGTTGGTTTG GAAAGTCGTA AATTAGCTGT 8700 

TAGTGGAAAA ATTAAGCATT ATGTGGTTGA TAATGACAAT CTTGTGACTC CCTTGATTCA 8760 

TAATAATCGT GATATTGTTA CATTTACAGG TAATTCACGC TTTAAACACC GTTCTCGTGG 8820 

CTATTTTGAA AGTCCAATGA ATGATATTCC TAACTTTAAT ATTGGTAAAC AAGCTACCTT 8880 

GGATAAACAT GGTTATCGTG ATCCGAAATT GGATAAAGTG CGATTCTTTA AGAAACAGCC 8940 

TCTGCCTCGA TCTTCTAGTC AACCAAGCGC TGAACCAATG GAAAATATTG CCTCAGGAAA 9000 

ACAGGTTACT CAAAGTTCGA CAGCTTTCGG AGGAGATGCT AGAAGAGCTG TGGATGGCAA 9060 

AGTCGATGGT AACTATGGTC ACAATTCTGT CACTCATACA AACTTCCAAT CTAAGCCTTG 9120 

GTGGCAAGTA GATTTGGCTA AAGAAGAAAC CATTCGCCAA ATCAATATTT ACAACCGAAC 9180 

AGACACTGCC CAGGATAGAT TGGCAAACTT TGATGTCATT CTTTTAGACA CTTCTGGTAA 9240 

AGAAATTGAG TGAAAACGTA TAACATCTCC TAAACATGTG TCAGCACAAA TTACGATTAA 9300 

CCATAAAAAA GCGCGCTATG TTCGGATTGA GCTAGAAGGC TATAATCCCC TCAGTCTTGC 9360 

AGAAGTTGAA GTTTTCTGCT TTATAGCTAC GAATGCTGAA ACGGCGACAC AAGTTTCTAA 9420 

GCCAGTTCAA CCAATCAGTC AGACTCCTGT GAAGGATAAA ACATTGACAA TTCAACACAG 9480 

TGGAGCTTAC ATTGCCCGCT ACTCCATAAC TTGGGAAGAA GTTCCAGTAG ATAAAGATGG 9540 

AAACCAAGTT GTTCGTAGTC ATTCTTGGGA AGGAAGCGGT CGCAACCAGA CTGCAGGTTT 9600 

TGTCCTCAAC CTCCCAATCA AAGAAAATAT GAGAAATCTG CGAGTTAAGA TTGAGAAAAA 9660 

GACGGGCCTA CTATGGAATA GATGGCAAAC AATCTATGAA AACAGACCAA TTTTAGCTCA 9720 

ACCCCACCGT AAAATTACCC ATTGGGCTAC GACATTGAAT TCCAAGGTGA GTGACGATGA 9780 

TGTCTTGTAA TCTGATGGTA GAATGACAGT TAGTTTGTCT AGTTTATAAG AAAGTACTAC 9840 

CTGAGCTTGA ATAGGACTCA GGTAGCTCTC TATGAAAGAA CAAAATTAAT ACTCAATGAA 9900 

AATCAAAGAG CAAACTAAGA AACTAGCCGC AGGTTGCTCA AAGCACTGCT TTGAGGTTGT 9960 
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AGATAAGACT 


GACGAAGTCA 


GTCACATATA TAATCCAAGG 


CGACGTTCAC 


rtTY^fnTTVI A A 

W 1 ill Uf^n 


10020 

JL \J \J m \J 


GAGATTTTCG 


AAGAGTATAA 


ACAGAAAGGT 


AGAGCGCGTG 


TTCTAATTTfi 






GAAAACTTTT 


CTAAAAACAA 


AAACGAAAGG 


ATGGGTAAAC 


TGTATTCGCT 


CAACIY5AATA 




CGGGCGACTC 


TCCTCTAAAT 


CAAAATTAAG 


AAAGGAATTG 


ACCCCACCCT 


AAAAGTAGTC 


10200 

X V/ £ v V 


GGAAAAAGAT 


AGTTGATCTA 


GCGAGCATCG 


CTCACTGCGC 


CCAACTCCTA 


1 1 1 1L i 1L 




GCTTTTTGAT 


GGGTTTGGTA 


TCTTTCTCAA 


TATAAAATAT 


AAAAT A A AfT & 


A Af3P/P Art ACf* 


1UJ4U 


GTGTGTTTTG 


ATTTGAACAC 


GAGCGGAAAA 


CTCGGAAAAT 


ACATAATCTG 


APTYtAAAAAT 


1 01 AD 
1U JOv 


CAGGATTTCT 


CGTCAGGTTC 


CTAATTTTCA 


GTCGTTTTCT 


Iwl V_ VJ\^ 1 V. i 1 


iu l ni \_A4 AA 


1 Ail j! A 


ATTATGTCTA 


TCCATATTGC 


TGCTCAGCAG 


GGTGAAATTG 


nYSATAA A AT 


1 1- I Iv I 1 tV. 1 


1 ACAA 


GGGGATCCTC 


TTCGTGCTAA 


GTTTATTGCG 


GAGAATTTCC 


TTfSATfiATfV 
• t wi 4 on i ov. 


lu 1 1 1\j 1111 


1 C\^fif\ 
iwjOU 


AACGAAGTGC 


GTAACATGTT 


TGGTTACACT 


GGTACTTACA 




Iw 4 A 1 V- 1\» i ^ 




ATGGGAACTG 


GGATGGGAAT 


GCCATCTATT 


TCGATTTATG 


l.i*4_ijl\?AL»i r 


AATt-liTAGAC 


lObBO 


TACGGTGTGA 


AGAAATTGAT 


TCGTGTGGGA 


ACTGCAGCTT 


will U AA 1 un 


AVinw 1 AX. A 1 


1 A*7 A A 
1 U / H U 


GTTCGTGAAT 


TAGTTTTGGC 


GCAGGCGGCT 


GCAACCAACT 


LAAAUt 1 v. 1* i 


TLUTAn ITjAW 


lUoOU 


TGGCCACAGT 


ACGATTTTCC 


ACAAATTGCT 


ACCTTTGATT 


luL 1 iwV 1 AA 


AL»CL. 1 ACl—A I 




ATCGCCAAAA 


AACTTGGTAT 


GACTACTCAC 


GTTGGGAACG 


I 1 1 i V* 4 l>/l 4 V* 


4 un lu 1\ ill 




TACTCAAATT 


ACTTTGAAAA 


GAATATCGAG 


CTTCGTAAAT 


(ZCinC.Ji.ClTC A A 

UuuunU 1 Iv'v^ 


WiV. 1 \j 1 VKj AA 


1 AQRA 


ATGGAAGCAG 


CAGCTCTTTA 


CTATCTTGCT 


GCCCAATACC 




<jL i A i A 4 V. 


1 1 A/1 A 


ATGACCATCT 


CTGATAGCTT 


GGTCAATCCA 


GACGAAGACA 


p a Ar*rr:f* a^*i a 


AV9AnV.u 1 lw AA 


lit nn 


AATACCTTCA 


CTGATATGAT 


GAAGGTTGGT 


TTGGAAACCT 


Tfl 1TTY1T Art A 


A T A AT*t* A 1* A f 
A 4 AA 1 1A1 Avj 


1 X i DU 


CCAAAAAGGG 


GCTCTTTGTC 


AACTGTAGTG 


GGTTGAAAAA 


AAGCTAAfiCT 


THA^AA A<V1A 




CAAATTTCGT 


CCTTTCTTTT 


TTGATATTCA 


GGGCGATAAA 




4 4 \X#v\U 4 4 14 


1 1 5ft A 


CAAAGTTCCG 


AAAACCAAAG 


GCATTGCGCT 


TGATAAGTTT 


GATGAGATTA 


TTGGTCGCTT 


11340 


CCAGTTTGGC 


ATTAGAATAG 


TGTAGTTGAA 


GGGCGTTGAC 


GATTTTCTCT 


TTGTTCTTTA 


11400 


GAAAGGTTTT 


AAAGACAGTC 


TGAAAAAGAG 


GATGAACCTG 


CTTCAGATTG 


TCCTCAATCA 


11460 


GTCCGAAAAA 


TTTCTCAGGG 


TCTTTGTTCT 


GAAAGTGAAA 


AAGTAAGAGT 


TGATAGATCT 


11520 


GATAGTGGTG 


TTTCAAGTCT 


TCTGAATAGC 


TTAAAATCTT 


GTCAAGAATT 


TCTTTATTTG 


11580 


TTAAGTGCAT 


GCGAAAAGTA 


GGGCGATAAA 


AACGTTTATC 


GCTsArTTTA 


CGACTATCCT 


11640 


GTTGGATGAG 


TTTCCAGTAA 


CGCTTGATAG 


CCTTGTATTC 


ATGAGATTTT 


CGTTCAAACT 


11700 


GATTCATAAT 


TTGAACACGA 


AAACGACTCA TGGCACGGCT 


GAGATGTTGG 


ATAATATGGA 


11760 
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AACGATCTAG AACGATTTTA GCACACGGAA AAAGCTGTTT AGCCAAGTCA TAGTAAGGAC 11820 

TAAACATATC CATCGTAATG ATTTTCACTT GACAACGAAC GCCTCTATCG TAGCGAAGAA 11880 

AGTGATTTCG GATGACAGCT TGTGTTCTGC CTTCAAGAAC AGTGATAATA TTAAGATTAT 11940 

CAAAATCTTG CGCAATGAAA CTCATCTTTC CCTTAGTGAA GGCATACTCA TCCCAAGACA 12000 

TAATCTTTGG AAGCCGAGAA AAATCATGCT CAAAGTGAAA GTCATTGAGC TTGCGAATGA 12060 

CAGTTGAAGT TGAAATGGCC AGCTGATGGG CAATATCAGT CATAGAAATT TTTTCAATTA 12120 

ACTTTTGAGC AATTTTTTGG TTGATGATAC GAGGGATTTG GTGATTTTTC TTTACCAGGG 12180 

GAGTCTCAGC AACCATCATT TTTGAAsAGT GATAGCACTT GAAACGGCGT TTTCTAAGGA 12240 

GAATTCTAGA AGGCATACCA GTTGTTTCGA GGTAAGGGAT CTTAGACGGT TTTTGAAAGT 12300 

CAT rTTTCTT CATTAGACTT CCACAATCAG GGCAAGATGG AGCCTCATAA TCCAGCTTAG 12360 

CGATAATTTC TTTGTGGGTA TCCATATTGA TGATATCTAG AATCTTGATG TTTGGGTCTT 12420 

TAATATCGAG CAGTTTTGTG ATAAAATGTA ATTGTTCCAT ATGATTCTTT CTAATGAGTT 12480 

GTTTTGTCGC TTTTCATTAT AGGTCATATG GGACTTTTTT TCTACACAAA AATAGGCTCC 12540 

ATAATATCTA TAGTGGATTT ACCCACTACA AATATTATAG AGCCCAAAAA GGAAGCCCTT 12600 

TATGAATTGT AGGACTTCCT TTTCTTATCC AGAAATTGAT CTAGCTCTCT CTGATTTCGA 12660 

AGAATAGTGA CTTTATGTGA ATATTCTTGG CAAAGTTTTT GGTAATTTTC TTTTTGAGTT 12720 

TTGCGGACGC CCATCCCAAA GAATCCATCT GATAAACTCC CACTCAAAGC GTTCAGGGCA 12780 

ATCTACCGCC ATACTTTCTC TGACTTTTCC ACGGTATTTA AGATAACGCT TAAAGGCTCT 12840 

AAAGAGACAG GTCAATGCCG AAAAATTGAG AAAGATGATT TGGTCAGCTT CTTGCATTCG 12900 

TTCTTGGTAG TAGCACCAAG AATAATTACC ATCGATGACC CAAGCTTTAT GCTTGGTGAG 12960 

AAA G TTTTTT ATCTCGGTTA ACATCCATTC GCAGTCACTG TCTTGCCAAC CAGGTTGAAA 13020 

TTGGAGTGTG TCCATGTGCA GTTTTGGAAT GGAGTACTAC TTAGATAACT TTTCTGCTAT 13080 

AGTTGACTTA CCAGAACCAG AATATCCGAT AATTGCGATT TTCATTTTCT ACCTTTTCCT 13140 

ATTTGGAGAC AAAAAAACAG CCTCTATGGA CTGTTTCTTA TTTAACAAGT TTAGCTGAAA 13200 

GACGAGCTTT ATCGCGGCTT GCTTTGTTTT TGTGAATCAA ACCTTTAGTT TCTGCTTTAT 13260 

CGATAGCTGA GCTAGCAGCA CGGAAAAGTT CTTCAGATGG GTTTGCTTCG AAAGCTTTTA 13320 

TAGCAGTACG CATAGCTGAT TTTTGAGCTG AGTTCTTTTC GATTCGTCTA ACGTTCAATT 13380 

CAGCGCGTTT GATAGCTGAT TTAATGTTTG CCAATGGTCT TACCTCCATA TTTACTAACT 13440 
(2) INFORKATION FOR SEQ ID NO: 129: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8512 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 129: 



CCnTTTTCA 


AAAACTAGAT 


ACTAGTCTAT 


CAAAAGTAGG 


AAAGGGTTTC 


AAGAAAATTG 


60 


ATTGGAAATT 


TTTTGAAAAT 


CATAGAACTA 


TTAGCTAATC 


CCTAGTATTG 


AAAAGACTGG 


120 


ATAGCTTCTT 


TCAGGTCATC 


TTGTAAACTA 


TTTCTCTGGT 


CAAGTTGGAC 


ATAGACTTCC 


180 


ACCAGACAGG 


ATCTAAAGTT 


GGAAAATTTG 


TAAAAATCCT 


CCCTTTCTTC 


TATCGGAAAA 


240 


TCAACAGTTT 


TTATCCAAGA 


AGCTACTTGT 


TCTTGCTCCA 


ACTTCCCTTG 


TAAAATAGGT 


300 


TCATAGATCA 


CTCTTGCTAA 


ACGCCAATCC 


TCATCATCTG 


TAAAGCGAAT 


CGACATTCTT 


360 


TTAAATAGTT 


GGCCAAGTAT 


ATCAAATACT 


TCATGAACTC 


TGTTTTTAGG 


AAAGTCTGGA 


420 


TGACAAACCA 


CCTCTGTCAG 


TAAATCGGCT 


CCATGTGCAA 


AACCGTGAAC 


CCAACCATAC 


480 


TGACTTGAGA 


AACCCCTTGT 


ATCCTTTTCT 


TTTGAAAGAT 


AGTGCAAGCC 


TTGATTTAAA 


540 


AGGACATTAC 


GAATTTCTGG 


AGAAGGATTT 


CCCAAATGAT 


CAAACAACCA 


CTGGATTTCT 


600 


TCCTGGTTAT 


AATTTGGTTT 


TTCTTCTGCT 


ATTTTTCTTA 


GTAAATCTTG 


ATACATGGTC 


660 


AATACCTCTA 


CATTTCTAGC 


AACTGTTCAA 


AAAGCCAGTC 


TTAAATGACT 


CAATATTGAA 


720 


TTCTCAATTA 


AATACAATCT 


GATATAAAAT 


GACGTAAATA 


ACTATCAATA 


CCAGTTCTAC 


780 


AGTAAGTTCA 


AATTTAACAT 


CACGACCTTC 


AACGACATTT 


TTGAAAATAG 


CTACAACTAA 


840 


GACAAATAGA 


ATGACGCTTA 


ACAAGCCCAT 


AAACATCATT 


CTAAAAAATT 


TTTCTATTCC 


900 


CCTACTCTCC 


CAACTCAGCA 


CTATAGGAGA 


TAATCTGGTC 


AACTGTGTCA 


GACAAGAATT 


960 


GGATGGTATC 


ACGGAGTGGT 


TTGTCTGTTG 


AAATATCAGC 


ACCGATAATC 


ATGGCTGACT 


1020 


CAAGTGGTGT 


CTTGCTACCA 


CCTGATTTGA 


GGAGATTGAG 


CCAGTCTTCA 


GCTCCAGTTT 


1080 


CAGAATGTTT 


TAGATGAAGG 


TAACCAGCAG 


TCGAGATAAC 


TAGTCCTGCT 


GAGTAAGTGT 


1140 


AACTATACAA 


GCCCATATAG 


TAGTGAGCTT 


GGCGCATCCA 


AGTCAGAGTT 


GCATCATCGT 


1200 


CAATTTCAAT 


AGCATCTCCC 


CAGAAATCCG 


TCAAAACTTC 


CTTCATAATG 


CTGTTGAGCT 


1260 


TGCTTGCTCC 


AAAGGTCTCC 


CCTTCTTCAA 


TCAATGTATA 


CACCTTACGC 


TGGAAGGCGG 


1320 


CTTCCAAGAG 


GTGGGTGATA 


AAGTTATGGA 


AGTAGGTGTC 


TGTCAAGCGA 


TGAGCCAGAG 


1380 


CGAAGCGTTT 


TTGACGTGGG 


TCATTAGACT 


GGTTCTCCAA 


GTAATCACTG 


AGTAGCAATT 


1440 


CATTGAAGGT 


TGACGGTGCT 


TCAACATAGT 


AGGTCGACAT 


ATGGGCATTG 


AAGTAACTTT 


1500 
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GATCATTCTC 


TGAAAAGATG 


AATTGACCAG 


AATGCCCGAT 


TTCATGAATC 


AAGGTATAGA 


1560 


CATCGCTCAA 


ACGGCCTGTC 


CAGCTCATGA 


GTACATAAGG 


GTGTACGCGA 


TATGGGTCCG 


1620 


CCGCATAACC 


ACCGGAATCC 


TTGCCACTGT 


TAGCAGCAAA 


GTCCACCCAG 


CGCTCTTCTT 


1680 


GGTAACCAGC 


AACTTCCTGA 


CAATATTCTT 


GCCCCAAAGG 


TTCTACCGAC 


TTCATGACCA 

* 


1740 


AATCATAGGC 


ATCGTCAATA 


GTCACTTCAG 


GATTCAGGGC 


GCTGTCCAAG 


TCCAATTTCC 


1800 


AGTCTGCAAA 


GGTCATCTTT 


TCAAGACCAT 


TTACCTTCGC 


AACATGCTTG 


AGGTATCTCT 


1860 


GAGCGACTGG 


TGCAAAGTCC 


TTCATGATGA 


GGTCAATCTG 


GCGGTCAAAC 


ATGACACGGT 


1920 


CCACTTCTTG 


TTCAGCTAGA 


AGATAGTCAA 


AGACAGAGTC 


GTATCCCTTC 


ATATCAGCCA 


1980 


AGAGTTTTTC 


AGACTTGACC 


TGAGCCAGAT 


AGGCTCCTGC 


AGCCGTATTT 


TGGTGCTTAC 


2040 


GAAGTCCCTC 


TGAGAAGGAA 


CGGAAGGATT 


TCTCACGAAC 


CTCAGCATCC 


TCATGGTTTT 


2100 


GGTAGAAATT 


CTCATAGGTC 


ACAAAGCTGT 


TTTTGTAGGT 


CTTGCCATGG 


GCTTCAAAGT 


2160 


CAGCCATTTC 


AAAATCCCCA 


GCTCGCATCT 


TAGTATAAAT 


GTCCTGCGGA 


CTGTAGAAAA 


2220 


CTTCACCGAG 


ATTTGTCAAG 


GCCTTCTCCA 


CATCTGCCCC 


TAAGTAGTGG 


GCTTTTTTGA 


2280 


TTTTAGCCTG 


ACGAATGGCA 


GCTGTTAAAT 


GTGGCAATTT 


ACCCAAACGG 


TCCAAGACTT 


2340 


CCTCATCTGC 


TGCCACCAAG 


GCATCGTCAA 


AGAAGGTCAA 


GGCTACGCTG 


GCATCTGTTT 


2400 


CAAATTCCAT 


CCCAGCTTGG 


GCAATATTGG 


CAAATTCGTC 


ATTGCTATAG 


TCCGTCGTCT 


2460 


GAGGCATAAA 


ACCATAGTTG 


CCAATATCGC 


TCATCTGAAT 


GTAGATCTGT 


TCCAATTCCG 


2520 


CAAAGGCCTT 


CTCGAAATCC 


TCAAAAGTGT 


GAAGATTGCC 


CTTGTAATCA 


CGGCTAAACT 


2580 


GGTTGATGTC 


TTCGCGAGCT 


TTCTCGATTG 


CACGCAAGAA 


ATCCTCACGC 


TCTTGGTATA 


2640 


GGGCTGTTAA 


GTCCCAGAGT 


TCCTTCTCTG 


GAAATTCTGA 


ACGGTGTTTT 


TGTTCCATTT 


2700 


TCTTCCTCTT 


ATTTCTCTAA 


TTCTACTAAA 


ACACTAAGGG 


CTGATAAAGC 


GTAAAGCGGT 


2760 


GCTGTTTCTG 


CTCGCAAAAT 


ACGAGGACCT 


AGGCCTGCCA 


AAACGGCTCC 


TTTAGCTTCA 


2820 


AAACTTTCGA 


TTTCTGCAGG 


TGAGAGACCG 


CCTTCTGGAC 


CAAAGATAAA 


GAGCAGTTTG 


2880 


GCTCCTGTTT 


CAAGACCAGT 


GACTGCTTGC 


AGAAGCGCAG 


CGGCTTCTCC 


TTCTTTAGCT 


2940 


GATTCTTCAT 


AGGCTACTAT 


GATAGAGTCA 


AACTGGTCCA 


GCTGAGCTAG 


AAAATCTGCT 


3000 


TTTTTCTCGA 


AAAGTTTAAT 


ACTTGGTACA 


ATATTACGCT 


TGCTTTGCTC 


GGCTGCTCCA 


3060 


AGGGCAATTT 


TTTCTAGTTT 


TTCAACTTTT 


TTACCCAATT 


TCTTGCCATC 


CCACTTGGCA 


3120 


ACTGACCAGT 


CTGCAGGAAA 


GGCCCAGATT 


TGGCTAGCCC 


CCAGTTCGGT 


TACTTTTTGA 


3180 


GCGATGAACT 


CCAGCTTGTC 


TCCCTTGGGA 


AATCCAGATG 


CGATGGTCAC 


TTGGACTGGT 


3240 
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AGTTCCACAT 


TGTCATTTAA TTCTTGGACC 


AACTCAAACT 






J JUU 


ACGCGCGCCA 


AGCGCTTGAT 


1 GCCATCATCA 


AAGACTAAGfi 


. TAAr£*Tf\ATr 


1a. i ivn re 


J JoU 


AAGCGCATAA 


CCTGAAACAT 


ATGCTTACTG 


GTTTCCTTGT 

w * A * ^ \» A A w A 


PCTPHATAlVP 

^* A \* \9£\ A w\Kj A 


uAL-AtAiAoAtj 




ATAGCACTGC 


CTTTTACAAA 


ATACTGCTGC 


ATGCTAGCCT 


CCAATCAPAP 


LnuAvrn J A 1\- 




CTTGGTTTTC 


TTAAAGACAC 


AGGTATTCCA 


TTCCCCTTGA 


nv.^nlv> 1 




1 C A f\ 


AAATCCAGCT 


GACTCAGCCC 


ACTGGCGCAC 


CATGTCCAAr 


TTflTrPTTr ft 
1 ivilV>Ll I (jA 


<t*R ft 1VVV ft /"•*!» 

TAATviC CACT 


J 600 


CATGATCAGG 


TAGCCTTCAT 


CCTTTACCAA 


GCGATAAGCA 


TPC? TPT*A TT ft 


r* & TV ft ft TT* B/^" 

on IvjAA I VjAVj 


J oou 


GATATCCGCC 


AAGATATTAG 


CCACAATCAC 




iTTTP^* ft f ft 


CL I 1 AAGCAA 


3720 


ATCTCCAGCC 


GCTACATGGA 


TATTTTCCAT 




A PfTY* ft imiini 
nUL 1\AA 1 A J 


TTTCCTGAGC 


3780 


CACACGAACC 


GCCACATCAT 


CCAGGTCATA 


GGCGAAAATT 




ft /** ft ft f+f*r+ ft 
CL.A(aAA(iCGA 


3840 


GCTGGCAATA 


GAGAGAACCC 


CTGAACCAGT 


cccr A TfT 


n^LALLb 111 


CGCCACCACG 


3900 


AAGAACCTGT 


TCCAAGGCAA 


AAAGGCTCAT 


f '^T^l^^^l & f "J '1^*1 * 

V» i 1 v>0 1 /tvj 1 1 




CAGTACCAAA 


3960 


AGCCATGCCA 


GGATCCAGCT 


TGATAATCAT 




k» i HJTv- AT 


AGTvTTGTCCA 


4020 


AGAGGGAACG 


ATGGTCAAAT 


CATGAGTGAT 




T\_ATAGTATT 


TCTTCCAGTT 


4080 


GTCTGCCCAG 


TCTTCCTCAG 


CCAAGGCAGT 




111 AAL. IaJvU 


CCAAATCCAT 


4140 


AAAATCTGTC 


AATTCTGCTA 


GACGAGCCTG 


p aa atppc ire 


1 1— AAi_i_Ain\j 


TCACATCCAC 


4200 


CGTGTCAGGG 


TAGTAGGCTG 


TCACTACGAT 




I vA, I (_ L. AL CT 


CTGGGAAAAT 


4260 


CTCTCCAAAG 


CGGTCCACAT 


TTCCCACATA 




TCTTCGATTG 


CGACTCCTTG 


4320 


CGCTCCCAGC 


TCAATCAAGA 


GATTGGAAAC 






TCACTGTAAC 


4 3B0 


TTTTAACTCT 


TGCCATGTTT 


CCATTATTAA 




PT>» ft ft ft/^ft/»ft 


Ik X R ft ft ft ft 

AAACCAAAAT 


4440 


AGGAAATTCT 


CTGAAGACGC 


TTGTGTCTAA 


flAfiAAf^PTTA 




ACAGTGTTTA 


4500 


GGGCGGGTTC 


AGTTTAGAAA 


TGTAACTGAA 






/^pnvpm* ft ft *n ft 
t- 1 1 I IAAATA 


4 doO 


ATCTTTTAAT 


CTCTCTTGCA 


ACTGAGGCAC 


AACTTGACTG 


GAACTAAGAA 


ATTCCTCAAC 


4620 


ATTCATCAGC 


TGATAGCCCT 


GTCCTTCATC 


TCCGAAGATG 


ATATTGTCAA 


ATTGTTCTTG 


4680 


TCTTAGCTGA 


CCAACCATAA 


AGACCGATTT 


CTTGCCTTTA 


AAAATTACGC 


TAGGATAAAT 


4740 


CTTGCTCCAA 


AGCAGACAGT 


CTTCATCTAA 


ATGAATTCCC 


AGTTCCTCAT 


AAACTTCACG 


4800 


CCGAGCGCAT 


TCAAAAGGGC 


TTTCGTCCCC 


TTCACGGCCA 


CCACCTGGCA 


GTTCCCACAT 


4860 


ATTGGCCCAG 


GGAATACTTG 


CCTTATCATC 


GCGTAAGATA 


GTCAAAAGCT 


TATCCCCACA 


4920 


AAACAAAGCA ATCTTGCAAC 


CTGTGAAATC 


AGAAATTTCT 


AGTTCCATCT 


TCAGTTCCTT 


4980 


CTAACATTTC < 


CTTTTCCAGC 


TCGGCTAACC , 


AGTTTTCATA 


ATATCTTTTC 


TCATCCCTCA 


5040 
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ACATTCGACT ACTATCCATT TTCTGTCTAG CAATCTTGAG AGCCTTACGA GTTCGATCTA 5100 
CATCTTTCTT CACCTTTAAT TGATACCAGG CTTGTATCAC TTGAAGATTG GACAGTTTGA 5160 
GAGACAGAAA CGATTTGACC TGTCGAATAC TAGCATATTG CTCCGCTTGC TCAAAATCTC 5220 
CTTCCAACAA GGCGATATGA AGCAGGGATA GTTGGGCAAC TGTCTGCATC ATCGGAGTAG 5280 
TTGTCCTCTC AAGTAATGCT TGAAACTGCT GTTTAGCTAC TTCTTCCTTC CCTTCCAAAA 5340 
TGGAAACTTC ACCTTGCATA CCTAATACAC CATCCGCAAA ACTCCCTCGT GCATCCTCAG 5400 
GAACTGCTTG AACAAAGTCT TTCAAATCAT ATTCTTGAGG AGCTACCAAG GTCTGGGCAG 5460 
AATGTCTCAA TACCAGGTAG GCGTATTTGG TATTTTCAGG GTGTTGTAGT AATTCCCAAA 5520 
TTTTTGCTCC ATCGGTGATG TCGACTGGCA AAATGTTATT TAGGAAGAAA GATAAATTAA 5580 
GAAAAATCCA AGTCCCTGCA AAATACCAGC TTCTTGTCAA AAATCCAAAC AATATCGCCA 5640 
ATAATATCAA GCCGAGATGA ACCATCAAGC CTCCTGAAAG CATCAGGATG ATTCTTTGAT 5700 
CGCTTTCATC CTCTTTTAAA CCAATGTATT GAGCACCAAC ATTTTTCAGA ATGGCTGTTC 5760 
TACTAAGATG AAACCTGCCT GACTTTTTGG TCAAAATAAA ATGTCCTAAT CCAAAAGCCA 5820 
CCACCCGATA GCCTGTCAAG TAGCCACAAA AAGCATGACC CAGCTCATGA AGAATAAAGA 5880 

TTAAATACAT GCTTAGAAGA GCGAAGGCAT AACCAAAACT AAAGGCTAAA ACTGCGGAAT 5940 

ACCCCAACTC TGCAAATGCG ATTGTTCCAC AAGCAAAAGC TAGCATAATA AAGACAACAG 6000 

CTAGCACATA AACCAAATAA GTCCCAATTT TCTTCATAAC ACCTCCAACC AACTCCTAGT 6060 

ATCTTGGATA AGGATAAAAT TCTCCCTTTT CCAACCCAAT TTTTCCTTCT TCAAAGACTT 6120 

CTTGGTTCCA TTCCATGACA AATTCCTCTG CTTCTGGGTC TTCCAAAAAG TCCATGAGGA 6180 

CATCTAGCCC AACCTCAGCA GTATCTTTAA CGAAAAGCGC AAAATAAGCT AAAAATTCAC 6240 

GGGAAAATCC TTTTTTAGGC AGGTAAGGAA TAACAGTCAA ATAGTCTTCC TCATTGACTG 6300 

TTGACTTGGC AGGATTGTAG AAAAGGACCG CTTCCTCAAA AAGAATGTCA TCTGATGAAA 6360 

CCTCTCCGTC TTCATCCACC ATCTCCACAC CGCAGCATTT TGCGCTTCCA ATAGAAAACT 6420 

CACTTCTACC GCATGGTTGC GTTTGTCCCA GCTAATCTCA AAGTCAAAGG GAAAGTTCTT 6480 

GTCCAACTCT TCCTCTAAAA TATCTAAAAA TCCGTATGTT GCCATTTTGT CCTCTTTCTA 6540 

TGCGACTCTT TAATCGCCCC GATTGCTCGG AAATATGCTA AAATAGATAC TACCATCTTA 6600 

CCACAAAATT ATTTTATGTC CTAATTATAC CATATTACCT CATTTAAACC CTTGGTATCA 6660 

GTGATTTTCT TAAAAGTCTG ATTTCTTCAT TTCTCATAAA AATCAATATA AAAAGCCCTC 6720 

GAAAGGGCTA ATAAATCTAT AAAATCAATA GGCGAGTAAC TAGCACAAGT GGACGTGCTT 6780 



WO 98/18931 



PCIYUS97/19588 



894 



TTTTATTGAC 


TATTACCACG 


ATACCACGCT 


TAATCTTAGG 


CTTGAACTTT 

4 4 4V^%« A 4 4 


CTTATCTYiCA 

4^ 4 4 A ^ 4> \7v4~l 


6840 

W W v w 


ATAGCGTCTG 


TCAAAGTCTG 


AGAAAAGTTA 


AGCCCCATTT 


CTCGTCCCAA 


CTT ATfTTfJ CC 

4 4 «^ 4 ^ 4 V ^ W 


6900 


CATTTTGGTA 


TGGTCAAAGT 


CTTrTTAATG 


GGTTCCTGAC 


TTCCTAGGTA 

4 4 4*^* 4 r*w 4 *\ 


TTCTGATACA 

4 4 4> XdT44 4 4*^v44 


6960 


TCAACAGATA 


CCATAGAAAT 


AAAAGATTTA 


TCAAGGTCAT 


AGGTTGACAC 

■ 4 4 a^Mi f 4^»> 


GAAATCTTCA 


7020 


TCATCTTTAA 


AAGGATCATT 


ATCAATTAAA 


GACAAGCTAT 


TGATATCTGA 

4 X*44 4f 44 4 4 \#«4 


TGGCPGAGGT 

4 W 4 V44^ %J 4- 


7080 


AACTCTCCAT 


CACTCTCTAT 


CAAATCTGCA 


ACAGTTATCC 

■ 44*4 tW 4> 4 • 4 4 V* 4b 


CTAGCCACTC 


CGACCCCATA 

^\J*4»^ \*\#W1 X #■ 


7140 

/ 4lU 


GCCAAAGCCT 


CAGAAATCCC 


CTCTCCTTGT 

^4 4 44 4 44 ^* 4 4 44 4 


GTAGCTGAGT 

44 4 4 ■4* T 4* 4 4**4 44Jr 4 


ATTCAAAATC 

444 1 WAAVVtA \« 


TGGGAAATCG 




ACAAAATAAG 


TCGCTTCTGT 


TCCGTCTGTG 


TCGTCATAAT 

4 4« 44T 4 4* Mm 4 4444 A 


AAAATAAAGC 

**4W»4% 4 #WIVV 


TGGATACGTA 


/4VJU 


ACTAACATTT 


CACTACCTCC 


ATATCAAAAA 

* • 4 4 m 4 4»-4 44 44 4B 4a 4 


GCAGGGACTG 


AATTTTACAA 

run * a 4 a Aw/v\ 




' J^U 


TTTCTTATCC 


CTCTTTCAGT 

^4 m> 4 4 4> *mrm 4 


GTACTTATTC 

V** 4 « IV 4 4> ** 4 4 4* 


AGCTCACCAT 


GAAGGATTGT 


\xni nuu iv* 1 1 


7 1 no 

/ J 0 v 


TCCCCTTGCT 


TTTCCATTTT 

■ 4 4 4*4* • 4 4 4 m 4 


AATATGGGAG 

• ■44 4 • 4 4 4^4^4^4*4JT 


CCTTT ACCG C 




WPTfAAPPA 




TGGGCCGTAA 


GGAGTTTAAC 


CATCPCTTTT* 

4 V* 4 V»i * X 4 4 




fip A*ra Rf*nr*p 


ill 1 l-V. 1 


*7 Q fl 


GACAACACCA 


TTATAACACG 


TGTTACACGT 


ATTGTAAAGG 


IfiTYIATAPTT 


t\ I I f\ I 1 L 1 ni 




TATACATAAA 


AGCCCCTAGA 






Ai 1 1M1 IVAl 


B. W ^^^^^^ A 

l A I 1 X L L 




CTAATGAGTA 

^* A 4 m9 ft * A mm 


GTAAAAACTG 


PTTfTTTATP 

w A 4 W 4 4 4 ri 4 V 


GAGCAATTCA 




Afi t r* a a thv^t 


/ oau 


AAAAGTATCT 


CGATCTAAGA 


CAGATTGAGG 

VflWfl 4l 4 44T444jF*J» 


CGGAGTTGAA 


TGAATCATAf! 


fIAAPA^TV^f*f2 




TACTCTATAT 


TTTTTATCTC 

• AAA-* «4 «, 44 A W 


CAATTTTTAC 

*»*Vl 4) 4> * 4 4 4>W 


AAACTGATAC 


TCTTPfiAAAA 


TPAAATTrAA 


7 nnn 


ACCACGTCAA 

• •^4^4>#>^*^4 4> ^»44*4 


CGTCCCCTTA 


CCGTACTCAA 


^> 4 AvAVJ^v 1 O 


V»\»w\.> A n\J ill 


LLlnui 1 luC 


7 R £fl 
/ O DU 


TCTTTGATTT 


TCATTGAGTA 


TGATTAACTC 

a 4*"4 A 4 «hIw 4> 4w 


TCAAGTCTTC 


GAAATCAGCA 


TTTTCAAPAfi 

4 4 4 1 WUtknu 


707ft 

(74VI 


TTATTACAAG 


GAGGCGATTT 

4*44 44*4*»4»i 4J« 4 4> 4> 4 


ACPACTTCAA 

■ *«4 4 a 4 win 


AAACATCAAT 




1 1 1 At 1 1 1 


70 nn 


TTCAACCCAT 


TATT AGAATG 


AACTTCTTGG 

fVlV 4 4 W 4> 4 XJVJV 


TAAGCAAAAT 

4 AASIWWiA 4 


CAA(TPTTAf3A 


4 4 liwitvl l i 




TCGTACAAAT 


CTAAAATCTC 

4 4 4B 4* 4 4 4* 4 4* 


TTTTGGAGTA 


TCTTCCCGGA 


ARAAAAfiTTT 


1 VI W 1 W W 1 




GAAATAACTT 


GATCACTAAG 


AATCCAATGA 


CGAATTTGTT 


TTGTAAAAAT 


CAAAATTTCC 


8160 


TGACTTGGTA 


GTTCCATCAT 


TTCCATTGCT 


TATCACCTCT 


CTTTTCATTA 


TAGTTCATAC 


8220 


AATGACATTC 


AGCAATATTA 


TTTCTCAAGT 


CAGCACTTCC 


ACTTCTTTAG 


GCTCAACTAT 


8280 


CCTATTTTGA 


GCTTTAAGGA 


AAATCAAATC 


TCTCATGCTG 


ATACCTCTCC 


TCATTAAATT 


8340 


AAATAGTAAA 


AAAGATTCTA 


TCTCACTCCC 


TGATTATTAC 


AAAACCATTG 


AAATATCACA 


8400 


ACTAATAGGC 


TAGAATGGAC 


ATAGTAAGAT 


ATAGTAGATG 


AGTCATTCTA 


CTCAAATCCA 


8460 


CGTTAGAAAG 


GACTGCTATG 


CCAGACAATC 


TCGCCGTTCG 


CATGCGCCCn 


GG 


8S12 


(2) INFORMATION FOR SEQ ID NO: 130: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2869 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
ID) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

CTCGTTTCAA GGTTGAGTCT CTTGCAAATC TTGTTCGCGT TCTTCCTTTT GCCAAGGCAT 60 

CTCTCCCATG GTTGGTGCcA GCCATTGTTG GAATCTTGCT CTCATTGGTT CTACCAAACA 120 

AGCAAGAAAG CGATGTTTTT GAAATGGAAT AATCACTTAA ATCACTTTTG TAGCCAAGTC 180 

TACAGGAGTG ATTkTCTTTT TTTATCCGAT GATAAATGTG TTATAATAGG TAGCGAAAGA 240 

GGTGAAGAAA TGAATCAAAC AGTAGAATAT ATCAAAGAAC TGACAGCCAT TGCGtCGCCA 300 

ACAGGCTTTA CTCGTGAGAT TGCGGACTAT TTAGTCAAGA CTCTAGAAGG TTTTGGTTAC 360 

CAGCCGGTTC GCACATCCAA GGGCGGTGTC AATGTAACTA TTAAAGGTCA AAATGATGAG 420 

CAACATCGCT ATGTGACTGC CCATGTAGAT ACGCTTGCTG CTATTGTCCG TGCTGTCAAA 480 

CCAGACGGCC GTCTCAAAAT GCACCGTATC GGTGGCTTTC CTTGGAACAT GATTGAAGGA 540 

GAAAACTGTA CCATTCATGT GGCTAGCACA GGTGAAAAAG TATCAGGAAC CATCCTCATC 600 

CACCAAACTT CTTGCCATGT CTATAAGGAT GCAGGAACTG CAGAACGCAC GCAAGACAAT 660 

ATGGAAGTGC GTTTGGACGC CAAAGTAACT AGTGAAAAAG AAACTCGTGC TCTTGGCATT 720 

GAGGTCGGTG ATTTTATCAG TTTTGACCCA CGAACTGTCG TGACAGACAC AGGTTTTATC 780 

AAGTCTCGCC ATTTGGATGA CAAGGTCAGT GCGGCGATTT TGCTCAATCT CCTTCGCATT 840 

TATAAGGAAG AGAAGATTGA ATTGCCCGTA ACAACTCATT TTGCTTTTTC AGTCTTTGAA 900 

GAAGTGGGAC ACGGTGCAAA CTCTAACATT CCTGCTCAGG TAGTAGAATA TCTGCCTGTG 960 

GATATGGGAG CCATGGGAGA TGACCAGCAA ACAGACGAAT ATACAGTGTC TATCTGTGTC 1020 

AAGGATGCTT CTGGACCTTA TCACTATGAC TTCCGTCAAC ATTTGGTGGC TTTGGCGAAA 1080 

GAGCAAGATA TTCCATTTAA GCTGGATATC TATCCATTTT ATGGTTCGGA CGCTTCAGCG 1140 

GCTATGTCTG CAGGGGCAGA AGTCAAACAC GCCCTTCTCG GTGCTGGTAT AGAGTCTAGC 1200 

CATTCCTATG AGCGTACCCA TATTGACTCG GTGATCGCAA CAGAACGAAT GGTCGATGCT 1260 

TATCTTAAGA GCACGTTGGT GGACTAATAT GTGCCTTATT TGTCAGAGAA TTGACCTCAT 1320 

CAAGAAGGAA GAAAATCCTT ACTTTGTCAA AGAGTTGGAA ACAGGCTATC TTGTGGTTGG 1380 

AGACCACCAG TATTTTGAAG GCTATAGTCT CTTTCTAGCC AAGGAGCATG TCAGCGAATT 1440 
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GCACCATTTG AAAAAGGAGA CAAGACTCCG TTTTCTAGAA GAAATGAGTT TAGTCCAAGA 1500 

GGCAGTTGCC AAGCCCTTTG CTGCTGAGAA AATGAATATC GAACTGCTAG GAAATGGCGA 1560 
TGCTCATCTT CATTGGCATC TGTTTCCACG ACGGACAGGT GATATGAATG GTCATGGTCT s 1620 

CAAGGGTCGT GGACCAGTCT GGTGGGTTCC CTTTGAAGAA ATGACAGCAG AAACCTGCCA 1680 

AGCAAAACCG GATGAGATTA AAAGATTAGT CAAACGTTTA TCGTCAGAAG TAGATAAACT 1740 

ATTAGAAATA AAGGAGTAGA AATGAAGAAA AGATACCTAG TCTTGACAGC TTTGCTAGCC 1800 

TTGAGTCTAG CAGCTTGTTC ACAAGAAAAA ACAAAAAATG AAGATGGAGA AACTAAGACA 1860 

GAACAGACAG CCAAAGCTGA TGGAACAGTC GGTAGTAAGT CTCAAGGAGC TGCCCAGAAG 1920 

AAAGCAGAAG TGGTCAATAA AGGTGATTAC TACAGCATTC AAGGGAAATA CGATGAAATC 1980 

ATCGTAGCCA ACAAACACTA TCCATTGTCT AAAGACTATA ATCCAGGGGA AAATCCAACA 2040 

GCCAAGGCAG AGTTGGTCAA ACTCATCAAA GCGATGCAAG AGGCAGGTTT CCCTATTAGT 2100 

GATCATTACA GTGGTTTTAG AAGTTATGAA ACTCAGACCA AGCTCTATCA AGATTATGTC 2160 

AACCAAGATG GAAAGGCAGC AGCTGACCGT TACTCTGCCC GTCCTGGCTA TAGCGAACAC 2220 

CAGACAGGCT TGGCCTTTGA TGTGATTGGG ACTGATGGTG ATTTGGTGAC AGAAGAAAAA 2280 

GCAGCCCAAT GGCTCTTGGA TCATGCAGCT GATTATGGCT TTGTTGTCCG TTATCTCAAA 2340 

GGCAAGGAAA AGGAAACAGG CTATATGGCT GAAGAATGGC ACCTGCGTTA TGTAGGAAAA 2400 

GAAGCTAAAG AAATTGCTGC AAGTGGTCTC AGTTTGGAAG AATACTATGG CTTTGAAGGC 2460 

GGAGACTACG TCGATTAATA CTCTTCGAAA ATCTCTTCAA ACCACGTCAG CGTCGCCTTA 2520 

CCTACTGACT GCGTCGGTTC TATTCACAAC CTCAAAACAG TGTTTTGAGT CGATTCGTCA 2580 

GTTTTATCTG CAACCTCAAA GCTGTACTTT GAGCAs tGCG GCTAGCTTCC TAGTTTGCTC 2640 

TTTGATTTTC ATTGAGTACA AAAAGTAAAC TTTTCTCTTG CAATTCCAGA TAAATAGTGT 2700 

ATAATGGATG GGTATGTGAA AAACATACTT GTGGGAGGTA AAAATCTCTA ATTACCGCCA 2760 

AAACCACAAA GGAGGATTTA AAAATGGCTA AAAAAGTCGA AAAACTTGTA AAATTGCAAA 2820 

TCCCTGCTGG TAAAGCTACA CCAGCTCCAC CCGTTGGACC TGCTCTTGG 2869 
(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6186 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
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CTGAATCCCT TATAGGAGTC CAGTAACTTT TTAGCCTCTA CTTTGCCTTC ATAGGCAGCT 
TCAACATCAT TAAAAAAAGA AxGCACTGAA GCAAGTTCTT CAGTGCTCCA CGACAAATCT 
AGTGGGTAAC TATACTGTTT GTTCATTAAC TAATACCACC TCTCATTCTT GCTTCTTTTA 
GTTCTTGCTT ACGATAACTA CGAGGGAGAA AAGCACGAAT CTCATCTTCA TTAAAACCGA 
TTTGCATACG CTTGGCATCA ATAATAATTG GACGACGCAA AAGACTAGGA TACTCCTCAA 
TCAAATGAAG CAATTCCGAT ACCGAAATAC TCTCTACATC AATATTCAAT TTTTGAAAAA 
TTTTTGAACG AGTTGAAATG ATGTCATCAG TACCATTTTC GGTCAAGGAA AGGATGTGTT 
GCAATTCTTT TCTTGTTAAA GGACTGGTCA TAATATTGTG TTCCACAAAG GGAACTTATG 
TTTTTCTAAC CAGGCCTTAG CCTTACGACA TGATGTACAG CTCGGTGATA GAAATAGTGT 
AATCATGCTT TTCTCTTCTT ATCTATACTT TGCTACTTCT ATTATACAAA AAAATAAAGC 
GCTTGACTAG GGATTTTTAG AAAAAAAGCC TATTTTTTCA AGAAAAATAG GCTTTTTGCG 
AACGATTGAC ACAATTGGAT TTGGTTAATT CACTCTTAAC GATGGTTTTA AACGATATAT 
ATTTTTATAT ATGTAAATTA AAAACATCTT TCCTTTCACT TCCTACGACT TTTCAGATAC 
AGATAGCCAA AGAAGTTTTC ATAGAGGGCA AAAAAGAGGA GGAAGGCATG AAGAAAGAAG 
GTCTCTGGCA AAATCATAAT AACAGCATCC TTGGCTGGAT CAAAAAGCCA GGTATCATCT 
CCCACAAAGA GAATTTGATG GAAAAGAGTA AAGAATTGGT CAAAACCAAT CAAAACTCCC 
CCAAGTCCAA TCATCACAGG TAAGACTACT AGAGCCAGGA GACTTTTTCG ATAAAGAGAC 
AAAAAGTCCT TTTTCACAAT CCTATTGACA AAGACATAGA AACTTGGCAG TGTCACTAGA 
GCTACTAGCT GAACCAAATG AAAGAGATTC TTGACCACTG CGAAATGGTG CAGACCAGCT 
GCTGACGAAC CAAAATCAGG CATCTGTAAG ACCTGACTAA AAGGATTGCT CAGATAATTC 
ATCAAGATAT GAAAATTGTA TTGAATGGTT TCTGGTTTTA GATAGACTCG ATTCGTTAAG 
TTTAGCCACT GAATCTCCAT AGGATAGAAA ATCCAAGCCA GATAAATGGT CAGAAGGATG 
GAGAGGGAGA GGAGAAAGAG CATAGAGCCC CAAAAGATCA ATTTAGTTTT CATCAAAATC 
CCACTCCGCA AGGCTAGAAA CCACATGTGT CGGTGCGATT GGCAGGCCAG CTACTTCTTC 
TGCCTTAGTA AAACCTGTCG TCACCAAGAG CGTTGGAATG CCATTGTCAA TCCCAGCCCC 
AATATCAGTC AAATAATTGT CCCCAACCAT GATTAACTCT TCACGTTCCA AACCTAAGTG 
CTCAACCGCC TTGTCCATAA TGATGGCATT TGGTTTTCCG ATATAAACCG GCTTCACTCG 
TGTCGCTACT TCAAGCAGCG TAATCAGTGA GCCAGCACCT GGCAAAAGAC CGCGTTCCGT 
CGGGATGTTG AGGTCAGGAT TGGTTCCGAT AAAATGGGCA CCCTTTTGAA TAGCAAGAGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
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TGCTGTGGCA AATTTTTCAT AGTCGACTTG CCAATCCAGA CCAACTACCA CGTAGGCAGG 1800 

TTTTTCCTTG TCTTCCACAT AACCAGCCGC CTTGATGGCT TCCTTGAGTC CTGCTTCTCC 1860 

GACGACATAG ACGGTCTTTT CAAGCCCCAA ATCATTCATA TAGTCGATGG TTGCCAAAGT 1920 

CGCTGTGTAG ACAGTCGATA GGGGCGTATC GATATTAAAA TTCTGAGCCA ACATCTCCTT 1980 

AACACTCTCT GGAGTGCGGG TTGTATTGTT GGTTACAAAG AGATAGGGAA TGTCCCGCTT 2040 

TTGCAATTCA TGAACAAAAG TCTCTCCAGC AGGGATTCGG TCTTTCCCCT TATAAATGGT 2100 

TCCGTCTAAA TCAATTAAAT AGCCTTTATA TTTCATCTAT TTCTCCCTAA GCCTTTTTTA 2160 

TTTCTTGCCA AGTAATGATT GCTTGGGCAT TGATAACCCC ATCACTTGTA ATTTCATGCT 2220 

TGCTTTCCAG TCCAGTCCGT TCAACAGCCG ATGTAATCAC CCCACCTGGT CGAACTTCCT 2280 

TGACATACTT GAGGTTGATT TTCTTGGGAA TATAGTGGGT CAAAAAATCC GCTCCCATGA 2340 

CCTCAAAAAT CCAGTCCAAG TATTTACTGT TATTGACATG ACCATTCATA TCCAAGTCGT 2400 

AAAAACGAAC ATGGTAATCC TTGCTGATCG GTTCTTCCAA GGACTCATAC TTCGGTCCAC 24 60 

GGATAAGTTT TTTATCAAAA TCAGACTGGT AAGGAGCCAC AATCTCAGGT TCAACAACAT 2520 

GGACTTTTCG ACTGTCGCGG TCCATGAGAA CAAAGGTCGC CATCATGTGG ATGACCTCCT 2580 

GCTCCGCTTC ATTATAAATA GTAAAGCGAC GGTAGCAAAA AAGTCGATTG TAGCTCAAGG 2640 

CTTCCGTTTC GATGGTAATT TCTTCCGCAA AACGAGGCAA ACGAACCACC TCAATATCAT 2700 

ATTCTACGAT AATCCAGACC AGATTATATT CTTCCAAAAT GGCCTTATCA CTAACTCCCA 2760 

GTTCAATCGA CTGCATCCCT GAAACTTGCA GTGACAGCAA AATCACATCT GGAAGTTTGA 2820 

TATGACCGTT CATATCAGCC ATATCAAAAG GAATTTTCAT TTTCATTTGA TAAGTTAAGC 2880 

CCATGATCCT ACTCCAAAAT AAATCGTTCT GCTACAGTAT CTCCCAAAAA GAGACCTCTC 2940 

TTTGTCATGC GAACGTGCTC ACCCTCAATC TGCATGAGGC CTTGTTGAAC CAAATCTCTG 3000 

ACAATTTCTC CATAAAGTCC AGCAAAAGAC TGTCCAAATT TTTCCTCAAA TCGCGCCATG 3060 

GAAACCCCGG ATTTCTTGCG GAGTCCCAAG AACATTTCTT CTTCCATTTG CTCCTTTTGA 3120 

CTCAGGTGAT CTTCTGTAAT ACAAGCATTG CCTTCCTCAA CCGCACTGAG ATAATGACGA 3180 

ATGCGACCAT GATTTTTATA GCGTACTCCA TTGACATAAC CAGATGCCCC TGCACCAATA 3240 

CCATAGTATT CAGCATTGTC CCAGTACATG AGATTATGAC GACTTTCAAA ACCGGGTTTG '3300 

GAGAAATTAG AAATCTCATA ATGCTCAAAA CCCGCTCGCT CCAGCTCTGC AATGATGTAC 3360 

TCAAACATCT CCGCTTCTAG TTCCTCCTTA GGCAGAGGCA ATTTCCCACG TCGCATCCGG 3420 

TTCATAAAGA CCGTATGGTT TTCTAAAATC AAACTATACA AACTCATGTG GGGAATATCC 3480 

AATCCAATGG CTTTAGCCAC ATTTTCCTTT ACTTGCTCCA TGGTCTGACC AGGCAGAGCA 3540 
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TAAATCAAAT CAATGGAGAT ATTGTCAAAA CCAGCCAGTT TCAGGCGATC GATATTTTCA 3600 
TAAATATCCT TCTCCAAATG ACTGCGCCCA ATCTTTTTCA ACATCTTATC ATCAAAGGTC 3660 
TCGACACCTA GCGAAACACG ATTGACAGCC GAATTTTTCA AAACAGCTAT CTTATCCGCA 3720 
TCCAAATCGC CTGGATTGGC TTCAATGGTC AACTCTTCCA AGACAGACAA ATCCAAGTTT 3780 

TTAGTCAAGC CATTCAGTAA CACCTCCAGT TGCGGAGCCG ACAGGGCTGT CGgTGTTCCA 3840 

CCACCGATAT AAAGGCTTCA CAACTTTTCA ATATCATAAG AACGAAACTC TTCCAGCAGA 3900 

TGCTCTAAAT AGCTGTCGAC TGGCTGATTT TTGATGAAGA CCTTTGAAAA ATCACAATAA 3960 

TAACAAATCT GGGTACAAAA TGGGATGTGC ACATAGGCTG ACGTTGGTTT TTTCTGCATA 4020 

GTAATTATTA TACCACAAAG ACTAGATTCC AGATAAAAAT CACCATCCCC AGATACATAG 40 BO 

TCCGTCCGGA GATGGTGATG GTTTATTCTT CTGTTATATC AATCACAATC TCTTCTGAGT 4140 

CATCAAGAGC TTCGGCTTTT TCTTGCCATT GCTCCTTGAG ATTATTTAAT TGATTTTTTG 4200 

ATGCTTCTGT CGCTTGAAAA GCATAGGATT TAGTTTGAGC AAGTATACTG TCCACAGTGA 4260 

TTTCACCTGA CTCAACCTCT TCTTTTCTTT TCAGAACAAA ATCTGTAGCC TGCTCCTTAA 4320 

CTTCTGTCAG TTTTTCACAG ACTTGCTCCT TGGCATACTC CGGATCTTCT CTCAAATCAT 4380 

CTAGAAAATC TTGAGCCTGA CTGCAAACTT GTTTGCCCTT ATCACTTGTT AAAAACAAGG 4440 

CAAGAGCTGC ACCTGAAACG GTTCCTAAAA GGATTGAGGA TAATTTACCC ATAAGGATTC 4500 

TCCTTTTTTA TTTTTTGAAA AATTTACTTG CAAGACGAAG AGCTGACAGA CTTGCACCAG 4560 

TCTTGAGTGT TTTTGAACCA GCTGATGAAG CTTTCTTGCT CAAGACACGC GCATGGTCAT 4620 

TGAGGTCTGA AACAGATAGA GATAAATCTG CAACAGCACT GAAGAGTCGA TCAATCGTAC 4680 

CCACCTTGAC ATTGATATCA TCTGCCAAGA CATTGACCTT AGCCAACAAC TCATTGGTGT 4740 

GATGCAAGGT CACATCCACA TCTGAAGTCA AGGTTTTAAT CGTCTTTTCT GTTTCATCGA 4800 

TGACACGACC AAGCTTTTGT ACAGTAATGA TCAGATAGAC CAAAAAGACA ATCAAAGCTA 4860 

GGGCAACAAG AATATATGCA ACTTCTAACA TTTAGTTTTC CTCCTCTGTA ATATAGTAAG 4920 

GGGCCTTCTT TCGATTTTGA TAAATAACGA TCATTATACC GAGACCGATA AGGACAACTG 4 980 

ACAGCCATTG GGACACTCGA AAGCCGAAGA ACATGACACT ATCTGTTCGC ATACCTTCGA 5040 

TAACCATACG ACCGAAACCA TACCAAATCA AGTAAAAGGC CGTGATATGA CCTCGTCTGA 5100 

GACTCTTCCA TTTCCGTCTA AAAATCAGAA TCAAGGCAAA GCCAAGCAGA TTCCATAGAG 5160 

ACTCATAAAG GAAAGTCGGT TGACGGTAGC TCCCCTCAAT ATACATCTGG TCACGGATAA 5220 

AGCCAGGTAG ATAATCCAGA TTATCCACTG TTGCACCATA AGCTTCTTGG TTAAAGAAAT 5280 
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TACCCCAACG CCCCAAACTT TGAGCAATCA TAACGCTAGG CGCCGCAATA TCTAGAAAAT 5340 

CCCAAGTATT GATGAGTTTA CGGTCAGCAA AGATATAGAG CACAAGAGCC CCAGTTATCA 5400 

AACCACCGTA AATGGCCAAA CCACCATTCC AAATCGCAAA AATCTCTCCT AAATTCTGAC 54 60 

TATAGTAATC AAATCGGAAA ATAACATAGT AGAGACGAGC TCCTAAAATA GCCAAGGGAA 5520 

ACGCTACTAA GATAAAATCT AAAATATCGT CTGGTATGAT CTTCTTTCTA GGTGCTTCTT 5580 

TCATGGTCAA ATAAACCGCA AGAATCAAGC CTGTCACAAT ACATAAGGCA TACCAACGAA 5640 

TGGCTAGGGG TCCTAGTTGA ATAGCAATTG GATCAAGCAT TTTGCACCTC ATTTCGAGCG 5700 

ATTAGACTTG TCAGTCGTTC GTCGAACAAA CGGGTCGCAT CAAAGCCCAT TTCCTTGGCA 5760 

CGATAATTCA TGGCAGCTGC CTCAATCACA ACAGAGATAT TACGACCTGT TTTAACTGGA 5820 

ATACGAATAC GAGGAATGtA CGCCAGAAAC TTCAAGTTCC TCTGCATTAT TTCCAAGACG 5880 

ATCAAAGGTC TTATGCGTAT CGTAATTTTC CAAATAGACA GCAAGCTGAA CCTGTGAAGA 5940 

ATCCTTGACA GCACTCGCAC CGTAGAGACT CATAACATCG ATAATACCAA CCCCACGAAT 6000 

TTCAATCAAG TGTTTCAAAA TTTCAGCTGG TTCACCCCAG AGAGTAATCT CATCCTTGGC 6060 

AAAGATATCG ACACGGTCAT CGGCTACCAA ACGGTGACCA CGTTTGACAA GCTCAAGACC 6120 

TGTCTCGCTC TTACCAATTC CACTATCTCC CTGAATCAAG ACGCCCATCC CATAAATATC 6180 

CATCAA 6186 
(2) INFORMATION FOR SEQ ID NO: 132: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9541 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

GAAAATCACA ACCCTTTTTG CAAAATTTTT GAGATTATTT TCACAAACTT GATTTTTCAA 60 

AGTATACTCA ATAAAAATTA AAAAAATCCA CTACGTCAAG GCGAGGCTAA TGTGGTTTGA 120 

AGAAATTTTC GAAGAGCGTG AATGAGTATC ATCTATAGTA AAATAAAAAA ACTGAACAAT 180 

TTGGTTGGGG ACAGCCAAAC CAATTTCTCA CAATGTTTCA GAAACAAGGG TGTGCTATTC 240 

CAATTTCAGC CTACTATAAC TGTCATAGAT TGCTGAAACA AAGTCTAGGT AAAAGTCTTC 300 

ATAATAAAAA GACCTCCTAT CAAGTGTTCA AAAACTTTGA TAGGAGGTCT TGTTTTGTGA 3 60 

AAATATTTAT CAAATTTTCT ATACAAGTGA GCTGTTAGCC AGGTTCTTTC TATTCTTTCA 420 

ATTTCAATGA ATGGATTTTT TACTAATACT CATAACTGGG AATTTGTCTG TGTAAAAATA 480 
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CCGAGATAGA TGGTATTTAT AAAACACTCA AGACAGCTAG ACTAATATCA TTTAAAACAT 
TATCTTCTTT TGAGCGACTG TTGGTTACCA ACATAGCTAA ATTTCCTGCA TTTTCAAATT 
GATAGGGTTC TGATTTAGCA TTCACAACCA CCAAGAGGTG TTCTTTGCCG TGAACTTCAT 
AGATAAGGTA GCCGCTATGT TCAATCGCAG AATGCACAAA GACATGATGG TAAATTTCAT 
CATAGCTAGA GTAAGAAAAG GCACCAGTTT TTGTCTTCAA TCGGATGACT TGACGGATAA 
ACTCAATACT GTCTTGACGC TCATTAATCA AGTTCCAGTT CACTTGGTTC ACACTGTCAG 
GAGCATTATA GCTATTCATC GCACGCTCTC TATCATCATG GGTCAACTCA CCATTTTCAC 
CAGTCGCAAC CAGTTTGGTA CGACCAAATT CTTGACCGAT TTCCATAAAG GCCATCCCCT 
GCATGAGCAG ATTCATGGCT GTGGCAGTTT CGACCTTGCG CATGATTTGC TCTGAACTTT 
GGTCTGGATG AAGGGTTGCC AATAAATCGT GAAGATTGTA ATTGTCATGG GCTTCTACAT 
AGTTAAGCAC CTGATTTGGA TGTGTATAGC TTCCTAATTC ACGACTTCCT AGGATTGCTT 
TAGCTAGAAT TGGCTCTGTC GCAGCACCAC TGACAAAACC TGACTTGATA GCACCATAAA 
CTTCTCCCCC TTTGACAGCA TCGCGCTGAT TGTCATTAAA GAAACCAATA TTTGGCATCT 
GGTAGGCATT GTCCTTCTTG GCCTTATCAT AAGGGGCAAG ACCTGTTCCC ATATCCCATC 
CTTCTCCATA GAGGATAATG TTGGAGTCGA TTTCATCCAA GCTTTGACGA ATCATCTGCA 
TGGTCTTGAC ATCATGAATC CCCATCAAGT CAAAACGGAA GCCGTCAATA TTATATTCCT 
GCACCCAGTA TAGAAGAGAA TCAATCATAT ACTTGCGAAA CATTTCGTGT TCACTGGCTG 
TTTCATTTCC AACACCCGTT CCATTCTGGA AGGTACCATC TGGATTCATA CGATAATACT 
AATCAGGGAC TGTTGTTTGG AATGGTGCAT CAACAACTGA GAAGGTATGG TTATAGACTA 
CATCCATAAT GACTCCAATA CCCGCATCGT GATAAGCTTG AACCATCACC TTCAAATCAC 



GAATGACCTG 

TTTGTGGATC 

GGTCTGCAAT 

CAGTTGACTG 

GAAGATGTTC 

AGATAACTGC 

CGAAGTTTTC 

TGGTCGCGAT 

ACTGATAAGT 



AGCTGGATCA TCTGGATTAG TTGAAAAACT AGTTTCTGGC GCGTTATAGT 
ATAACCCCAG TTGTAGGTTA CATTTCCATC CTCATCGTAT TCTTTATCAC 
TGGTTGCAAT TGAACATAAT TGTAGCCCAG CTTCTTGATG TAATCAAAAG 
GCCGTATTGG TTAACTGTTC CAGCCTGAGC AGCACCCAAG AAAGTTCCTC 
ATCTACACCC GATGTAGGTG ATTTAGTCAA ATCACGAATG TGCATTTCAC 
CTTACATGGA TTTTCCAAGC GCCAAGTAGC CTCCGAACCG TGCTTAACCT 
AACTTGCTTT TCTACATGGC TCAGAATAGC TGAACGTTTG CCATCAGGGC 
TGTATAAGGA TCACGTGTCA GTGTTTGGTG ATGAGGGAAT TGGACTTGAT 
CTTACCTACC AAATCTTCTT CAACATCCAA ACTCCAGACA CCGATTGTAT 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
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TGTCCTTATG ATTATAAGAG TAGCTATTGC CTCTTTTCAT CTCAAAAGTC TTCCAAACGG 2280 
GTGCATCATT AGCAGCTGAT TCATAAACGA CAACTTGCAC TTCTGTCGCT GTAGGTGACC 2340 
AGAGAGAAAA ATGAGCCTGA TTGTCCTCTA CACGGCAACC CAATTCTCCT TGGTAACCCC 2400 
AATGATGATC AAAACTAGCA CTGTTAATGG CCTTATCAAA GGCAAAAGGA TTTTGATTTT 2460 
TATAGAAAGG ACTGGCAATA ; GCAGGATTTT CAGAGTAATA AATCCTATCA TCGCCTTCCA 2520 
AAATCCAGAC CTCTGTTAAT AGGGGATAGT GATTAAAACG GATAGAATAT TCTTTACTAG 2580 
TTTGACCTGT ATGAACCACA AAATTCAAGC TTTCTATAAC ATGTGAACTT GGGTGTTCAA 2 640 

AGCTAAATAA AGCTCCAAAA TAATCTTCTT TGTAGGTTAG CAAATCAATT CGTTGATCCT 2700 

GACTTTTTAC AAAGGAGCAA GTGTCATATT CTCCATTCTT ACGATGGTAA TGAATGCGCA 2760 

TAGGGTAGTT ATACATTTTT TATTTTTCCT TTTTACTTTG TTTCTATTTC ACTAATAAAT 2820 

TTTTGTCAAT CTCGTCTCAA TTAACAGACA TAGTCATATT CTCTAAACTC TGTTTTTAAA 2880 

CGATCCATTA CAAACTTTCT AGCCATGCCT CATCTCTGAC CTGGATACCA AGTTCTTGTG 2940 

CTTTTTGCAG TTTACTTCCA GCGTCTGCAC CTACCACGAC GAGGTCGGTC TTTTTAGAAA 3000 

TACTACCTGT CACTTTGGCA CCCAGACTTT CGACTTTACT TTTAGCTTCT GAGCGCTTGA 3060 

GTCGTTCCAA TTTTCCTGTC AATACCACGG TCAAACCTGA CAAGGCCGCA TCCGCTACTA 3120 

CCGTCTGTCC TTTATAGTCC AGATTGACCC CAGTTTCTTT CAATTCTCTG AGCAGAATTT 3180 

CAGAGCCTTC TGTCGCAAAA TAAGTCTGAA GACTTTTGGC AATCACGCCA CCTAGACTTT 3240 

CAATACT AGC CACTTCCTCT GAATCTGCCT GAGACAGATT TTCAATTGAA TGGAAATATT 3300 

GAAGTAAAAG CTGACTAACC TTGCTTCCGA CATGACGAAT TCCCAAACCA AATAAGAGCT 3360 

TCTCGGCAGA ATTTTCCTTT GATGCTTGGA TAGCCTGATA CAGTTTAGCA GCGGACTTTT 3420 

CCTTAACTCC CTCTAAAAGG AGGAAATCCT CTTCTTGCAA ACGATAAATA TCCGCCACAT 3480 

CCTTGACTAA ATTAGCAGCA AAAAGCTTCT CAACAATAGA TGGACCAACG CCTCTAATAT 3540 

TCATAGCATC ACGAGAAGCA AAGTGAATCA AGCCTTCCAT GATTTGAGCA GGGCAACGCG 3 600 

GATTGATACA ACGTAGGGCC ACTTCATCTT CAAAGTGCAA CAAGTCAGAG TTACAACTTG 3 660 

GACAGTTTGT AGGGATATCT AGTTTTTCTT CAGAAACCCG TTTGGACTCT ACCACACGTA 3720 

AAACGGCAGG GATGATGTCA CCAGCCTTAT ATACAATGAC CGTATCCTCT TTTCGGATAT 3780 

CTTTTTCAGC AATATAATCT ACATTGTGCA GGGTCGCACG GCTAACAGTC GTACCGGCAA 3840 

GTTGTACTGG TGTTAGATTA GCAGTTGGAG TTACAACACC GGTACGGCCA ACTGTCCAGT 3900 

CAACTGATAA GAGTTGAGCT TCTTTTTCTT CGGCAGGGAA CTTGTAGGCT ACTGCCCACT 3960 

TTGGAGCCTT AACTGTAAAA CCAAGTTCTT CTTGACTTGC TAGGTCGTTG ACCTTGATTA 4020 
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CCACTCCATC 

TCCAGATTTC 

GTTCTAGGTA 

AGAGAAACGT 

CTGCTGCCGC 

TAACTTGGTC 

CTTCTGGCAA 

CAATTGAACC 
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AATATCGTAA GGCAGATTTT CCCCTTCCTC TCCTACTTCT TGGATAAAAT 
ATCTATGTTT TCAGCCAAGA TTCGCTTACG ATTGACCACA AAACCTAGTT 
CTTCAAACCC TTTTCTTGGC TATCACGAGT TGAAGGGCTG GCTTCTTGAT 
TGCAAGATTA CGCTTGGCAA CTACTGCTGT ATCCAACTGA CGCAGAGTTC 
ATTACGAGGA TTAGCAAATT CAGGCTCTCC ATTTTCTTGG CGCGCTTGGT 
AAAGGAACCG CGTGGCATGT AACATTCCCC ACGAACTGTG ATATCTAGTT 
AGTCAAAGGG ATGTCCTTAA CACCCTTGAG GTTTTCTGTG ATATTTTCAC 
ATCTCCACGT GTTACCCCAG CAACCAAAAT CCCCTTTTCA TAAGTCAGCG 



AGATAGATAA 

GAACACGCGC 

GAGGATACTG 

TCGCACTGTC 

AAAGGCGGTC 

CATAGCGATT 

ATAAAACAAG 

TTTGAAACTC 

TAAGAAATAT 

TGGCAAGACT 

TGTACTCAGT 

GTGAATATTA 



GCCATCGATT 

ATCAAAAGCA 

ATGACTGTAT 

TGCTAGCACT 

ATACTCACTG 

GAGCAAAGCG 

CCCTCCTCAC 

AAGCAACCTT 

AAGGCTACAA 

GTCATAAATC 

ACTCTTCCAA 

AAAATCGTCA 



TTCAGCTCAC AAATATAGGT CGGATGAGCC ACTTCCTTAC 
TCTAGCTCCT CACATGAAAA AGCATCCTGC AAACTATAAA 
TTTTCAAAAC CATCTAAAAC CTTGCCACCA ACACGATGAG 
TGCTCTGGAT AAGCAGTTTC TAACTCGACC AACTCACGGT 
TCTGAAACCG AGGGATTATC GCTGGTATAG TACTCAGTCC 
ACTAACTCAT TCATTCTTTT ATTCATAAGA CCATTTTACC 
AAACGAGAAG GGCGGAAAAA ACACTTAGTT TGAAATTATT 
ATATCAATTT TTCAAAATGA GTTCGAACAT ATCCGAGAGC 
CTCCAAGTCC AATAATCAAG AAAGAATAAA GATGGACACT 
CTTTTGCAAT AGGCATAAAT AGAATAGCTA AGGTAAAAAT 
GAAATTCGCT CTCAACCTTG GTTTGTACTT GAGTAAAAAA 
TAAACAATTC ACAAACTAAA TTTCCAGAAA AGGAAAGAAA 



AGTTGGAAGT GGTAATCCCA 
AAGATTATAA ATATTAGCTT 
AGCCCCCATA GTTAAAATAC 
GGGAAGTAGA AATTCAAAAG 
GAAGAAAATT TCTTGCTGAT 
ATCTCTCCCA GTAAAAGCCT 



TCATAAAAAC TCCGACACCT GTCAAAGCCA GTAAAATCAA 
TAATTTTACT AGCTAGAAGA GCCCCAATGA TGGAACCAAT 
TTGCATAGGC TCCTTCTGAC CCGTAAAGCT GATTCGAAAA 
CTGCAAAAAA GAAATTAACG CTGGAAGCTA CCAGCAAAAG 
GCCAGATATA GTGTAACCCA TCCTTGATAT CTACAAAAAT 
TTTTCTCTTG AACTTTTGCT TCCTCTTTTG GAAGGAAAGC 



CACTAGAACA AAAGCAATGA AAAAAGTCAG CGAGTCTAGC AGTAGCGTCA TATGGAGACT 
TGCAAACTGT AAAACAAGGA AGGAAAGAAC AGGAGAGCTA ACACCTACAA CCTGCAAAAC 
CAGCTCTAAG CGAGAATTAT AGATCACAAT CTCATCTTTC TCCACCACTT CAGTTATGAT 
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AGCTTTATTG 


GCTGTGCGAG 


AAAAGGCAAA 


AGCAATAGCC 


TGCACAATGT 


TAGCAACAAT 


5820 


CAAAGCGCCA 


ATCATCCAGC 


TATCATTCCT 


TATGAAAGAA 


ATAGCCAGAC 


AAAGAATCCC 


5880 


ACAAACAAGA 


TCTGCCGTCA 


TTAAAATCTT 


ACGACGAGAA 


AAACGGTCTG 


AAATAACTCC 


5940 


GCCAAAGGGA 


TTGACGAGAA 


TAGATGTGAC 


GAGCTCAGAA 


ATCTGATACA 


TTCCTAAAAC 


6000 


TGTCTGTCCT 


ATAGTCCCCA 


TAGAAGCCAA 


CCAGACACTA 


TTTCCATAAT 


CATAGAGCAT 


6060 


ATTTCCCATT 


TTATTGATAG 


CCCCACGGCT 


AATCAACTGC 


ACTGCATACC 


GATTCATATT 


6120 


AAAGCTCCTC 


TCAAATTTTG 


AAACTATTGT 


ATCAAAACCG 


AAAGGAGCTT 


TTTATTTTTT 


6180 


CCCTTATTTG 


GGAAAATTAA 


CTTTTGACAA 


ATTTTTCGTA 


GTGTTCCTGA 


TAATAGGCTA 


6240 


CTTGCTCTGG 


AAGACCTAAC 


ACATCAAAAA 


TATGCATGGC 


CTCTTGCATC 


TGCTTACAGC 


6300 


CTTCTTTACA 


CTGTCCTTTT 


TGATATAAGG 


CAAAACCTTT 


TAAATAATGG 


AAAACATTAC 


6360 


GCTCATAAAG 


CTTAATACCT 


TTGTCAATAA 


TCTTCTCTGT 


ATAAGCCTCA 


AAATAGTTGG 


6420 


CATTATAAAA 


AGAAGAATGC 


TCTAAACAAT 


GCTGGTAACA 


ATTGAGGGCC 


AAAATCAACA 


6480 


CTAATCTCTT 


ATGGCGACTA 


ATCTCTTGGT 


AAAATTCCTC 


CCTCTCCATA 


ACTTCTCTAC 


6540 

V -J V 


CAATCCGAGT 


GACATAGTCT 


ACATCGTAGA 


AACTATAGAG 


GTTACCGAAA 


AGAATCAACT 


6600 


CATACATGGT 


CCATTCTTCT 


GTTTTGAAGA 


GATAATCTGC 


TACCTTACCC 


AAATCATCCT 


6660 


GCTTCATATC 


ATAACTCGCA 


TCTCTTTGAC 


AAATCAGACC 


TTGTAGCAAA 


ATCCAGTTCA 


6720 


GCTCAAAATA 


AAGGGGAGTC 


GTCGAACTCT 


TAGACTTTTC 


AAGTTGTTCT 


CTTTGAAGCT 


6780 


TTTGAAAACC 


TGCAATATCG 


TTTGAATAGT 


AAAGTGGGAT 


AATCTGTGCC 


ATCATAGACA 


6840 


CATGTTCATG 


ATTATGAAAA 


TTCCTTGCCT 


TATCCATGAA 


ATTTTCGATT 


GTTACATGAA 


6900 


TGTTATCCAA 


AATCTCAAAG 


AAACGGGAGA 


CTGCCAGGTC 


AGACTCCCCA 


AGCTCAAAGC 


6960 


GAGATAACTG 


AGAGGTAGAG 


CAGGATTCGC 


CTGCTGCTTC 


CTTTAAAGAA 


TAATTTCCAC 


7020 


TTGTTCGAAA 


TTCACGAAAT 


ACTTTTCCAA 


GATGTTCCAT 


CTTTACACCT 


GCTCTGATAA 


7080 




TCAAGCATAG 


CTTCTTCCTG 


ACGATGGCTG 


ATTTTGTCCA 


GCTCAGCCTG 


7140 


TAATTCCATG 


AGTTTGTCGC 


CATCGTTTGT 


TTCCAACATT 


TGTTCAGAAA 


TGGCTTGGCT 


7200 


TTGACTTTCT 


AGCTCTTCAA 


TTTCAGCTTC 


TAGACTTTCG 


ATTTGTCCCA 


TGAGTTTGCG 


7260 


AACTTCTTTT 


TGACTTTCTT 


TCTGGGCCTG 


ATAGTCATTG 


ACTGGACTTG 


CTTCCTTTGC 


7320 


TTGATTGCTA 


GTTGAAGCTT 


CCTCAGTCTG 


ACTCATTTCT 


GCTGTTGCTT 


TCTTCTCAAC 


7380 


ATAGTAGTCG 


TAATCTCCAA 


GGTAGAGAGT 


TGAACCATTC 


TCAGACAATT 


CCAAAACATG 


7440 


AGTTGCCACA 


CGATTGATAA 


AGTAACGATC 


ATGACTGACA 


AACAGCAAGG 


TTCCATCAAA 


7500 


GTCAATCAAG 


GCATTTTCTA 


GCACTTCCTT 


ACTATCAATA 


TCCAAGTGGT 


TGGTCGGCTC 


7560 
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ATCCAGAATC AAAAACTTAT TGTTTTCCAT AGACAATTTA GCTAAAAGCA AACGAGCTTT 7620 

TTCGCCACCA GATAGCATGC CGACTGATTT TTTAACATCA TCTCCTGAGA AAAGGAAGGC 7680 

TCCAAGACGG TTGCGGATTT CAACTTCTGG TGTCAGTTTG AAATCATTCC AGAGTTCATC 7740 

CAGCACCGTA TTACTTGGTG TCAGCTTGCT TTGGGTTTGG TCATAGTAAC CAACCTCAAC 7800 

ATTAGCGCCA AAGCGCTTTT CTCCCTTGAT AAAAGGAATC TGGTCCACAA TAGACTTGAT 7860 

AAAGGTTGAC TTGCCGATAC CATTTGGACC AACGATAGCG ACAGCATTCA TCTTACGAAG 7920 

ATCTAGGTTA ATCGGTTGTG ACAAGACTTC CCCGTCATAG CCAACAGCTG CATTTTCAAC 7980 

AGTCAAAACA ACATTGCCCG ACGTTTTTTC AGACTGGAAG GTCATGTTGG CTGATTTCTT 8040 

GCCAGCTTCA GGCTTGTCCA AACGTTCCAT TTTTTCCAGT TGTTTACGGC GAGATTGAGC 8100 

ACGTTTAGTC GTTGAAGCAC GAACTAGATT GCGATTGACA AAGTCTTCCA GAGCAGCGAT 8160 

TTCCTTCTGT TGCTTTTCAT AGTTTTTTGC CTCAGTAACT AGCTTTTGCT CCTTCAATTC 8220 

GACAAAACGA GAGTAATTCC CCACATAGCG ATCCAAGGAA TGCTTGGTCA AATCTAGCGT 8280 

AATTGTCGCA ACCTTGTCCA AGAAATAACG GTCGTGGCTG ACGATAATGA GGGCACCGCT 8340 

ATAGTTTACC AAGTAATTCT CTAGCCAGGC GATGGTTTCA ATATCCAAGT GGTTAGTTGG 8400 

CTCGTCCAAG ACCAAGAGAT TGGGCTTTTC AAGGAGCATT TTGGCAAGTG CCAAACGAGT 8460 

ATTTTGACCA CCAGAAAGCT CAGCAATTTT CATCTGCCAC ATAGACTCGT CAAACTTGAA 8520 

TCCATTCAAA ATCGCTCGAA TATCAGCTTC ATAGGTAAAG CCACCTGCTT GGCGAAAATT 8580 

CTCAGATAAG CGGTCATAAT CTGACATCAG TTTATCCAAA TCCTCACCAG ACTTTTCACC 8640 

CATCTCCAGC TCCATCTGAC GCAGTTGTCT CTCCGTCCGA CGCAAATCAT TAAAGACATG 8700 

AAGCATTTCA TCGTAGATGG TATTTTCAGA CTCAAAACGG CTATCTTGGG CTAGGTAAGA 8760 

CAGAGAAATA TCTTTTTTCT TATTGATTTC TCCGCTAGTT GGCTCCTCTT CTCCAACTAA 8820 

AATCTTCAAA AGAGTAGACT TACCTGCACC ATTTTTCCCA ACAAGAGCAA TCCCATCTCG 8880 

TTCATCAACC TGCAGGTTGA TATTATCGAA AAGAACCTCT CCTGCAAAAG AACGTTCAAT 8940 

TTTATTAGCT TGTAAAATAA TCATACAAGT AGTATAGCAT GTTTCCCTAA GGCATTCAAG 9000 

ATAATCGTAA GTCTTTTAGT ACAACTTTTA TAACATAAAA TAAACTAAAT TATCTATATT 9060 

TTATATTAGA TTACTTCACT ATCTTGTTGG ATTTTCTAAC CAGCTAATCT TGTTTCAAAT 9120 

AGTTATCGCA CAAGTCTATT ATTTAATTCT TTTCATCATT TACGTACGTA TAGCAGATTG 9180 

AAATAAGATG AGAACAAATC GATTGGGAAA GTAAAATTAA TTTCTATAAA TGTTTTAGCA 9240 

ATTGTTTCGT ACTATTTTAG ATTCAGTCTA CTATATACAA TATTTTCGGA ACATTCAACT 93 00 
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TTTTAACTCT ATTTATTACT AGATTTCATA ATTAAAAAAC CTACTGACCA AGCTAGAAAG 9360 

CTTGATACAA TAGGCTTTTT AAAGACTGAT TATTTAACAG CGTCTTTAAG AGCTTTACCA 9420 

GCTTTGAATG CTGGTACTTT AGAAGCTGCA ATTGTCATTT CTTTACCAGT TTGTGGGTTG 9480 

CGACCTTTAC GTTCTGCGCG CTCACGAACT TCAAAGTTAC CAAAACCGAT CAATTGAACT 9540 



r 9541 
(2) INFORMATION FOR SEQ ID NO: 133: 

(ij SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

TTGACTATCC TATCATCCTT TCTAACGTCT ACTCAAGAAA ATCATTTTCA AGTTTTCACA 60 

CCTTTCTCAA AAAAGTTAAA AAATTTTCTC AAAAACGCTT GACTCTGACC TAAGGCGAAG 120 

GGTTATACTA TCATTGTAAG GAGGAAATCA TGTACCATAT AAAAGAAGCT GCGCAGCTTT 180 

CAGGTGTCTC TGTCAAGACC CTGCATCACT ATGACAAGAT AGGACTCTTC GTCCCCTTAA 240 

AGTCGGAAAA CGGCTATCGA ACCTACAGTC AAGAGGATTT GGAACGCCTT CAGGTCATTC 300 

TTTACTACAA ATATCTAGGC TTTTCTTTAG AGAAAATAGC AGAGCTGTTA AAGGAAGAAA 360 

GGACAGATTT ATTCCCCCAT TTGACTAGGC AGTTGGACTA TCTAACTCGC GAAAGGCAAC 420 

ATCTGGATAC CTTGATTTCC ACCTTGCAAA AAACTATTCA AGAACAAAAA GGAGAAAGAA 480 

AAATGACCAT TGAGGAAAAA TTCACGGGAT TTAGCTATCA AGACAATCAA AAATACCACC 540 

AAGAAGCGGT AGAGAAATAT GGTCAAGAAG TCATGGGACA AGCGCTCGAA CGCCAAAAAG 600 

GTCACGAAGA CGAGGCTACG GCCGCCTTTA ACCAAGTCTT TCAAACTTTG GCACAAAATC 660 

TTCAAGTTGG TTTACCTGCA ACAGCAACCG AAAACCAGGA GCAAGCAGCC AAGCTCTTGC 720 

AAGCCATTCG CACTTATGGA TTTGACTGCT CTATTGAGGT ATTCGGTCAT ATCGGTAAAG 780 

GTTACGTCTA CAACCCAGAG TTTAAGGAAA ACATTGACAA GTTTGGTTCT GAAACAGCCC 840 

AGTACACGTC AGATGCCATT GCGGTTTACG TTCAGACAAA TGCAGAATAA ATAGGCTAGG 900 

AATTTCCTAG CCTATTTTTT ACTTCAAATC ATAAAGCCAG TCGTCACCGT TTTTGTAGTA 960 

AAAGAATTCA CTGAGATCTT CTTCTAGAAA CACACGAAGC ATATCAGACA TATCATCCGT 1020 

TGCAAGTTTT AGATGAGAAA GATTTTCAAA GTCCTCCCAC CAAACTTTCC CTTCGTCTGA 1080 

AGACTGGAGT TCACCAGTAA AGTGTTCTGT CTTGTAAAAA AGGACGACAT AACGATAATC 1140 
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CTTGTCGTCA TACCAGTTTT TGATACCACA GAGTTGGGGT TTGGAAATGA TCAGACCAGT 1200 

TTCTTCTTTC ACTTCACGAA TGACAGCATC GACAAAGGAT TCGCCACGTT CAACATGACC 1260 

ACCAGGAAAA GTAATGCCAG ACCAGTCGGG ATTAACTCGG TCTTGGACCA GGACCTTATC 1320 

TCCGTTTTTA ATCATACACA TGTTAACAAA TTCGACTGCC TCTCTTCTGT TCATTCTTCA 1380 

CAACCTTTAA TCTTTAATCA TAATGCAGAC TTCCCGCCAC CCAGCCGGTA CAGAGGGCAG 1440 

AAGTGATGTT AAAGCCACCC CTGTGGGCAT TGATATCCAT AACTTCGCCT GCAAAGTGGA 1500 

GGCCAGGT AC CAGCTTACTT TCAAGGGTTT TAGGATTGAT TTCCTTGAGA CTGACTCCAC IS 60 

CCTTGGTAAC AAAGGACTTT GCAAGGGACA TTTTTCCAGT TACAGGAATT TTAAGTTCTT 1620 

TAATGGACTG GACAAGTTGT TCTCGTTCCT TTTCAGTCAG TTGTTTGACT TTTTCAGGAT 1680 

ATCCTTGTAC AAAAAATTCG GCCAAGCGTT CTGGTAACAA GGTTTTTAAA GCGTTTTTCA 1740 

AGGATTTTTC CCGATTTTCT TCTAGAAATG TAACCAAGTC CTTCTCAGAA AGTTGAGGCA 1800 

AAACATCGAG TGAGAGAACC TCCCCACCTT TGACAAAGCT AGACATGCGT AGGGCAGCAG I860 

GACCTCACAA ACCAAAGTGG GTAAAGAGTA AATCATGAGT GATGACATGC TTACCATAAC 1920 

TTAGGGTCAC ATCGTCCAGA GAAATACCTT GTAACGCTTT ATGTGGAAAA TCTGTTAATA 1980 

AAGGACTTTC AGCAGCCTCA AGATCGGTGA TGGTATGCTT AAAATGGCGA GCAATCTCGT 2040 

GACCAAAACC AGTCGAACCA GTCGAACGAT AAGACTTACC ACCTGTTGTG ACAATGAGTT 2100 

TCTCACAAGT GAAGGTTTGA TCCGCTGACT TAAGGACAAA CTGGTCATCT ACTTTTTTAA 2160 

CAGAAACGAT TTCTATTTGA GTAGCAACTT GACCACCTAG TTCGGTGATT TTCTTTTCCA 2220 

AAGCTTCGAT AATAGTCCGA GACTTGTCAC TGGCTGGAAA GACGCGTCCG TGGTCTTCGA 2280 

CCTTAAGTTT AACACCATTT TCTGTAAAAA AGTTGATGAT GTCATGATTA TCGAACTGGG 2340 

AGAAAACACT GTAAAGAAAG CGTCCGTTTC CAGGAATTCC AGCTAGCAGG TTGTCTAAGC 2400 

TACCATTGTT GGTCACATTG CAACGTCCCC CACCAGTCCC AGCTAATTTT TTTCCAACTT 2460 

TCCGATTTTT TTCGATGAGG AGGGTTTTCT GTCCATAAAA GCTACTGGAA ATCGTAGCCA 2520 

TCATACCAGC AGGTCCCCCA CCGATGACAA TAGTATCAAA ATGTTTCATA GCTCTATTGT 2580 

ACCACAAAAA AACAAGACAT GATGGTCACC TCTTGTCAAG AATGCAATTA ATCAATTTCA 2 640 

TAGCCCATCA GCAAACCGCC CTCTTCTGCA TAGAAACTGC AGAGACCAGA GGTTGGTAGA 2700 

ATTTTAATAT CCGCTTGTGG GAAGGTTTCA CGGATTCGCT CTGAGAGCTG TTGACAACAT 2760 

TTTTCGTTAT TGCGTTGGGC CATGACAATA CGGCCACCAG CATATCCAGC TTTTACTAAC 2820 

TCATCATAGG CAGCTTGAAC TGATTTCTTT GATCCCCTTG CTTTTTGTAG CAATTCGAGA 2880 
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GTCCCAGTTT CACTAGCTTT TCCGACCATA CGAATGTTGA GAAGGCCAAC GACCGTACCG 2940 

ATAAGCTTGC TCAAACGGCC GTTCTTCACC AAGTTATCGA CTTTGGCTAG GACAAAGAGC 3000 

AACTTAGTTT TTTCTTGATA GGCGGTGATA GCTTCAACCA CTTCTTCAAA AGACAAGCCC 3060 

TGGTCAATCA AGTCATTCAA TTTTTCTACG AGTAGGTCAA CTTCACCACC AGCAGATAAA 3120 

CTATCAATCA CATGAATCTT AGTGTCAGGA TGGTCTTCCA GATAAATATT CTTTGCTAGT 3180 

TGAGCACTAT TGTGACTGCC AGAAAGGGTA CCTGTGATGG TTACTAGGAA AATGTTTTTG 3240 

GCACCTTCAA ATGCTCGCAA ATAGTCATCT GGGCTTGGAC AAGCCGATTT TGAAGCTTCT 3300 

GCAGTTGCAT ACATGGTTTC CATCATTTGG TCAATATCGA GACTGGCGTC ATCAACAAAG 3360 

ACCTGATCAG CTACTTGAAT GGTTAAGGGG ACACTTACAA AGGTTGTGTT AATAGCTGCT 3420 

GTTGGCAGTT GACGATAATC ACAACCAGAG TCAGCAATAA TCTTCCAAGT CATAGAAATT 34 80 

CTCCATCTTT GTCAGGAACG AT 3502 



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12665 base pairs 
(B| TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

CGATTGATTT TTTTAAAGCG TTCGATAGAG AATGAGAAAC GAATCCTTAG CAATGGCGGG 60 

AAAGAATTTG GAGTTGAGAA TACAAAACGA TTAACTATGG CTCATATTGT TTTTTATCTC 120 

TCTTGCTTGG TTGAGGCAAT GGTGCACAAG ACAATTTTTG ATGGCATGGG CATGGTTGGT 180 

TTAGTCTTGC TTATTTTTTC TATGCTGATG TTGATGTTGG TGATTCACTT GTTGGGAGAT 240 

ATTTGGACAG TGAAGCTTAT GCTTGTCAAT AATCACAAAT ATGTAGATCA TATCTTGTTT 300 

AGGACAGTAA AACACCCTAA TTACTTTTTA AATATTCTTC CTGAGTTGAT TGGCTTGACC 360 

TTGTTGAGTC ATGCTTATGT GACTTTTGTT TTAGTTTTTC CAGTTTATGC AGTTATTTTG 420 

TATCGACGAA TAGCTGAAGA GGAAAAGCTA TTACATGAAG TTATAATCCC AAATGGAAGC 4 80 

ATAAAGAGAT AAATACAAAA TTCGATTTAT ATACAGTTCA TATTGAAGTG ATATAGTAAG 540 

GTTAAAGAAA AAATATAGAA GGAAATAAAC ATGTTTGCAT CAAAAAGCGA AAGAAAAGTA 600 

CATTATTCAA TTCGTAAATT TAGTGTTGGA GTAGCTAGTG TAGTTGTTGC CAGTCTTGTT 660 

ATGGGAAGTG TGGTTCATGC GACAGAGAAC GAGGGAGCTA CCCAAGTACC CACTTCTTCT 720 

AATAGGGCAA ATGAAAGTCA GGCAGAACAA GGAGAACAAC CTAAAAAACT CGATTCAGAA 780 
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CGAGATAAGG 


CAAGGAAAGA GGTCGAGGAA 


TATGTAAAAA 


AAATAGTGGG 


TGAGAGCTAT 


840 


GCAAAATCAA 


CTAAAAAGCG 


ACATACAATT 


ACTGTAGCTC 


TAGTTAACGA 


GTTGAACAAC 


900 


ATTAAGAACG 


AGTATTTGAA 


TAAAATAGTT 


GAATCAACCT 


CAGAAAGCCA 


ACTACAGATA 


960 


CTGATGATGG 


AGAGTCGATC 


AAAAGTAGAT 


GAAGCTGTGT 


CTAAGTTTGA 


AAAGGACTCA 


1020 


TCTTCTTCGT 


CAAGTTCAGA 


CTCTTCCACT 


AAACCGGAAG 


CTTCAGATAC 


AGCGAAGCCA 


1080 


AACAAGCCGA 


CAGAACCAGG 


AGAAAAGGTA 


GCAGAAGCTA 


AGAAGAAGGT 


TGAAGAAGCT 


1140 


GAGAAAAAAG 


CCAAGGATCA 


AAAAGAAGAA 


GATCGTCGTA 


ACTACCCAAC 


CATTACTTAC 


1200 


AAAACGCTTG 


AACTTGAAAT 


TGCTGAGTCC 


GATGTGGAAG 


TTAAAAAAGC 


GGAGCTTGAA 


1260 


CTAGTAAAAG 


TGAAAGCTAA 


CGAACCTCGA 


GACGAGCAAA 


AAATTAAGCA 


AGCAGAAGCG 


1320 


GAAGTTGAGA 


GTAAACAAGC 


TGAGGCTACA 


AGGTTAAAAA 


AAATCAAGAC 


AGATCGTGAA 


1380 


GAAGCAGAAG 


AAGAAGCTAA 


ACGAAGAGCA 


GATGCTAAAG 


AGCAAGGTAA 


ACCAAAGGGG 


1440 


CGGGCAAAAC 


GAGGAGTTCC 


TGGAGAGCTA 


GCAACACCTG 


ATAAAAAAGA 


AAATGATGCG 


1500 


AAGTCTTCAG 


ATTCTAGCGT 


AGGTGAAGAA 


ACTCTTCCAA 


GCCCATCCCT 


GAAACCAGAA 


1560 


AAAAAGGTAG 


CAGAAGCTGA 


GAAGAAGGTT 


GAAGAAGCTA 


AGAAAAAAGC 


CGAGGATCAA 


1620 


AAAGAAGAAG 


ATCGCCGTAA 


CTACCCAACC 


AATACTTACA 


AAACGCTTGA 


ACTTGAAATT 


1680 


GCTGAGTCCG 


ATGTGGAAGT 


TAAAAAAGCG 


GAGCTTGAAC 


TAGTAAAAGA 


GGAAGCTAAG 


1740 


GAACCTCGAA 


ACGAGGAAAA 


AGTTAAGCAA 


GCAAAAGCGG 


AAGTTGAGAG 


TAAAAAAGCT 


1800 


GAGGCTACAA 


GGTTAGAAAA 


AATCAAGACA 


GATCGTAAAA 


AAGCAGAAGA 


AGAAGCTAAA 


1860 


CGAAAAGCAG 


CAGAAGAAGA 


TAAAGTTAAA 


GAAAAACCAG 


CTGAACAACC 


ACAACCAGCG 


1920 


CCGGCTCCAA 


AAGCAGAAAA 


ACCAGCTCCA 


GCTCCAAAAC 


CAGAGAATCC 


AGCTGAACAA 


1980 


CCAAAAGCAG 


AAAAACCAGC 


TGATCAACAA 


GCTGAAGAAG 


ACTATGCTCG 


TAGATCAGAA 


2040 


GAAGAATATA 


ATCGCTTGAC 


TCAACAGCAA 


CCGCCAAAAA 


CTGAAAAACC 


AGCACAACCA 


2100 


TCTACTCCAA 


AAACAGGCTG 


GAAACAAGAA 


AACGGTATGT 


GGTACTTCTA CAATACTGAT 


2160 


GGTTCAATGG 


CGACAGGATG 


GCTCCAAAAC 


AATGGCTCAT 


GGTACTACCT 


CAACAGCAAT 


2220 


GGCGCTATGG 


CGACAGGATG 


GCTCCAAAAC 


AATGGTTCAT 


GGTACTATCT 


AAACGCTAAT 


2280 


GGTTCAATGG 


CAACAGGATG 


GCTCCAAAAC 


AATGGTTCAT 


GGTACTACCT 


AAACGCTAAT 


2340 


GGTTCAATGG 


CGACAGGATG 


GCTCCAATAC 


AATGGCTCAT 


GGTACTACCT 


AAACGCTAAT 


2400 


GGTTCAATGG 


CGACAGGATG 


GCTCCAATAC 


AATGGCTCAT 


GGTACTACCT 


AAACGCTAAT 


2460 


GGTGATATGG 


CGACAGGTTG 


GGTGAAAGAT 


GGAGATACCT 


GGTACTATCT 


TGAAGCATCA 


2520 
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GGTGCTATGA AAGCAAGCCA ATGGTTCAAA CTATCAGATA AATGGTACTA TGTCAATGGC 2580 

TCAGGTGCCC TTGCAGTCAA CACAACTGTA GATCGCTATG GAGTCAATGC CAATGGTGAA 2640 

TGGGTAAACT AAACCTAATA TAACTAGTTA ATACTGACTT CCTGTAAGAA CTCTTTAAAG 2700 

TATTCCCTAC AAATACCATA TCCTTTCAGT AGATAATATA CCCTTGTAGG AAGTTTAGAT 2760 

TAAAAAATAA CTCTGTAATC TCTAGCCGGA TTTATAGCGC TA6AGACTAC GGAGTTTTTT 2820 

TGATGAGGAA AGAATGGCGG CATTCAAGAG GCTCTTTAAG AGAGTTACGG GTTTTAAACT 2880 

ATTAAGCCTT CTCCAATTGC AAGAGGGTTT CAATCTCTCC CAGGGTGCTG GCTTGCGAAA 2940 
TGGCTCCACG GAGTTTGGCA GCGCCAGATG TTCCACGGAG ATAGTGAGGA GCGAGACCGC . 3000 

GGAATTCACG AACTGCGACG TTTTCTCCTT TGAGGTTAAT CAATCGTTTC AAGTGTTCGT 3 060 

AGGCGATCTT CATCTTGTCT TCAAAGGTCA AATCAGGTAG GATTTCTCCT GTTTCAAAGT 3120 

AATGGTTGAT TTGGTTGAAG AGGTAAGGAT TTCCCATGGC AGCTCGGCCA ATCATGACTG 3180 

CGTCAGCACC AACTTCTTCG ATGCCTTGCT TGGCTTCTTG GACAGTACGG ATATCACCGT 3240 

TGGCGATGAA TGGAATCTTG GTTAGAGCTT GGGCAACCTT GTAAAGGGTC TCAAGGTCTG 3300 

CGTGGCCAGT ATACATTTGT TCACGGGTAC GGCCATGCAT GGCGAGGGCA GAAACACCTG 3360 

CAGCTTCAGC AGCGAGAGCA TTTTCTACTG CAAGAGATGG GTCCGCCCAG CCGGTACGCA 3420 

TTTTGACAGT AAGTGGGATA TCAAGGACAG ACTGGACCTT GTTGATGATG GAGTAAATCT 3480 

TGTCTGGATC CTTGAGCCAC ATAGCACCAG CTTCGTTCTT CACGATTTTG TTGACAGGGC 3540 

AGCCCATGTT GATATCGACG ATATCGGTCT TGGTGTTTTC TTGGATGAAT TCTGCTGCGC 3600 

GTGCTAGGCT GTCTTCATCG CTACCAAAAA GTTGGATAGA GACAGGGTTT TCGCCCTCAT 3 660 

CGATATGAAG CATGTGCAGG GTTTTTTCGT TGTTGTATTG GATTCCCTTG TCAGAGACCA 3720 

TTTCCATTAC AACGAGTCCA GCTCCGAGCT CCTTTGCGAT AGTACGAAAG GCTCAGTTGG 3780 

TCACGCCAGC CATAGGCGCT AAAACGGTAC GATTGGGAAT CTCAATATTG CCAATCATAA 3840 

AAGGTGTATT AAGATTTGTC ACGAATGAGT TCCTCCAGGT CCTTTTCATC AAAGTTGTAA 3900 

GTACTTTGGC AGAATTGACA AGTGATTTCT GCCCCGTGGT CTTCCTCTTT CATTTCCTGT 3960 

AAGTCTGAGC TTGGAAGGCT GGCAAGAGCG TTCATAAAGC GTTCATGGCT ACAGTCACAT 4020 

TGGAAACGGA TTTCTTCTTC AGAAAGACGC TTGTAGGCTT CGTCCCCGTA GATAGCCTTC 4080 

AGGAGGGCTT CGATATGGTC GTCGCTTTCG AGAAGACTAG AGATAGCTGG CATTTCTTGG 4140 

ATGCGTTTTT CAAAGCGAGC AATCTCTTCT TTCTTGGCTC CTGGCAAGAC TTGAACTAGG 4200 

AAACCACCTG CAACCTTGAC CTTGTCTTCC TCGTCCAAAA GGACATTGAG GCCGACCGCT 4260 

GAAGGCGTTT GTTGGCTTTC AGTAAGGTAA AAGGCAAGGT CTTCACCGAT TTCTCCAGAG 4320 
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ATGAGGGGAG TTATAGAGTT GTAAGGATTT CCAGTACCGT AGTCTGTCAT AACGAGGAAT 4380 

TGACCATTTC CAACAAAAGG TCCGACTAGG ACTTCACCAG TCGCAGTCTT TTTGATGTCA 4440 

ACACCAGGAT TTTGAACATA GCCTTTGACG TTCCCCTTGG TATCAGCGAC GGTGATAATA 4500 

GCACCTAGAG AGCTAGATCC CAACACCTTA ACTGTAAGTT TGGTATTTCC TTTTTCATTG 4560 

GCTGCGAGAA TCTGGCTAGC GATAAGAGTT CGACCAAGCG CTACAGTTGA GCTAGCTTGG 4 620 

GTTTGATGTT TTTCTTGAGC AGTGCGGACG GTTTCAGTGC TATCAAGGAC AAAAGCACGA 4680 

AAGGcTCCGC TTTCTGATAT AGTTTTAATA ATTTTATCCA TAGCTACTAT TTTAGCATAA 4740 

AAATGCCCAA AGGGGGAGCC GTGTGTTTAC TGATTTTCAG GATAATGGAC CAGGAAATCA 4800 

GCATGAAAAT AAAAAGAGAA ACAGATTATT TTAGCATTTG TCAGATTTAT GCTATGCTTA 4860 

AGGTAGAAAA TGAAAGGGAT AACAAATGTA TTTAGGAGAT TTGATGGAGA AAGCCGAGTG 4920 

TGGTCAATTT TCAATACTTT CCTTTCTATT ACAAGAGTCT CAGACGACCG TCAAGGCTGT 4980 

AATGGAAGAA ACAGGATTTT CAAAAGCAAC CCTAACCAAA TATGTCACCC TGCTCAATGA 5040 

CAAGGCTTTG GATAGTGGCT TAGAGCTGGC TATTCACTCA GAAGATGAAA ATCTGCGTCT 5100 

GTCTATCGGT GCAGCTACCA AGGGGAGAGA TATTCGGAGC TTGTTTTTGG AGAGTGCTCT 5160 

TAAATACCAG ATTTTGGTTT ATCTTCTCTA CCACCAACAG TTTTTAGCCC ATCAGCTGGC 5220 

TCAAGAATTG GTGATTAGCG AGGCTACGCT TGGTCGTCAC TTGGCTGGTT TAAATCAGAT 5280 

TTTGTCAGAA TTTGATTTAT CCATCCAAAA TGGCCGTTGG CGAGGTCCAG AGCATCAGAT 5340 

TCACTATTTC TATTTCTGTC TTTTCCGAAA GGTCTGGTCG AGTCAGGAAT GCGAAGGTCA 5400 

CATGCAGAAA CCAGAGAGAA AACACGAGAT TGCCAATTTA GAGGAAATCT GCGGTGCAAG 54 60 

TTTGTCTCCG GGGCAGAAAT TGGACTTGGT TCTCTGGGCT CACATCAGTC AACAACGTCT 5520 

TCGGGTCAAT GCTTGTCAGT TTCAAGTCAT AGAAGAGAAA ATGCGAGGGT ATTTTGACAA 5580 

TATCTTTTAT CTTCGTTTGC TGAGAAAGGT TCCGTCCTTT TTTCCTGGGC AACATATTCC 5640 

ACTAGGAGTT GAGGATGGTG AGATGATGAT ATTCTTCTCT TTTCTCCTAT CTCATCGCAT 5700 

TCTTCCTCTT CATACTATGG AGTATATTCT TGGTTTTGGA GGGCAGTTGG CAGATTTACT 5760 

GACGCAATTG ATTCAAGAAA TGAAGAAGGA GGAACTATTG GGGGATTATA CAGAGGACCA 5820 

TGTCACCTAT GAACTCAGTC AGCTTTGTGC TCAAGTCTAT CTCTATAAGG GCTATATTTT 5880 

ACAGGATCGC TACAAGTACC AGTTAGAGAA TCGTCATCCA TATTTACTGA TGGAACATGA 5940 

TTTTAAAGAG ACAGCAGAGG AGATTTTTCA TGCTCTACCT GCTTTTCAAC AGGGGACAGA 6000 

TTTAGATAAG AAGATTCTCT GGGAATGGCT CCAGTTAATC GAATATATGG CTGAAAACGG 6060 
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TCGCCAGCAT ATGCGGATTG GTCTGGATTT GACATCTGGT TTTCTTGTCT TTTCAAGGAT 6120 

GGCAGCCATT TTGAAACGGT ATTTGGAATA CAATCGTTTT ATTACCATTG AAGCTTATGA 6180 

CCCTAGTCGG CATTATGATT TGCTGGTTAC CAATAACCCG ATTCATAAGA AGGAACAGAC 6240 

ACCAGTCTAT TATTTAAAAA ATGACTTGGA TATGGAGGAT TTGGTAGCGA TTCGCCAGTT 6300 

ATTATTCACT TAAAAGGCTT GGTTAATCCA GGTCTTTTTT GTGAAATTCA CACAATCTCC 6360 

TCACATTTTT TTAAAAATTA AAAAAAGTTG ATAAACAAGA AAGCGCTTTA TTTTGTATAC 6420 

TAGTAAGTGT AAAGAGGAAA CACCTCAAGA TCTTTATCAG GAGGACAGTA CATGTCACAA 6480 

GAAAAATACA TCATGGCCAT TGACCAGGGA ACTACAAGTT CTCGTGCCAT CATTTTCAAC 6540 

AAAAAAGGGG AAAAGGTTAG CTCGAGTCAA AAAGAGTTTA CCCAGATTTT CCCTCAGGCA 6600 

GGTTGGGTTG AGCACAATGC CAATGAAATT TGGAACTCTG TTCAGTCAGT TATTGCGGGT 6660 

GCTTTCATCG AAAGTGGTGT CAAGCCAAAT CAAATCGAGG CAATCGGGAT TACCAACCAA 6720 

CGTGAAACAA CGGTTGTCTG GGATAAGAAA ACAGGACTTC CTATCTACAA TGCTATCGTT 6780 

TGGCAGTCAC GCCAGACAGC ACCTTTGGCT GAGCAACTAA AAAGCCAAGG TTATGTGGAA 6840 

AAATTCCATG AAAAGACTGG TTTGATTATT GATGCTTACT TCTCTGCTAC CAAGGTTCGT 6900 

TGGATTTTGG ATCATGTAGA AGGTGCTCAA GAGCGAGCAG AAAAAGGGGA ATTGCTCTTT 6960 

GGTACTATCG ATACTTGGTT GGTTTGGAAA TTGACTGACG GTGCGGCTCA CGTGACTGAC 7020 

TACTCAAATG CAGCTCGTAC CATGCTTTAT AACATTAAAG AACTCAAATG GGATGATGAG 7080 

ATTTTGGAAA TCCTTAACAT TCCGAAGgCT ATACTTCCAG AAGTTCGTTC TAACTCCGAA 7140 

ATCTACGGCA AGACAGCTCC ATTCCATTTC TACGGTGGAG AGGTGCCAAT CTCAGCTATG 7200 

GCTGGGGACC AACAAGCAGC CCTCTTTGGA CAGTTGGCTT TTGAGCCAGC TATGGTTAAG 7260 

AATACTTATG GAACAGGCTC TTTCATCATC ATGAATACTG GGGAAGAGAT GCAGTTGTCT 7320 

GAAAACAACC TCTTGACAAC CATTGGTTAC GGAATCAACG GTAAGGTTTA TTATGCCTTG 7380 

GAAGGTTCTA TCTTCATCGC AGGAAGTGCT ATTCAGTGGC TTCGTGACGG TCTTCGCATG 7440 

GTTGAAAATT CACCAGAATC TGAAAAATAC GCTCGTGATT CTCACAACAA CGATGAAGTT 7500 

TATGTCGTTC CAGCCTTTAC AGGTCTAGGC GCTCCATACT GGAACCAAAA TGCTCGTGGT 7560 

TCCGTCTTTG GTTTGACTCG TGGAACAAGC AAAGAAGACT TTATCAAGGC GACTTTGCAA 7620 

TCTATTGCTT ATCAAGTGCG TGATATCATC GACACCATGC AAGTGGATAC TCAGACCGCC 7680 

ATTCAAGTAC TGAAGGTGGA TGGTGGTGCA GCCATGAACA ACTTCCTCAT GCAGTTCCAG 7740 

GCGGATATTT TAGGCATTGA CATTGCACGT GCTAAAAACC TGGAAACAAC AGCTCTAGGA 7800 

GCGGCCTTCC TAGCAGGTTT GTCAGTAGGG TACTGGAAAG ACTTGGACGA GTTGAAACTC 7860 
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TTGAACGAGA CAGGAGAACT CTTTGAGCCA TCTATGAACG AATCTCGCAA GGAACAACTC 7920 

TACAAGGGCT GGAAGAAGCC TGTGAAAGCA ACTCAAGTCT TTGCGGAAGT AGACGACTAA 7980 

TACTGGCAGA ATAAAGCGAT TTATTTAGAA AGTGTGTAAA TATGGAATTT TCAAAGAAAA 804 0 

CACGTGAATT GTCAATTAAA AAAATGCAGG AACGTACCCT GGACCTCTTG ATTATCGGTG 8100 

GAGGAATCAC AGGAGCTGGT GTAGCCTTGC AGGCGGCAGC TAGCGGTCTT GAGACTGGTT 8160 

TGATTGAAAT GCAAGACTTT GCAGAAGGAA CATCTAGTCG TTCAACAAAA TTGGTTCACG 8220 

GAGGACTTCG TTACCTCAAA CAATTTGACG TAGAAGTGGT CTCAGATACC GTTTCTGAAC 8280 

GTGCAGTGGT TCAACAAATC GCTCCACACA TTCCAAAATC AGATCCAATG CTCTTACCAG 8340 

TTTACGATCA AGATGGAGCA ACCTTTAGCC TCTTCCGTCT TAAAGTAGCC ATCCACTTGT 8400 

ACGACCTCTT GGCAGGTGTT AGCAACACAC CAGCTGCGAA CAAGGTTTTG AGCAAGGATC 8460 

AAGTCTTGGA ACGCCAGCCA AACTTGAAGA AGGAAGGCTT GGTAGGAGGT GGAGTGTATC 8520 

TTGACTTCCG TAACAACGAT GCGCGTCTCG TGATTGAAAA CATCAAACGT GCCAACCAAG 8580 

ACGGTGCCCT CATTGCCAAC CACGTGAAGG CAGAAGGCTT CCTCTTTGAC GAAAGTGGCA 8640 

AGATTACAGG TGTTGTAGCT CGTGATCTCT TGACAGACCA AGTGTTTGAA ATCAAGGCCC 8700 

v 

GTCTGGTTAT TAATACAACA GGTCCTTGGA GTGATAAAGT ACGTAATTTG TCTAATAAGG 8760 

GAACGCAATT CTCACAAATG CGCCCAACTA AGGGAGTTCA CTTGGTAGTA GATTCAAGCA 8820 

AAATCAAGGT TTCACAGCCA GTTTACTTCG ACACAGGTTT GGGTGACGGT CGTATGGTCT 8880 

TTGTTCTCCC ACGTGAAAAC AAGACTTACT TTGGTACAAC TG AT AC AG AC TACACAGGTG 8940 

ATTTGGAGCA TCCAAAAGTA ACTCAAGAAG ATGTAGATTA TCTACTTGGC ATTGTCAACA 9000 

ACCGCTTCCC AGAATCCAAC ATCACCATTG ATGATATCCA AAGCAGCTGG GCAGGTCTTC 9060 

GTCCATTGAT TGCAGGGAAC AGTGCCTCTG ACTATAATGG TGGAAATAAC GGTACCATCA 9120 

GTGATGAAAG CTTTGACAAC TTGATTGCGA CTGTTGAATC TTATCTCTCC AAAGAAAAAA 9180 

CACGTGAAGA TGTTGAGTCT GCTGTCAGCA AGCTTGAAAG TAGCACATCT GAGAAACATT 9240 

TGGATCCATC TGCAGTTTCT CGTGGGTCTA GCTTGGACCG TGATGACAAT GGTCTCTTGA 9300 

CTCTTGCTGG TGGTAAAATC ACAGACTACC GTAAGATGGC TGAAGGAGCT ATGGAGCGCG 9360 

TGGTTGACAT CCTCAAAGCA GAATTTGACC GTAGCTTTAA ATTGATCAAT TCTAAAACTT 9420 

ACCCTGTTTC AGGTGGAGAA TTGAACCCAG CAAATGTGGA TTCAGAAATC GAAGCCTTTG 94 80 

CGCAACTTGG AGTATCACGT GGTTTGGATA GCAAGGAAGC TCACTATCTG GCAAATCTTT 9540 

ACGGTTCAAA TGCACCGAAA GTCTTTGCAC TTGCTCACAG CTTGGAACAA GCGCCAGGAC 9600 
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TCAGCTTGGC AGATACTTTG TCCCTTCACT ATGCAATGCG CAATGAGTTG ACTCTTAGCC 9660 

CAGTTGACTT CCTTCTTCGT CGTACCAATC ACATGCTCTT TATGCGTGA? AGCTTGGATA 9720 

GTATCGTTGA GCCAATTTTG GATGAAATGG GACGATTCTA TGACTGGACA GAAGAAGAAA 9780 

AAGCAACTTA CCGTGCTGAT GTCGAAGCAG CTCTCGCTAA CAACGATTTA CCAGAATTAA 9840 

AAAATTAAGA AAAAATAAAA GAGGTGGAGG GCAGCATTCC TTGTCGCCCG TCCCTTCTTT 9900 

TTAATGGAGA CAGAAAGATG ATGAATGAAT TATTTGGAGA ATTTCTAGGG ACTTTAATCC 9960 

TGATTCTTCT AGGAAATGGT GTTGTTGCAG GTGTGGTTCT TCCTAAAACC AAGAGCAATA 10020 

GCTCAGGTTG GATTGTGATT ACTATGGGTT GGGGGATTGC AGTTGCGGTT GCAGTCTTTG 10080 

TATCTGGCAA GCTCAGTCCA GCTTATTTAA ACCCAGCTGT GACCATCGGT GTGGCCTTAA 10140 

AAGGTGGTTT GCCTTGGGCT TCCGTTTTGC CTTATATCTT AGCCCAGTTC GCAGGGGCCA 10200 

TGCTGGGTCA GATTTTGGTT TGGTTGCAAT TCAAACCTCA CTATGAGGCA GAAGAAAATG 10260 

CAGGCAATAT CCTGGCAACC TTCAGTACTG GACCAGCCAT CAAGGATACT GTATCAAACT 10320 

TGATTAGCGA AATCCTTGGA ACTTTTGTTT TGGTGTTCAC AATCTTTGCT TTGGGTCTTT 10380 

ACGACTTTCA GGCAGGTATC GG AACCTTTG CAGTGGGAAC TTTGATTGTC GGTATCGGTC 10440 

TATCACTAGG TGGGACAACA GGTTATGCCT TGAACCCAGC TCGTGACCTT GGACCTCGTA 10500 

TCATGCACAG CATCTTGCCA ATTCCAAACA AGGGAGACGG AGACTGGTCT TACGCTTGGA 10560 

TTCCTGTTGT AGGCCCTGTT ATCGGAGCAG CCTTGGCAGT GCTTGTATTC TCACTTTTCT 10620 

AGTTTATACT CTTCGAAAAT CAAATTCAAA CCACGTCAGC GTCGCCTTAC CGTACTCAAG 10680 

TACAGCTTCC GGCTAGCTTC CTAGTTTGCT CTTTGATTTT CATTGAGTAT TAGAAAACAA 10740 

TTATGTTGAT AGAGCTTGGG CAAGAGCCCA ATTTCAGCAA AAAATGAAGT AAATCTTCTC 10800 

ATAATAAAAC GCATCATATC AAGCACGAAA ATTCCACGAG GTCAACTACA GTCAGAAAGC 10860 

TGAACAACAA GCCAAAACGC CCAAAAAAGG CGGCAAAAAG CAAGCACCTG CAAGCAACGT 10920 

GCCGAAATGG TCAAATCCTG ATTATGTCAA CGAATTAGAC CCAAAAATCG TTGATA7GCT 10980 

AGTAGAATTT CACAAGTCAC AAGCCACTTT GGAAACTCCC GAGGCGCAAG CAGAAATCGC 11040 

CCAAAAACGT GAAGAAATCG AGCAAAGGAG AGCTGAGCTT GAGGGTAAAA AACAAGAGCT 11100 

TTTGAACCGC TTGAACAAAT AGAGTTTCGC AAGTATTATG CTTACAAATT ACTTGAGCAA 11160 

TTAACTAAAA TATAAACCCT GCCTTTATAT CTAGGCAGGG TTTATATTTT AGAAATTCAC 11220 

GTAGGTTGTT ACGGTTTTTA CATACCCAGT ATAGTTTGAG TTTCTATAGT ATTCAGTGAT 11280 

AAACTTCCAT TTTCTTTGAG CAACATGGAT ATAAGTACTT GTTATGTAGT ATGGATATGG 11340 

GCTTTGTGAA TCCAAGTAAG ACTGATAAGC TTGTATACCA AAATATGCTC CACCAATTAT 11400 
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TGCACCCCAT GGACCCCCCA ATAAAGCACC TATCCTACCA ATCATATAAC TGATTCCAGC 11460 

ACCAGTCATG AAGTTAGCGA ATGTGTTAGC TTGTTTATTC CCATGTATTG TGTTGACGTA 11520 

ATTCCAAACA TTAGGATCGT ATGATCTAAA AGATATATTT AGGTCGATTT CATTCTTTTG 11580 

ATAAGCCATA TAAAATGCCC CATTGATATA GACGCCGTCA GCACGTCGTT CAATAGTGTC 11640 

TACACTTCCA TCTGGATTGA CAACCTCAAG AACTTCATCG CTTAAAATAT TTACTTGCGT 11700 

ATCTCCGAAC CGCACTGATG AGCCATTCTC AAACTGAGCC TCACCAGATA CAACTTTAGA 11760 

GTTTGCCGAT AAGCTATCAT CAGCAAAAAC AAACAAGCGA CGGGGAAATG CTAGACATAC 11820 

AGAAAACAGA CATAACTAGC AAACACATGC ATTTAAACAT CTTAGACATA ACGGAAACTC 11880 

CTTTGTATTT TTGATTTTTT TCAACTTTTA TTATACAATA AAACCAAATA AAAAGAAAGC 11940 

GGTAACAATA TGCTTAATGC GAAAATTTTT TATATATTTT TATGTTTGAT CGTTATCGAA 12000 

ACTACAGGCT TGTTGTTGTT GAAAAGAGGT CTCGAAATGG GTTATTTAGA CACAGAAGCT 12060 

ATTATCCTCG CAGTTTTTTC ATTTGCTTTT TACAACCTAT GTTCATTCGC TTGGGTCTGC 12120 

TCTACAATAA AAAACAATAA AAAATAAATA GACCTATTTT CAAAAAAAAC maAATGCATA 12180 

TTTATATTAG CAAAACGACG ATTTAAATCG TCGTTTTTTT GTAGTACGAC GGGCATGTCG 12240 

TATATCTGAG GTGTAAGTCC TCAGCCTGAC TATCGTCAGG TAGCAGGGAG AGGAAGGGAT 12300 

AGCGAAATCG TGGCTCTACG AACAGGAACG TGATAGTAAG GCGTATATAG CGGATAAGGA 12360 

GGCTTCAAAC TCTAAAGTCC AAAAAGGTAG TCGTAACCTA TATGTGTAAA TCACGAGAGT 12420 

AATTGAATTC GGACTAAGGT TTGTGTGAAA AAGATAAATC TTTCTAGAGT CTAAAGACTC 12480 

TGCGTCAGAT TTCCTATTTT CACTGTAACC TTTTAACGTC CTCATATCTT GTATAAACGA 12540 

GGAAAGATGT ACGACTTATC CCGTGAGGTT TCATGAGCGT GAAAGCGTAG TAACAACGAA 12600 

TCATGAGAAG TCAGCCGAGC CCATAGTAGT GAGGAAACTT CCGTAATGGA AGTGGAGCGA 12 660 

AGGGG 12665 
(2) INFORMATION FOR SEQ ID NO: 135: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5305 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 135: 
CGCTAATCAC TACAATCATT TTATTGTACT TTTTCACTCT CAAGAAAAGC AAGAAGTATT 



60 
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CATTTTAGTT TCATTTAGTA TTATTTTGCA TACCTAAAAT ACAGTAAAAA ATCAGTCATC 120 

TTGGTATGCT CCTGCTTTCA CTATTCAACA CGTTTTTGAC TTATACTAGG CTCATTTCCA 180 

AAACCATTAT ATAATAGTGA TATGAAACCA ACTAAACTAA ACAAGAAATA TAAGCAATAA 240 

AAATTCGTTT AAAAGATCTT ACTAAAGCTA ATACTAAATA AAAATAAAAG AGTAAACTAG 300 

GAAGTTTATT TCAAACAACC TAAAATACTG ATTTTCGGCT GAAGATAATA CTGGAGTGCA 360 

AATTAATGGG GTTATAATAA ATAGCTGATA GCTTGTGTTG GTTTTGGATT TTTTAAGAGT 420 - 

AGATGAGTAT TAAAACTATA AGGAGGACGA AGGTGGCTAA AAATTTAAAA TTAAAATTAG 480 

CTCGGGTAGA GCGTGATTTA ACACAAGGTC AACTGGCAGA GGCTGTCGGG GTCACACGCC 540 

AGACTATTGG TTTAATAGAG GCGGGAAAAT ACAATCCCAG TCTCTCGCTC TGCCAGTCTA 600 

TTTGCAGATG TTTAGGGAAA ACCCTAGACC AACTATTTTG GGAGGAAGAA GATGAAAAAT 660 

AGATTTTATT ATTCTCAATT ACTAGACGAA AGAGAAGAAC AACTGTTCAA TAAAGCGGGC 720 

TCTGAAAGTT TCTATATCTG CATTGCTTTG TCGCTCCTAT CTTATATCAT TTCAGTATTA 780 

GCACCAAGCC TTTTTAATTC TAATATGCTG CTAATCGTTA TCATCATAGG GACATTTTAC 840 

TTTTTCAATC GTGCCCGTTA TCTGGGAGTG ACCTACTATG GTCGTTTTCA TTTTACGATT 900 

TTGGGTTGTT TTTTCCTAAC CTTGGCTATT ACGGCTCTTT TGATGTTGCA GAATTATCAA 960 

TTCAACATAG AAATTTATCA GCACAATCCT TTGAATTTTA AATACCTGTC TGCTTGGGTC 1020 

ATTACTTATA TCATTTACCT TCCGTGGATC TTTATTGGCA ATCTTGGTCT TAAGAGCTAT 1080 

GGCGAATGGG CTCAGAAAAA ATTTGAACAA GATATGGATG AATTGGAGAG TGGAGAATAG 1140 

CTTGTTACTC TTTTCTCAAT CCAGCTAAAA TGTGATATAA TAGTACTAAT TTATTGGAAT 1200 

ACATGAAAGT TCTTGAAAAT TTTCATGGCT TTCTAGCTAA GGAAGTAGGA AAAGTATGTA 1260 

TCCAGATGAT AGTTTGACAT TGCACACGGA CTTGTACCAG ATCAACATGA TGCAGGTTTA 1320 

CTTTGACCAA GGGATTCACA ATAAGAAGGC GCTCTTTCAG GTGTATTTCC GCCAACAGCC 1380 

TTTTAAGAAC GGCTATGCGG TTTTTGCAGG TTTAGAAAGA ATTGTGAACT ATCTTGAAGA 144 0 

CTTCCGTTTT TCAGATAGTG ATATAGCCTA TTTGGAGTCG CTTGGTTATC ATGGGGCGTT 1500 

CTTGGATTAC CTTCGCAATT TCAACTTGGA GTTGACCGTT CGTTCTGCCC AAGAAGGGGA 1560 

TTTGGTTTTT GCTAATGAAC CGATTGTGCA GGTGGAAGGA CCTCTAGCCC AATGTCAGTT 1620 

GGTCGAAACG GCTCTTTTGA ACATCGTCAA CTACCAGACT TTGGTGGCGA CGAAGGCAGC 1680 

TCGTATTCGT TCGGTTATCG AAGATGAACC CTTGATGGAG TTTGGGACAC GTCGGGCTCA 1740 

AGAAATGGAT GCGGCCATCT GGGGAACACG CGCAGCTGTG ATTGGTGGCG CCAATGGAAC 1800 

CAGCAACGTG CGTGCGGGTA AGCTCTTTGA CATTCCTGTT TTGGGAACCC ATGCCCATGC 1860 
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CTTGGTACAG GTTTATGGCA ATGACTATGA AGCTTTCAAG GCTTACGCTG CGACCCACAA 1920 
AAATTGTGTC TTTCTTGTGG ATACCTATGA CACCCTTCGC ATCGGTGTAC CAGCTGCCAT 1980 
TCAGGTGGCG CGTGAGCTGG GTGATCAGAT TAACTTTATG GGTGTGCGGA TTGACTCTGG 2040 
GGATATTGCC TACATTTCTA AGAAAGTCCG TCAGCAACTG GATGAGGCTG GATTTACAGA 2100 
GGCTAAGATT TATGCTTCTA ATGATCTAGA TGAAAATACC ATCCTTAACC TCAAGATGCA 2160 
AAAGGCCAAG ATTGATGTCT GGGGTGTGGG TACCAAGCTG ATTACAGCCT ATGACCAGCC 2220 
GGCTCTTGGG GCGGTTTACA AGATTGTTGC AATCGAAGAT GAAACTGCTC AGATGCGCAA 2280 
TACGATTAAG CTGTCTAATA ATGCTGAAAA AGTTTCTACG CCAGGTAAGA AGCAGGTGTG 2340 

GCGCATTACC AGTCGTGAAA AAGGCAAGTC AGAAGGCGAC TATATCACTT ATCATCGTGT 2400 

GGATATTAGC GACATGACAG AAATCAAGAT GTTCCATCCG ACCTATACAT ACATCAAGAA 24 60 

GACGGTTCGT AATTTTGATG CCGTTCCTCT CTTGGTGGAT ATCTTCAAAG AAGGAATATT 2520 

AGTTTACAAC TTGCCTAGTT TGACTGACAT TCAGGATTAT GCCCGTAAGG AATTTGACAA 2580 

GTTGTGGGAT GACTATAAGC GTGTGCTCAA TCCGCAGCAC TATCCAGTGG ATTTGGCGCG 2640 

TGATGTATGG CAAGATAAGA TGGACTTGAT TGATAAGATG CGCAAGGAAG CCCTTGGTGA 2700 

AGGAGAAGAA GAATGAGTTT GCAAGAAACG ATTATCCAAG AGCTGGGTGT CAAACCAGTG 2760 

ATTGATGCCC AGGAAGAAAT CCGTCGTTCT ATTGATTTCT TAAAAAGATA TCTGAAAAAA 2820 

CATCCCTTCC TAAAAACCTT TGTACTAGGG ATTTCTGGGG GACAAGACTC AACCTTGGCA 2880 

GGACGTTTGG CGCAATTAGC TATGGAAGAA CTGCGAGCTG AAACGGGAGA CGATAGCTAC 2940 

AAATTTATCG CTGTCCGCCT GCCATACGGA GTGCAAGCTG ATGAAGCAGA TGCTCAAAAA 3000 

GCCCTAGCCT TCATCCAGCC AGATGTCAGC TTGGTTGTGA ATATCAAGGA ATCAGCTGAT 3060 

GCCATGACAG CTGCAGTTGA AGCGACAGGT AGTCCTGTTT CAGACTTCAA CAAGGGGAAT 3120 

ATCAAGGCAC GTTGCCGTAT GATTGCTCAG TATGCCCTTG CTGGTTCCCA TAGCGGAGCG 3180 

GTCATTGGAA CAGACCACGC CGCGGAAAAT ATCACAGGTT TCTTTACCAA GTTTGGTGAC 3240 

GGCGGTGCGG ATATTCTCCC TCTTTACCGC CTCAATAAAC GCCAAGGAAA ACAGCTCTTG 3300 

CAGAAACTTG GCGCAGAGCC AGCCCTTTAT GAAAAAATCC CAACGGCAGA CCTAGAAGAA 3360 

GATAAACCAG GCCTAGCTGA CGAAGTCGCA CTTGGAGTCA CCTACGCAGA GATTGACGAC 3420 

TACCTAGAAG GCAAAACAAT CAGCCCAGAA GCTCAAGCGA CCATTGAAAA CTGGTGGCAC 3480 

AAAGGCCAAC ACAAACGCCA CTTACCCATC ACCGTATTTG ATGACTTTTG GGAGTAAAAA 3540 

GGTCCGGGGG ACCTTTTTAG CTTCTTGCCC TGAAATTAAA AAGCAAGAAA AACCTCCACT 3600 
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GGAGGTTTTC AGCCTCTCAT CTTGAAATAA GAAAGTGAGA GAAGGTCTGG GGGATCTTGA 3660 
ACCCCGAGTT TAGAAATAAG AAAATGAGGC AGATTCAGTA ACTCGAAGAG TTCGATTTCA 3720 

TCGTCTTACC CCTGCAACGA TGACTAGGTT TGAAAAAGCT TGCTAGAGCG CATTTCAAAC 3 7 BO 

CAGGCAGCAA CTGCGTCAAG AAATTAGAAG ACAAACTCGT TTTCTAGCTG TTACTGAGTT 3B40 

GAGCCTTTTT ACTACGAGTA TAGAAATAAG GAAGTGAGGT AGCATCATGA AATCTATCGG 3900 

TACGCAAATA TTACAGACAG AACGTTTGAT TTTAAGAAGA TTTGTGGAGA GTGATGCAGA 3960 

AGCCATGTTT CAAAATTGGG CTTCATCCGC TGAGAATCTG ACCTATGTTA CCTGGGATCC 4020 

CCATCCTGAT GTCGAAATCA CTCGAAACTC GATTTGCAAT TGGGTTGCTT CCTATACTAA 4080 

TCTCAACTAT TATAAATGGG CCATTTGTCT AAAAGAAAAC CCAGAGCAAG TAATAGGAGA 4140 

TATCAGCATT GTTAAGATAG ACGAGGCTGA TTTAAGCTGT GAAATTGGCT ATGTGTTAGG 4 200 

CAAGGCTTAC TGGGGAAATG GTATGATGAC AGAGACTTTG AAAGCTATCT TGGACTTTTG 4260 

TTTTACTCAA GCAGGTTTTC AAAAGGTCAG AGCACGTTAT GCCAGTCTCA ACCCAGCTTC 4320 

AGGTCGTCTC ATGGAAAAGG CTGGAATGTC CTATCTACAA ACCATTGTTA ATGGTGTAGA 4380 

GAGAAAAGGC TATCTTGCGG ATCTTATTTA TTATGGTATA AGTACGGAAG AATGTTGAAT 4440 

TCTATTTTCT GTTTCTATCG AAGTCAACTA TTTATTGTAA ATATAATAAT TAGCATTCCA 4500 

AGTTTATTTG AAACTTTAAA ATAGCATATT GATTAGTACA ACACAGATGT TCTAGTTCCT 4560 

TCTTTAATCT GGTTTAGTGT TAGTTAAAAA ATCGCTTTAA GCTTGTAACT AAGAGGGAGC 4620 

TAATCGACTA GATTCTCCAG CCGAACAGCX GGTAATGTAC TTTTTATAGT GTAATCCTAG 4680 

CTGTTGTTAA ATTTAAAATA GAATCCTCTA TCGAGTTAGG GAATTAAATT CAACCAATTT 4740 

TATTCATGTT TTTTCTATCA AATTATCTAA TATTAAAATA GTCTCATTCT GATGAGAAAA 4 BOO 

CTATTCCCAA ATCATTCATA CCTCTCTCAA CTAGATGTAA CTTACAAAAC CCCTGACCTC 4860 

ATGAGCCACT TTCTTCCTCC TCATGAGGTC AGTTTTACTT TCTGCTGTTC CAGTATCGTT 4 920 

TTTCCTCGCT AGATTTCCTC AAAAGGGCAG ACTCCTCCCT TGGTGCGTCA CACGATTTTT 4980 

TCATCTCGAC TGTTCTTTAA TGCATCATTA ACGACGCTTT TCTTCTAGGT GGTTCATAAG 5040 

GAACAGGAAG ATTCAGGTTG ACTTTTCTAA TCCTAGAATA AAGTGCTGAA AACAATTCGG 5100 

AATAGGCATA GAGACTAGAC AATTTGAGGA GCTGCTTGCG TCCTGTTCGA ACACATTTTC 5160 

CCACCACGTG AAGAAAAAGA TGGCGGAAGC CTTTGATTGT TAAAGTTTGG AAGTCACCTC 5220 

CAGCTAGATG TTTGAGAAAA AGATAGAGAT TGTAGGCGAT ACAGCTCATC ATCATACGAA 5280 

CTTCGTTTTT GATTAAGGTT GAACT 53 q 5 
(2) INFORMATION FOR SEQ ID NO: 136: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3964 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xij 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


136: 






TGGCAGCTCG 


TCGTCGTAAA 


GGACGCAAAG 


TTTTGGCTGC 


ATAATCCAAA 


CGAATTCTAT 


60 


CAAAAATCAG 


TAGGAACTCG 


AGTCTACTGA 


TTTTTATTTT 


TGTAAAAAAG 


TTCAGTAGAT 


120 


GCAAATGGAT 


TCGGAAGCGA 


TGTTACAGTA 


GATTGAAACT 


AGAATAGTAC 


ACCTCTGTTT 


160 


CTAAAACATT 


GTTAGAAATC 


GATTTGACTG 


TCCTGATCGA 


TTTGTCCTGT 


TATTATTTTA 


240 


TTTTACTATA 


AAGTTGAAGT 


AGGTGGAGAT 


GGTACAGCAA 


CAATCGTCTT 


TAAAGATGGT 


300 


TCAGCTATTA 


CAATTCCAGG 


AAATCAATTG 


GTAGCACAAG 


ATCCAAAAGC 


ACAAGATAGC 


360 


ACTAAACTGA 


CTGCTGAAAA 


ATCAACTGTT 


AAAGCACCTG 


CTCAAAGAGT 


AGATGTAAAA 


420 


GATATAACTC 


ATTTAACAGA 


TGAAGAAAAA 


GTTAAGGTTG 


CTATTTTACA 


AGCAAATGGT 


480 


TCAGCATTAG 


ACGGAGCGAC 


AATCAATGTA 


GCTGGAGATG 


GTACAGCAAC 


AATCACATTC 


540 


CCAGATGGTT 


CAGTAGTGAC 


GATTCTAGGA 


AAAGATACAG 


TTCAACAATC 


TGCGAAACGT 


600 


GAATCTGTAA 


CTCAAGAAGC 


TACACCAGAG 


TATAAGCTAG 


AAAATACACC 


AGGTGGAGAT 


660 


AAGGGAGGCA 


ATACTGGAAG 


CTCAGATGCT 


AATGCGAATG 


AAGGCGGTGG 


TAGCCAGGCG 


720 


GGTGGATCAG 


CTCACACAGG 


TTCACAAAAC 


TCAGCTCAAT 


CACAAGCTTC 


TAAGCAATTA 


780 


GCTACTGAAA 


AAGAATCAGC 


TAAAAATGCC 


ATTGAAAAAG 


CAGCCAAGGA 


CAAGCAGGAT 


840 


GAAATCAAAG 


GCGCACCGCT 


TTCTGATAAA 


GAAAAAGCAG 


AACTTTTAGC 


AAGAGTGGAA 


900 


GCAGAAAAAC 


AACCAGCTCT 


CAAAGAGATT 


GAAAATGCGA 


AAACTATGGA 


AGATGTGAAG 


960 


GAAGCAGAAA 


CGATTGGAGT 


GCAAGCCATT 


GCCATGGTTA 


CAGTTCCTAA 


GAGACCAGTG 


1020 


GCTCCTAATG 


CTCCTCCTAA 


GACAACAAGT 


GCACCGCAAG 


CAACTGCAGG 


AACAATGCAA 


1080 


GATGTTACCT 


ACCAGTCACC 


TGCTGGCAAA 


CAATTACCTA 


ACACAGGTTC 


AGCATCAACT 


1140 


GCAGCACTTG 


CTAGTCTTGG 


TCTAGTGGTG 


GCAACAAGTG 


GTTTTGCTTT 


GCTAGGAAGA 


1200 


AAGACTAGAC 


GTAGAAAATA 


GAACAGCTAG 


AAAATTCTAT 


TCTCTACTTA 


AAGTTAGATT 


1260 


ATAAGGGGGA 


TTTTGAGAAG 


TCATCAATCC 


TAGTGATGGG 


TGAGAAAAGT 


GAGAACCCAA 


1320 


GATAATCACA 


TACTTTAGCT 


GAATAGGAAT 


ATTCTATCAA 


TGTAGCCAAT 


CTCTTCTGTC 


1380 


TCTAACTGTG 


GAATAGGAGA 


TGGGCAATAT CGGATAGAAA AGATAGCAGA ATAGCTCTCT 


1440 
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ATTGAAGAGA GGAGGGGAAA CCGAAAAATT AGGTGCCCCT CCTCTTTTTT GGTATAATAG 1500 

AAGATAGAAA ACGAGCTTAG AAGAGATGAT TTTTGATACA CATACACACT TGAATGTAGA 1560 

AGAATTTGCA GGTCGTGAGG CAGAAGAAAT TGCCTTGGCT GCTGAGATGG GTGTGACACA 1620 

GATGAATATT GTTGGTTTTG ATAAACCGAC GATTGAGCAT CCCTTGGAGT TGGTAGATGA 1680 

GTATGAGCAG CTCTATGCGA CTATTGGTTG GCATCCTACA GAAGCTGGTA CTTATACAGA 1740 

GGAAGTTGAG GCTTACTTGT TGGATAAGTT AAAACATTCC AAGGTTGTGG CTTTAGGTGA 1800 

AATTGGCTTA GATTACCATT GGATGACAGC GCCCAAAGAG GTGCAGGAGC AGGTTTTTCG 1860 

CCGTCAGATT CAGCTATCTA AGGACTTGGA TTTGCCTTTT GTTGTCCATA CCCGTGATGC 1920 

GCTGGAAGAT ACCTATGAGA TTATCAAGAG TGAGGGCGTT GGTCCTCGTG GTGGTATCAT 1980 

GCATTCATTT TCAGGGACGC TTGAGTGGGC AGAGAAGTTT GTGGATCTTG GTATGACCAT 2040 

TTCCTTCTCA GGAGTGGTGA CTTTTAAGAA GGCAACTGAC CTCCAAGAAG CAGCTAAAGA 2100 

GTTACCTTTG GACAAGATGT TGGTGGAAAC AGATGCGCCT TACTTAGCAC CTGTACCCAA 2160 

GCGTGGTCGT GAAAATAAAA CAGCCTATAC TCGCTATGTG GTCGACTTTA TCGCTGACTT 2220 

GCGTGGTATG ACGACAGAAG AGCTGGCGGT AGCAACGACT GCAAATGCAG AACGAATTTT 2280 

TGGACTGGAC AGCAAGTAAT GAAAGAGAAA ATTTCTCAAG TTATCGTGGT TGAAGGGCGT 2340 

GATGATACGG TCAATCTCAA ACGTTATTTC GATGTGGAGA CCTATGAGAC TCGAGGTTCT 2400 

GCCATCAATG CTCAGGATAT AGAGCGGATT CAGCGCCTGC ACCAACGTCA TGCAGTCATT 2460 

GTCTTTACAG ACCCAGATTT TAATGGGGAA CGGATTCGGC GCATGATCAT GATGGTCATT 2520 

CCAACAGTTC AGCATGCCTT TCTCAAGCGA GATGAAGCTG TTCCCAAGTC CAAGACCAAG 2580 

GGGCGTTCTC TGGGAATTGA GCATGCCAGC TATGAAGACC TGAAAACGGC TCTAGCTCAA 2640 

GTGACAGAAC AATTTGAACA TGAGAGTCAG TTTGACATTA GTCGTAGCGA TTTGATTCGC 2700 

CTTGGTTTTC TAGCAGGGGC AGACAGCCGT AAGCGTAGAG AATATCTCGG AGAGACTCTC 2760 

CGAATCGGCT ATTCCAACGG CAAGCAACTC CTCAAACGCC TAGAGTTGTT TGGGGTTACT 2820 

TTGGCAGAAG TGGAAGAAGC TATGAAATCT TATGAGTAGG AAAGATGTAG CCGTTACAAT 2880 

TTTTTAAGTT TCACAGTATT TTTCGAAGCA GCTAGAAGAG GAGGCGTCTG ATGTTAATTG 2940 

GTCAAAAAAT TAAAGAGATT CGGATAGAAA AAGGAATTAG TCGTCCAGAT TTTTGTGGAG 3000 

ATGAGCAAGA ACTGACAGTT CGTCAACTGT CGCGAATTGA AAGTGGAGCT TCGCAACCGA 3060 

GTTTGCCCAA GTTAGACTAT ATTGCTCGCC GGCTAGGAGT TCCAGTTTAT AGCCTTATGC 3120 

CGGATTTTTC AGCTCTTCCT TCTGCTTATT TAGAATTGAA ATACCAGATT TTACGTGAAC 3180 

CAATCTATGG TAAAGAAGAG GAGTACGATA AGAAGGAAGC GTGTTTGGAA GAGATTTATA 3240 
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AAACATACTT TGATAATCTT CCTAAAGAAG AACAATTAGC ATGTGAAGTA TTGCAGGCGT 3300 

GTTTGGATAC TTCTAGAACT AGAAGGCCTG AATATGCAGA GTTAATACTT GAGGAACATA 3360 

TGCCTCAGAT TATAGAAAAA GAAGCTTATT CAATAAATGA TATGTTGTTG ATTCGTTTGT 3420 

TTTTTTATCA AATGCTCATT AGAAAAGATC TTGCCAAATT TATAAATCAA ATCGAAAAGC 3480 

TAATGCTCTT TCTTTTGGAA CAGAAGAAGG TAACTCAAAT AGAGAATTAC TTTATAATTA 3540 

GAGATACTCT TATTTCAGGA ATGTGTTGTC TTGAAAAGGT AGGAGTAACT GATTGTTTTA 3600 

ATGATTATCT ATCGTGTTTA CAAGAAATTA TGGATAAAAC TCAAGATTAT CAAAAGAAAC 3660 

CTCTTGTATT TATGTTTTTG TGGAAGCAAG CATTAAGAGA AGAAAGAGAT TTTAGTTTAG 3720 

CTGAATCATT TTATCAGTCT TCTAAAACAT TTGCGCAGCT AATTGGAGAT GAATTTCTAG 3780 

TAAAGAAATT GACAGAGGAA TGGCAAGAGG ATGTCAAAAA ATATTTATAA ACATAGTGAA 3840 

TCAGTGACAA AGATGTCCTT GTCCTCGTAT CAAAACAGTT CTAAAGTTCG TCTTTAGGGA 3900 

TGTTTTTTTA GATATAAGCT AAAAATGACA CGAAATGGTT AGATTTTAAG GACATTGATG 3960 
TCCG 

(2) INFORMATION FOR SEQ ID NO: 137: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12666 base pairs 
(B» TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



3964 



(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 137; 

TGAGACCGTT ATTTGTATTA GGGAAATGCG TATCTATTTT TAATGCTGTG GGGATTTTGA 60 

TTGTTTCTAT TATTCAAACC AAAAGCTTGT CAGGTATTGG AGCAGGATTG TTTAATCTAT 120 

ATAACATTTC ATCTTATATA GGTGATTTAG TTAGTTTCAC TCGATTGATG GCATTAGGAT 180 

TATCTGGAGC AAGTATAGCA TCAGCTTTCA ATTTAATTGT TGGTTTGTTT CCGGGAATAT 240 

TGGCTAAACT GACAATTGGA TTAGTATTAT TCATTCTTTT ACATGCGATC AATATTTTTC 300 

TATCGTTACT ATCAGGATAT GTTCATGGAG CACGTCTGAT ATTTGTTGAA TTTTTTGGTA 360 

AGTTTTATGA GGGTGGAGGA AAACCATTTC AACCTTTGAA GGCTTCTGAG AAATATATTA 420 

AGGTTATTAC AAAGAATTAA TGGAGGATAT ATATAATGGA ACATTTAGCA ACTTATTTTT 480 

CAACCTATGG AGGAGCTTTC TTCGCTGCAT TGGGAATTGT ATTGGCGGTT GGATTAAGCG 540 

GTATGGGGTC TGCTTATGGA GTTGGTAAGG CTGGGCAATC TGCCGCAGCT TTACTGAAAG 600 
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AACAGCCTGA AAAGTTTGCC TCAGCTTTGA TATTGCAATT ATTGCCCGGA ACACAACGAT 660 

TATATGGTTT TGTTATTGGA ATTTTAATTT GGTTGCAATT AACTCCAGAA CTTCCTTTAG 720 

AAAAAGGCGT TCCTTATTTC TTTGTAGCTC TTCCAATTGC TATTGTAGGA TACTTTTCAG 780 

CTAAGCATCA AGGAAATGTA GCAGTAGCGG GAATGCAAAT CTTGGCTAAA AGACCAAAAG 840 

AATTCATGAA GGGAGCAATT TTAGCTGCCA TGGTAGAAAC CTATGCAATT CTTGCTTTTG 900 

TCCTATCATT CATTTTGACC CTTCGTGTAT AAGAAATAAA TTTGCAATTC AAAGGAGGTG 960 

TCTAAATGAG CAATTTAGAA AACTTACGAG AGTCTCTTAT TGAACAAGCT CATGAAAAAG 1020 

GGCGTATGAA ATTATTGGAT TCCAAAAAGA AGATTGATGA TGAATTTGAA ATGCAAAAGT 1080 

CGCTCATTAT AAAGAAAAAA GAAGCTGAAC ATGAACGAAA GTTAAAAGAA TTGCAACAGA 1140 

AATATCAAAT AATTTTTCAA CAATTAAAAA ATAAGGAACG CCAATCAACG TTAGTATCAA 1200 

AACAGAAAAT ATTAAAAGAA CTTTTTCAAT CTGCTTTACT AGAAATGGAA TCTTGGAGTG 1260 

CAGATAAAGA AATGGAGTTC ATCTATCGAA TTCTGGAACG ATATTCACAA CAAGAGGTCA 1320 

TAGTAACCTT TGGGGAACGG ACTTTAGCTA AATTCAATTT GGAACAATTA GAGAAATTGA 1380 

AATTCTCTTT TCCAAATTAT TTATTTAGTG AACAACCTAT CTCAAATGAA TCAGGCTTAC 1440 

TTATTTCAAT AGGTAAAATT GATGATAACT ATTTGTATAA AACATTAATT GGATCGATTT 1500 

CTAAGGAAGA AAGTTCAAGT ATCGCAAATC AAATTTTTAT CAATTAAGGA TGAAATTGGT 1560 

TAATCCTTCT TAGAAATTTG GAGTATTCCA ATAAAATTAG AAAGGTATTT TATGGATACT 1620 

AATCTTTTTT CAAAAATAAA TACGACGATT TCGGTAAAAG AAAACGATTT TATTACAGAA 1680 

GAAAAATTTC AAAAAATTAT ACAATCCAAA GATACGGAGA CATTGGCATT TATCTTAGAA 1740 

TCAACTCCCT ATCATTTATC GATTGACATC TTACAAGATC CTAGTCAGAC AGAGATTTCG 1800 

CTAATGACAA AATTAGTCAA TGATTATAGA TGGGCCTATG CTGAAAGTCC GTCTGATATA I860 

ATTGTGACTT TATTTGCTTT ACGATATCTT TATCATAATA TCAAAGTTTT ATTAAAATCT 1920 

AAGGCGGCAA TTAAGAAAGA TTTTTCTAAA TTATTAATTC CAATAGGGAT TTTTGATATA 1980 

GAAAGTTTAA AACATTTAGT TTCTTCCTTA CATTCAGATA CACTTCCTGA TTTTATGGTT 2040 

CGTGAAGTAG AATCAATTTG GAATGAGTAT GAAACTTTTA ATAATATTCG TGTACTTGAT 2100 

GTCGGAGCTG ATCTAGCATA TTTTAAACAT CTGAAACTTT TATCTAATGA CTTAGATGAC 2160 

GTACTGTCTC AGGTTATTGT CGAAATGATT GACTTTTATA AT ATT ATT AC TGTAAAACGT 2220 

GGTTTATCTC AAAATAAGAG TCATGGGCAT ATTTTACAAT TACTTTCAGA TGAAGGAAGT 2280 

ATTTCTGCTA AAGAATTTAT ATACATTGTA GAAAATCAAG AAATATTTGT GTGGTTCAAT 2340 

AAAATAAATC CAAGCTTAGA TTCAATCTTT TCAACTTATG AATTGAAGAT GCAGGACGCA 2400 
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ACAATTTCAT 


CTTCTGAGTT 


AGAATTTTTA 


TGTGATTTAC 


TATTGTATAA 


AACTTTAGAT 


2460 


CAAGGAAGCT 


ACAATGTAGA 


GGGGCCGTTA 


GTTCTTGCTA 


GATATTTATT 


GGGATGTGAG 


2520 


TTTGAAGTAA 


AGAATCTCAG 


AATGATCATA 


TCAGCTCTTC 


AAAATACAAT 


TCCCTTTGAA 


2580 


TCAATAAAAG 


AAAGGATACG 


CCCACATTAT 


GGAAGCTAAT 


AAGTATAAAA 


TTGGCATAAT 


2640 


TGGTAGCCGT 


GATATTATTT 


TACCATTTAG 


CATGATTGGG 


TTTGATATAT 


TTCCTGCCTA 


2700 


CCAAGAACAA 


GAAGCTATAA 


ATACACTAAG 


AAAATTAGCT 


CAATCTGATT 


ATGGTGTCAT 


2760 


TTATATCACT 


GAAGACATTG 


CTTCAATGAT 


ATTAGATACA 


ATTCGCCATT 


ATGATTCCCA 


2820 


AGTTGTGCCT 


GCTATTATTT 


TATTACCGAC 


TCATAAACAA 


GGTTTAAATT 


TAGGATTAAA 


2880 


ACGTATAGAG 


GATAATGTAG 


AGAAAGCAGT 


AGGACACAAT 


ATTTTATAAT 


AATGTACAAA 


2940 


ATTGTCTGTA 


AT ATT ATT CT 


ATAATTTTTG 


GACTTAGTAA 


GGAGAATAAC 


TTTGACTCAA 


3000 


GGGAAGATTA 


TAAAAGTATC 


GGGACCTCTA 


GTTATTGCAT 


CAGGTATGCA 


GGAGGCTAAT 


3060 


ATTCAAGATA 


TTTGCCGTGT 


AGGTAAGCTA 


GGGTTAATCG 


GTGAAATTAT 


TGAAATGAGA 


3120 


AGAGATCAGG 


CATCTATCCA 


AGTCTATGAA 


GAAACATCTG 


GTCTTGGTCC 


GGGAGAACCT 


3180 


GTTGTTACAA 


CTGGAGAACC 


TCTCTCGGTT 


GAATTAGGGC 


CAGGATTGAT 


TTCTCAAATG 


3240 


TTTGATGGCA 


TACAACGCCC 


ATTAGATCGA 


TTTAAATTGC 


CTACTCATAA 


TGATTTTCTA 


3300 


GTTCGTGGGG 


TAGAAGTTCC 


AAGTTTGGAT 


AGAGATATTA 


AGTGGCATTT 


TCATTCCACT 


3360 


ATAGCAATTG 


GTCAAAAAGT 


GAGTACGGGT 


GATATTCTTG 


GAACTGTCAA 


GGAAACCGAG 


3420 


GTAGTTAATC 


ATAAAATTAT 


GGTTCCTTAT 


GGAGTATCTG 


GAGAAGTCGT 


TTCTATTGCA 


3490 


TCTGGCGATT 


TTACAATTGA 


TGAAGTTGTA 


TATGAAATAA 


AAAAATTGGA 


CGGTAGTTTC 


3540 


TATAAAGGAA 


CGCTTATGCA 


AAAATGGCCT 


GTCCGCAAGG 


CGCGTCCTGT 


TTCTAAACGT 


3600 


TTAATTCCAG 


AAGAACCATT 


AATCACAGGT 


CAACGACTTA 


TTGATGCATT 


CTTTCCAGTA 


3660 


ACCAAAGGGG 


GAGCTGCAGC 


AGTTCCTGGA 


CCGTTTGGAG 


CAGGAAAGAC 


AGTTGTACA* 


3720 


CACCAAGTAG 


CTAAATTTGC 


CAATGTTGAT 


ATTGTTATTT 


ATGTCGGTTG 


TGGAGAACCT 


3780 


GGAAATGAAA 


TGACGGATGT 


ACTGAATGAG 


TTTCCTGAGT 


TGATTGACCC 


TAATACCGGA 


3840 


CAATCAATTA 


TGCAACGGAC 


AGTTCTGATT 


GCTAATACTT 


CAAATATGCC 


TGTTGCTGCT 


3900 


CGTGAGGCTT 


CAATTTATAC 


AGGAATTACC 


ATGGCTGAGT 


ATTTTCGTGA 


TATGGGCTAC 


3960 


TCTGTCGCCA 


TTATGGCTGA 


TTCAACTTCA 


CGTTGGGCAG 


AAGCGCTACG 


TGAAATGTCA 


4020 


GGACGTCTAG 


AAGAAATGCC 


TGGTGATGAG 


GGTTATCCTG 


CTTATCTGGG 


AAGTCGTATC 


4080 


GCTGAATATT 


ATGAAAGAGC 


AGGACGTTCT 


CAGGTTCTAG 


GGCTTCCAGA 


ACGTGAAGGA 


4140 
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ACGATTACTG CTATTGGAGC TGTATCGCCA CCTGGTGGAG ATATTTCAGA ACCAGTTACT 4200 

CAAAACACTT TACGGATTGT GAAAGTTTTT TGGGGGCTTG ATGCTCCGTT GGCACAGCGA 4260 

CGTCATTTTC CTGCAATTAA CTGGCTTACA TCTTATTCAC TATATAAAGA CAGTGTGGGC 4320 

ACTTATATAG ATGGTAAAGA GAAGACAGAT TGGAATAGTA AAATAACTCG TGCGATGAAC 4380 

TACTTACAAC GGGAATCTAG TTTAGAGGAA ATTGTTCGTC TTGTTGGAAT TGATTCTCTG 4440 

TCTGATAATG AACGACTAAC GATGGAAATT GCTAAACAAA TTCGAGAAGA TTATTTGCAA 4500 

CAGAACGCTT TTGATTCGGT AGATACATTC ACTTCGTTTG CAAAACAAGA AGCAATGCTA 4560 

AGTAATATTC TCACTTTTGC TGATCAGGCA AATCATGCTT TAGAGTTGGG TTCTTACTTT 4620 

ACAGAGATTA TGGAAGGTAC CGTGGCAGTT CGAGACCGTA TGGCGAGAAG TAAATATGTT 4 680 

TCAGAAGATA GATTAGATGA AATCAAAATT ATATCAAATG AGATTACACA TCAAATTCAT 4740 

TTGATATTAG AAACAGGAGG TCTATAAATG AGTGTTATAA AAGAATACAG AACTGCTAGT 4 800 

GAAGTTGTTG GGCCTCTTAT GATTGTTGAA CAAGTAAATA ATGTGTCTTA CAATGAGTTA 4 860 

GTTGAAATTC AACTTCATAA TGGAGAAATT CGTCGTGGAC AAGTTTTAGA GATCCACGAA 4920 

GATAAAGCAA TGCTTCAGCT TTTTGAAGGA TCTAGTGGAA TAAATTTAGA AAAGTCTAAA 4980 

ATTCGTTTTG CTGGTCATGC ATTAGAATTG GCTGTATCTG AGGATATGGT TGGTCGTATT 5040 

TTTAATGGCA TGGGAAAACC AATTGATGGT GGACCAGATT TAATTCCAGA GAAATATTTA 5100 

GATATTGATG GTCAAGCTAT TAATCCTGTA TCTAGAGATT ATCCAGATGA ATTTATTCAG 5160 

ACAGGGATCT CCTCTATTGA TCATTTGAAT ACTCTTGTAC GTGGTCAAAA ATTACCAGTA 5220 

TTTTCAGGTT CGGGCTTACC TCATAATGAA TTAGCTGCTC AGATAGCAAG ACAACCGACT 5280 

GTTTTAAATT CTGATGAAAA TTTTGCGGTT GTATTTGCAG CAATGGGTAT TACTTTTGAA 5340 

GAAGCTGAGT TTTTTATGGA AGAACTCAGA AAAACAGGAG CGATCGATCG TTCGGTTTTA 5400 

TTTATGAACT TGGCAAATGA TCCTGCAATT GAGCGTATTG CAACTCCCCG CATTGCTTTA 5460 

ACTGCGCCAG AGTATCTAGC TTTTGAAAAA GATATGCACG TTCTAGTTAT CATGACGGAT 5520 

ATGACTAACT ATTGTGAAGC GTTACGTGAA GTCTCGGCAG CTCGCCGTGA AGTTCCAGGG 5580 

AGACGAGGCT ATCCGGGATA TTTATATACA AATTTATCAA CTCTATACGA AAGGGCTGGT 5640 

CGCTTAGTTG GTAAAAAAGG TTCGGTGACA CAGATTCCTA TTTTAACAAT GCCAGAAGAT 5700 

GACATAACAC ATCCAATTCC TGATTTAACT GGATACATTA CTGAAGGGCA AATTATTTTG 5760 

TCGCATGAGT TGTATAATCA AGGTTATCGT CCACCAATCA ATGTTTTACC TTCTCTCTCT 5820 

CGATTAAAAG ATAAGGGATC TGGAGAAGGT AAAACTCGTG GAGATCATGC TCCAACTATG 5880 

AATCAACTGT TTGCAGCCTA TGCCCAAGGG AAAAAGGTTG AAGAGTTAGC ACT ACT ATT A 5940 
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GGAGAATCGG CTTTATCTGA TGTAGATAAA TTGTATGTGA GGTTTACAAA GCGTTTTGAA 6000 

GAAGAGTACA TAAACCAAGG ATTTTATAAA AATCGAAATA TAGAAGATAC GTTGAATCTT 6060 

GGGTGGGAAT TACTATCAAT TCTTCCTAGA ACAGAGTTAA AACGTATCAA AGATGATTTG 6120 

CTTGATAAAT ACTTACCTTT GGTAGAAGTT TAATCCGGAA ATGGAGTGAT TATCTATGGT 6180 

ACGTTTGAAT GTAAAACCAA CTCGTATGGA ATTGAATAAC TTAAAGGAAC GTTTGACAAC 6240 

AGCTGAACGT GGACATAAGT TATTAAAGGA TAAAAGAGAT GAATTGATGA GGCGATTTAT 6300 

TTCTTTGATT CGTGAGAATA ATCAACTTCG GAAAGAAGTG GAAACTTATC TAATTGATAA 6360 

TCTAAAATCC TTTGCAGTTG CTAAATCATT AAAGAATTCT CAAATGGTGG AGGAATTATT 64 20 

TTCAATTCCA TCGAAAGAAA TTGAATTATT TGTTGAGAAA GAAAATATCA TGAGTGTAAC 6460 

AGTTCCTAGA ATGCATATGA ATATTACTTC TCAAAATGAG AACAGTGAAT ACAGCTATTT 6540 

ATCTTCTAAT AGTGAAATGG ATGATGTATT TGCTACAATG AATAGTTTAA TTTATAAATT 6600 

ACTAAGACTG GCAGAAGTTG AAAAAACGTG TCAGTTAATG GCTGATGAAA TAGAAAAAAC 6660 

ACGTAGACGT CTAAATGGTT TAGAATACTC GATTATTCCA AACTTGTCGG AAACTATTCA 6720 

TTATATAGAA TTGAAACTAG AGGAGGCAGA AAGAGCCAAT TTAGTTCGTA TTATGAAAGT 6780 

GAAGTAGATC CTTTATTTAG ATTATTAATT AGATGAACAA ATATCAGCTT GGATAAGGCT 6840 

TTAAGCCTTT CTAACCTTTT TTTATTGACA GTATCAGGAT ATCTTTTTCA AAATTTTGGT 6900 

TTGTTAGATA ATGAAAATGT TTCTACTAAT CTAGATTTAG GATTAGTAAA TCGTAAATGT 6960 

AATTATATAG AAAGTAAGCG CGTCATAACA AGGTATCTAT CATTCATCGA GCTCCTCCTG 7020 

TATACTATTA GTAAAGTAAA ACTATTGGAG GATATTTTAA TGCCACAACC TATTGTTCCT 7080 

GTAGAGATTC CACAATCTCG TCGTTTTGAT TCTAAAAACA GAAATGATAT TCTGCTTAAA 7140 

ATTCGTATTG GCAACCTTGA AGTAAGTTTT TTTCAATCTC TCAATCTCGA AATGGTAGAA 7200 

CAGCTTTTGG ATAAGGTGTT GCTCTATGAC AATTCATCTA TCTAGCCTAG GGGAGGTCTA 7260 

TCTCGTGTGT GGGAAAACTG ATATCAGACA AGGAATCGAT TCACTGGCTT ATCTGGTTAA 7320 

AACCCACTTT GAATTGGATC CTTTCTCCGG TCAAGTCTTT CTCTTTTGTG GTGGACGTAA 7380 

AGACCGCTTT AAAGTCCTTT ACTGGGATGG TCAAGGATTT TGGCTACTAT ATAAACGCTT 7440 

TGAGAACGGC AGATTGATTT GGCTAAGTAC AGAAAAGGAT GTCAAAGCTC TCACACCAGA 7500 

ACAAGTAGAC TGGCTTATGA AGGGCTTTTC TATCACTCCA AAAATATAGT AGATTGAAAC 7560 

TAGAATAGTA CACCTCTGCT TCTAAAACAT TGTTAGAAAT CGATTTTACT GTCCTGATCG 7 620 

ATTTGTCCTG TTCTTATTTC ATTTTACTAT AAATCCATCA GAAAGTCGTG ATTTCTATTG 7680 
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AAATGAGGAC 


TTTCTTTTTA 


TACTCATCTG 


CTTTCAAAAA 


GCATTCTAGT 




7740 


TTAACGATGG 


ACTTTATCAC 


CTCCTTCTCC 


AGTCCTTGTA 


TAACATCTTG 

4 nn W »» 4 W 4 4 \J 


>j#vvj 4 x un i iv. 


7800 


ATGACATCTT 


CCAAAGTTTA 


AAAGCCTTTA 

■ 44 P W«# 4 4 4 ■ • 


TTCTTAAATC 

4 4 ^# 4 4 nofi 4 


CACGTTTACG 


AA l\.l\. 1 X 4 \_ 


7ftfifl 

f O w U 


CACACTTGTT 


CAATGGGGTT 


CATCTCTGGT 


GTGTATGGAG 


GAATAAATCC 

win 4 nnvi 4 


AAAGPPAATA 

AAAUv V^ r^f \ 4 t\ 




TTAGTCGGAA 


TCTTTAAGGT 


ACTTGATTTA 

* 4 4 N*Ti 4 4 4 • » 


TGCCATATAG 

4 *J#*4 4 44 4 OU 




A APf^AHTA AA 
nnwunu 4 AAA 


74 AO 


AGATAATCAT 


CTGGATAAGC 


TTGTGAAATC 


TCCTATTCCT 


AAAGCCCCTT 


TAGCGCATAA. 

4 fluLULAlAA 


A04fj 


CTTTGGCTCA 


GCTTCTATTA 


TCGCTCACAC 


CATCCATCAG 

W4* A W ^»4* 4 W1U 


AAGTTTAATC 

MVf 4 4 1 /\A L V_ 


Tfi a a cuyr a f*r* 


O X u u 


CAATTATCGC 


CAAGAAGAAG 


ATTGGGCTAG 


GATGGGTTTA 


CC AATP Af* A r* 

x» V- AA 4 ^. /\V— /\W 


Cfl AGTtA A AT 


0 J.OU 


CTCTAATTGG 


CATATCAAGG 


CGAGTCAATA 

wnw 4> wnn 4 *» 


CT ATTTGC A(* 

4 n 44* uunu 


rffPTTT A T A 


A^^». l^w 4 i\jV_0 


O44U 


AGAGAGACTA 


TTGACTCAGC 


wWj A 4 iftV* 4. 4 




r\\~ i 1 aIa 


uW> iM. 1 AV>a 




GAGTGATAGT 


CAGCTGACTT 

WJTXIW 4 WfAW 4 4 


ACTATTGGAC 


All! 4 1U1 v»/% 




AbAAAUVAbu 


O ^ jl A 


GATTACGCTT 


TACCACCATG 

4 ^4WW#4W WJ^i 4 W 


ATCAGTGTCG 

n X W/»W 4 W 4 W W 


AAGTWtTTCA 

AAO 1 VJSJ 1 1 V/\ 


fiTArTir A A^ 


AA n i AWj 


04 AA 


AGATTATTCT 


GGCTATGTGC 

WWW 4 *» 4 W 4 WW 


ATTYITFGAVAT 

C» 4 4 W 1 4 \JmT4 4 J* 4 


111 UWUULAU 


4 M/%V_ 1 1 AUun 


Li n Ao i 


a >l £ A 
04 OU 


CTAGTTCTGC 


CTATGCCATA 




Ul I 1 AUUnU\> 




I AAvL I l\A> 1 




AAACTTCGAA 


CCG CP CGT PT 

WW>^JW 4 WW 1 V* 4 


GCTTATCGTC 

u ^ii n a w> w 4 w 


AAV- X 


AUL 1 VinAL 1 I 


U I I VAjA 1 W 1 1 


aeon 
D3dU 


GGGCGCATGT 


CAGAAGGAAC 


i 4 4 4 X iVjnnu 


VfV^V.ViWV.VVA 




1 AAA i V_A 1 V_\_ 


004U 


TTAGGAGCTA 


AAGGTTTAGC 


TT ATTfPPR A*P 

4 4 n 4 4> W 4 W^r** 4 


V>oD 1 IAH I 1 


4 4 WlAAAU 


AIjAC IViwAU 


m A A 


GCTTTGCCAG 


CTGATGAACC 








L.U AA i wuAA 


PICA 


GACTTCTTTG 


CTTAGTGCCG 


GCGTCAGTCA 

WWW 4 ^» 4»^J 4 W-#» 






Am a iwrr a 


O0<U 


ATTGAATACA 


GCCTCAAGTA 


TGAAGAAACC 


TTTAAGACCA 

■ 4 4 r4nWJ^Wwr\ 


TTTTGAAAi^A 

4 4 4 IW/VVtWt 




OQOA 

ooou 


GTCCTTTCCA 


ATAATCTAGC 

» 4 4 on m 4 nub 


TGAACGCGCC 


ATTAAATCAT 


1 VJU I 1A1 WWJ 


if fIGAfIT A A A 

/U.V.IUAU 1 AAA 




AGAGTCCAGT 


GGACTCTTTT 

4 4 4 4 4 


AGCCTAAGCT 


CAGTTTAAAA 

W«*W 4 « 4 rtrkfin 




U\J 1 lAl 1 1 1L 




TCAAAGTTTT 


GAAGGAGCTA 


AAGCAAGAGC 


TATTATTATG 


AGTTTGTTGG 


AAACAGCTAA 


9060 


ACGTCATCAA 


TTAAATAGCG 


AGAAATATCT 


ATCCTATCTT 


CTAGAATGTC 


TTCCAAACGA 


9120 


GGAAACTCTC 


GTAAACAAAG 


AGGTTTTAGA 


GCCTTATTTA 


CCATGGACTA 


AAGTTGTACA 


9180 


AGAAAAGTGC 


AAATAAGAAA 


TCTCCAGATT 


AGGAACTATC 


CGTGAGTTCT 


CCAGTCTGGA 


9240 


GATTTTTCAA 


TAGACTTCCT 


GCGAAACAAA 


ATATGGTATA 


ATAGTTCTAT 


GAATGATGAA 


9300 


GCAAGTAAAC 


AACTAACCGA 


TGCACGATTT 


AAGCGTCTTG 


TTGGTGTTCA 


ACGCACGACT 


9360 


TTTGAAGAGA 


TGTTAGCTGT 


ATTAAAAACA 


GCTTATCAAC 


TTAAACACGC 


AAAAGGTGGA 


9420 


CGAAAACCTA 


AATTAAGTCT 


AGAAGACCTT 


CTTATGGCCA 


CTCTTCAATA 


TGTGCGAGAA 


9480 



WO 98/18931 



• 

PCT/US97/19588 



927 

TATCGAACTT ATGAACAAAT TGCGGCTGTT TTTGGTATTC ACGAAAGCAA CTTAATCCGT 9540 

CGGAGCCAAT GGGTTGAAGT AACTCTTGTT CAAAGTGGTG TTACGATTTC AAGAACTCCT 9600 

CTCAGTTCTG AGGACACGGT AATGATTGAT GCGACGGAAG TAAAAATCAA TCGCCCTAAA 9660 

AAAAGAATTA GCGAATTATT CTGGTAAAAA GAAATTTCAC GCTATGAAGG CTCAAGCGAT 9720 

TGTCACAAGT CAAGGGAGAA TTGTTTCTTT GGATATCACT GTGAACTATT GTCATGATAT 9780 

GAACTTGTTC AAAATGAGTC GCAGAAATAT CAGACAAGCT GGTAAAATCT TGGCTGACAG 9840 

TGGTTATCAA GGGCTCATGA AGATATATCC TCAAGCACAA ACTTCACGTA AATCCAGCAA 9900 

ACTCAAACCG CTAACAATTG AAGATAAAGT CTATAACCAT GCGCTATCTA AGGAGAGAAG 9960 

CAAGGTTGAG AACATCTTTG CCAAAGTAAA AACGTTTAAA ATGATTTCAA CAACCTATCG 10020 

AAATCATCTA AACGCTTCGG ATTACGAATG AATTTGATTG CTGGTATTAT CAATCATGAA 10080 

CTAGGATTCT AGTTTTGCAG GAAGTCTATT ATCAAAAATA CCATCAAGAT TATATAAGAT 10140 

TGATACAGGA AAAGTTTTAT TTGATGGTGT AAATATTAAT CAAATAGATA AAAAAATATT 10200 

AAGTCAAAAT TTAGGAGTAG TTCCACAGGA TTCATTTTTA TTGAACCGAA GTATTCTTGA 10260 

TAATATAACT TTAAAGCACG AAGTTACTTC ACAAAAGATA GAGGAAGTTT GTAAAGCACT 10320 

TCAAATCTAT GATGAAATCA TGGCTATGCC GATCAAATTT AATACTATCA TCTCAGAGAT 10380 

GGGGTCAAAT ATTTCAGGTG GGCAAAGGCA ACGGATACCA CTGCCACGTG CATTAATAAA 10440 

TAATCCTAGT ATTGTAATTT TAGATGAAGC AACTAGTGCA TTAGACACTA TTAATGAGGA 10500 

AAGAATAACA AAGT ATAT AC AAAGTCAGGG CTGTACTCAA ATAATTGTAG CTCATAGATT 10560 

GTCAACGATT AAGGATGCGG ATGTTATTTT TGTAATGAAA GGTGGTAAGA TTGTTGAGTC 10620 

AGGAAATCAT AAGTACTTAA TGGATCTTGG TGGAGAGTAC TACAGCTTAT ATACAAAAAG 10680 

GAAATGAGGT GTAAAGAAAA TGAAGAAAGA AAATGAATAT GTAATTTTAA CAACAGCCTC 10740 

ACTAGGGGTG ATGATTGGAA TAGTGTTTGC AATTTTTTTA GATTTTCCAG TTGAATATGG 10800 

TATTTCTTTA GGCTTGTTGA ATGGAATAGT ATTGGGTTCG CTGATTGTTT ACAAAAACAA 10860 

TAAGAATTAA GCATAATTTT TTGCTGTAAA CTAAGGAGTA GAGATGGCTA TACTTGAAAT 10920 

TATAAATCTA ACAAAAAGCT TTAAAGATAT TGAAGTTATT CATAACACTT AAATAATAGA 10980 

GCAACTACAG TAGTAGCTTA AAAACATGAT TAAATCGCTA TTCTTAGGAG TAGCGGTTTT 11040 

TCTTTTTGTT TAATACTCTT TGAAAATCTC TTCAAACCAC GTCAGCTTTG CTTTACCGTA 11100 

CTCAAGTACA GCCTGCGGCT CGCTTCCTAG TTTGCTCTTT GATTTTCATT GAGTATAAAA 11160 

AGGGTCAAGT AAGTATAGTA AATTGAAATA AGATATGAAC AAATCGATTA GAAAAGTCAA 11220 
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ATTAATTTCT AGAAATATGT TAGAAATTGG TTTGAATTCC GCAATCAATT TGTTCAGTTT 11280 

TTATTTCATT TCATTTTATT TAATTAGATT TTCCAATTTT TTAATTCAAG CTAAAAATCC 11340 

CCAATCCTAG TGATTGAGGA TTGAGTAAAT AAATCTTAAA CAATACCTTG TGCAATCATG 114 00 

GCATTTGCTA CATTTTCAAA GGCAGCAATG TTAGCTCCTG CAAGGTAGTC TTTATCAAGA 114 60 

CCGTATGTTT CTGAAGTCGT TTTACCTGTG TTGAAGATGT TTGTCATGAT GTCTTTGAGA 11520 

CGGCCATCAA CTTCTTCACG AGTCCATGAG AGGCGAAGAC TGTTTTGGCT CATTTCAAGA 11580 

GCTGAAACGG CTACACCACC AGCGTTCGCA GCTTTTGCAG GTCCGTAGAA GATACCATTT 11640 

TCTTTGTAAA CTTTGATGGC ATCAAGGTCG CTCGGCATGT TGGCACCTTC AGATACACAG 11700 

ATAACGCCTT GAGCAACCAA ACGTTTAGCT GCTTCACCGT TGATTTCGTT TTGAGTGGCA 11760 

CATGGAAGAG CAATGTCATA GTTTCCAGCG TAAGTCCATA CAGTACCTTC GTGGTAGGTT 11820 

GCAGTTGCTT TTTCAGCTGC ATACTCAGTC AAACGAGCAC GACGTTTTTC TTTAACATCA 11880 

ACCAAAAGAT CGAAGTCGAT ACCATTTTCA TCGATGACAT AACCATTTGA GTCAGAAACA 11940 

GAAATAACAG TTGCACCGAG TTCAGTTGCT TTTTGAAGAG CATATTGAGC AACGTTACCA 12000 

GAACCTGAAA TAACGACTTT CTTACCAGCA AAGCTGTTAC CGTTAGCTTT GAGCATTTCT 12060 

TCAGTATAGT AAACCAAACC GTAACCAGTT GCTTCTGGAC GAATCAAGCT ACCACCAAAT 12120 

CCAAGAGGTT TACCAGTCAA GACACCAGCA TCAAATTGGT TAAGACGTTT GTATTGACCG 12180 

TAAAGGTAAC CAATTTCACG TCCACCAACA CCGATATCAC CAGCAGGTAC GTCAAGTGAT 12240 

GGTCCGATGT GTTTTTGCAA TTCAGTCATG AACCTTTGGC AGAAGCGCAT CACTTCAGCA 12300 

TCTGTTTTAC CTTTAGGATC GAAGTCTGAT CCACCTTTAC CTCCACCGAT AGGAAGTCCA 12360 

GTCAAGACAT TTTTAAAGAT TTGTTCAAAT CCGACGAATT TCAAGATCCC TTGGTTTACA 12 420 

GTTGGGTGGA AACGAAGTCC ACCTTTGTAT GGTCCAACAG CTGAGTTGAA TTGAACACGG 12480 

TAACCACGGT TTACTTGAAT TTTTCCATCA CGGTCAACCC AAGGAACACG GAAAGAAACC 12540 

ACGCGCTCAG GCTCAGTAAT ACGTGCCAAG ATATTTTCTT CGATATACTC AGGGTGTTTT 12600 

TCAAATACAG GTTCTAAAGT GTTGAAAAAT TCTTCAACAG CTTGGAGGAA TTCAGCCTCG 12 660 

TGCCGG 12666 
(2) INFORMATION FOR SEQ ID NO: 138: 

li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3063 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 
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(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


138: 






AGCAACTGTT 


GTGAACCAAT 


TCCGATAAAT 


TCCAAGAATT 


GGTTAATAGA 


GCCATTTTGA 


60 


CCAAAAATCC 


CGATAAAAGC 


ATAGGCTTTA 


AGGAGCAAAT 


TGATCCAGGT 


AGGAAGGATA 


120 


ATCAGCATGA 


GCCAGAGTTG 


ACGGTGTTTG 


AGACGGGTCA 


AAAAGAGGGC 


CGTCGGATAA 


180 


CTGATAAGCA 


GTGCCACAAA 


GGTCACAATG 


CCTGCATAAA 


GCACTGAGTT 


GAAACTCATT 


240 


TTAAGATAGG 


TCAAGTTTTG 


TGACGCAAAG 


TAAGATTTGT 


AATTTTCTAA 


ACTGAACTCG 


300 


CCTTCGATGT 


TGAAAAAGGA 


TTGACCGAAA 


ATCAAGACCA 


AGGGTGCCAA 


TACAAAGAGC 


360 


GCAATCCAAA 


GCATGTAGGG 


TACTACAAAG 


AGTTTAGAGC 


TTGTTTTCTT 


CATCTCTTTC 


420 


CTCCTCGATT 


GCATTGATCA 


AACCTGCTTC 


TTGCTCTTCG 


ATTTCTACGT 


ACTCCTCAAT 


480 


ACGAGCATCG 


AACTCTTCTT 


CGGTTTCATT 


GAGACGCATG 


ATGTGGATGT 


CTTCTGGTTC 


540 


AAAGTCCAGA 


CCGATTTCCT 


CACCCACGAT 


AGCCTTACGG 


GTTGAGTGGA 


TCATCCATTC 


600 


ATTTCCAAGT 


TCGTCATAGG 


CGATAATTTC 


ATAATGAACT 


CCACGGAAAA 


GCTGGGTATC 


660 


GACCTTAACT 


TGGAGCTTGC 


CTTCTTCAGG 


AAGGCTAATG 


CGCAAGTCCT 


CTGGACGAAT 


720 


AACGACCTCA 


ACAGGTTCAT 


TTGGCTTCAT 


CCCACCATCA 


ACCGCTTCAA 


AGCGTTTGCC 


780 


GTTAAATTCG 


ACCAAGTAGT 


CCTCAATCAT 


GGTACCTCGC 


AAGATGTTTG 


ACTCCCCGAT 


840 


AAAGGTGGCA ACAAAGTGGT TGATTGGCTC 


ATCGTAGATG 


TCCACAGGGG 


TTCCAGACTG 


900 


GACAATCTCG 


CCATCATTCA 


TAACGAAAAT 


CCAGTCACTC 


ATGGCAAGAG 


CTTCTTCCTG 


960 


ATCGTGAGTG 


ACAAAGACAA 


AGGTAATGCC 


CAATCGTTGT 


TGTAATTCAC 


GCAATTCGTA 


1020 


CTGCATGTCT 


GTTCTCAATT 


TCAAGTCCAG 


CGCTGATAAA 


GGCTCGTCCA 


ACAAGACCAC 


1080 


ACGGGGTTGG 


TTGATGATAG 


CACGGGCGAT 


GGCCACACGC 


TCACGTTGTC 


CTCCAGAAAG 


1140 


TTTGCGGATG 


GAACGTTTTT 


CATAACCTTC 


CAACTGAACC 


ATCTTGAGAA 


CTTCCGCTAC 


1200 


ACGCTGCTCG 


ATTTCTTTCT 


TATCAATTTT 


ACGCAAGCGA 


AGTGGAAAGG 


CAACATTTTC 


1260 


AAACACATTC ATATGTGGGA 


ACAAGGCATA 


GGATTGGAAG 


ACGGTATGTA 


CGTCGCGCTT 


1320 


GTTGGTTGGA 


ATATCATTGA 


TACGAACACC 


GTCTAGCATG 


ATATCTCCTG 


TCGTCGCATC 


1380 


CAGTAAACCT 


GCAATAATGT 


TTAGGATAGT 


TGATTTCCCC 


GAACCAGATG 


CACCTAGAAG 


1440 


GGTCTAGAAT 


TTCCCTTCTT 


CCAACTCAAA 


GTTGATGTCT 


TTGAGAACCT 


TGGTGTTGCT 


1500 


GTCrrCAAAA 


ACTTTAGAGA 


CGTTTTTGAA 


TTCGATAATT 


GGCTTTTTCA 


ATTGGCATAA 


1560 


ATTCCTTCTT 


TTTCATAGAT 


TAACCGATCG 


GGGCTCTGTC 


AGGTCCCCAC 


TACCTCTTGC 


1620 


AGGGAGTAAA 


ACCACCTGCA 


TACATCTTCG 


CTACCGATAG 


GCTTTCACCC 


AAGATCCGGA 


1680 
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CTTCTCTTTC AAGCCTAATA CCTGAGTGTT CCTTGACTTT TTCGATAACC GATTCCATCA 1740 

AGTCCTCGTA GTCTTTGGCC GTTCCATCTG CGACATTGAT CATAAATCCT GCATGCTTTT 1800 

CTGACACTTC TACGCCACCG ATACGATAGC CTTTCAAGCC AGCTTCTCAA ATTAACTGAC 1860 

CTGCAAAATG CCCGACTGGA CGCTTAAAGA CCGAGCCACA AGATGGGTAT TCCAAAGGTT 1920 

GCTTGAGTTC ACGTAGGTGC GTCAAGCGGT CCATTTCCTG CTTGATAACC TGATGGCTTC . 1980 

CTGGAGCTAG GGCAAATTTA ACTGACAAGA CAACTGCACC AGACTCCTGA ATACCTGAAT 2040 

GACGGTAACC AAAAGCCAAG TCTTTAGCAG ACAGGCTTTC GATTTCTCCA TCCTTGGTCA 2100 

AGACCTTACA AGACTGCAAC ATGTGAGCAA TCTCGCCACC ATAGGCACCC GCATTCATAA 2150 

AGACAGCACC GCCAACGCTT CCTGGAATAC CACAAGCAAA CTCAAAGCCA GTTAAACTAT 2220 

GACGGAGGGC AATCCGACTT GTTTCAATCA AGTTAGCCCC AGCTTCTGCT TCAATGGTAT 2280 

AGCCATCAAC AGAAACGTTA TTGAGCTTGT CACACAAGAT GACAAATCCA CGAATCCCAC 2 340 

CATCACGAAC GATGATATTG CTTCCATTGC CAAGAACCAT CCAAGGGATA TTTTCTTGGT 2400 

TGGCAAATTT CACAACGCGA GCCAACTCAA AACGATTTCG TGGAAAGACC AAATAATCAG 2460 

CCTCTCCACC TACTTTTGTA TAACTATAGC TATCCAAGGG TTCCTTAAAA CGGATATCAA 2520 

TTCCTTCTAA GATTTCAAGC ATTTTTTCTC TTACACACAT GTCACTCTTC CTTTTACAAA 2580 

ATTCATTCCA TTATACCATT TTTAGAGACA TTTGACGACC ATAAAAATAC CTTGTTTGGA 2640 

TTTTGCATAA GAAAAAGAGG TTCCCCCCTT TTTATCATTT TTTACAAAAC ATTTCCTTGG 2700 

TTCCATAGGC GACCAGAACG AGCTCCAGTG CTAGAATCAC TTCAACCAAG ACTGGATTTG 2760 

TCAACCAGCC TACTTGGAAA AGAGATGGTG CCAGATCAAA GAACGCATGC AAGCCATAGG 2820 

CTGCTAGGAG ATAAATCCAT TTCTTCTGGC GAACAGCTTG GTAAACCCAA ACTGTCAAAA 2830 

GTAATTGGAA ACCAAGCGCC AAGATTCGCT CAAAACCAAG CAAATAAATC TGCCAGACCG 2940 

AAAGTGACTG AATGGTTTTT AACATATTTT CACACACTAA TTGCATAACC TGTCCATTC: *»CCO 

GAGTTTGAAC TGCCCAAAGA ACAATGTAAA GATTGAGTAA ACTAGTAAGG CCTAGAAAAA 3060 

TCAACTCCAA GCCACCATGC CCC 3083 
(2) INFORMATION FOR SEQ ID NO: 139: 

(ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 15363 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 139 
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CCGGAGGATA 


TTGACCACCA 


CCAAAAGCAG 


GGGGAAAATC 


GAAATCAACC 


AATAGTAGGC 


60 


TACTGCGACA 


CTGGTCAACT 


CACTATCTGA 


TGCTTGATAA 


TAATGCAAAA 


AACCTTTTAA 


120 


TAAAGGTTTG 


TCTATCAGCT 


CTTTCCACCA 


CTTTTTCATG 


TCATACTCCT 


TCACTTATAA 


180 


TCTTATACTC 


AATGAAAATC 


AAAGAGCAAA 


CTAGAAAGCT 


AGCCGCAAGC 


TGCTCAAAAC 


240 


ACTCTTTTGA 


GGTTGTAGAT 


AAGACTGACG 


AAGTCGATCA 


CATACATACG 


GTAAGGCGAC 


300 


GCTGACGTGG 


TTTGAAGAGA 


TTTTCGAAGA 


GTATTAACTA 


ATTTCTTCTT 


ACCAATTCCA 


360 


CCATATCATA 


CGGTAGGGTA 


TTGGCAGCTT 


CCTTCAAGGA 


ATAGTTCTCT 


AAGTTATTTA 


420 


CATTTTGTCG 


TAATTTCTTG 


GCATACTTAG 


TCGTAATCAA 


TCGTTTTTCT 


TCGTATTCGA 


480 


AAATCAACTT 


GCGCTCCAGA 


TAATAGCCTC 


TCAGCATTTC 


ATCGATATTG 


TTGGGTTTGA 


S40 


CACGATTGAT 


AACCCGTTCG 


ACAAAGGCAC 


CACTGCTGAT 


AATAGCTGTT 


TCTCGAAGAC 


600 


GAGACTCCTG 


CATAAAACTA 


ATCAAAGAGC 


GTCTGTAGAC 


TCCCTTCAGG 


TTTTCCAAAC 


660 


TTTCAATAAT 


CATCTCTGTA 


TTGGCAAGAT 


AGAGCTCTGC 


AATTTGGTCA 


TAATCAAGAG 


720 


CACGGAGACG 


GCTTTGCTCC 


TTGTTCTTCC 


AGCTACGGAA 


GGTCTTTCCG 


AGAGTAAAAA 


780 


CTTCATGAAG 


GAGAAAACGT 


AAAATCCTCA 


AGGAAACAAG 


AAAATAATAG 


GTCAGTCTTG 


840 


AGGCAAGTTT 


ACGATTGATT 


CCTTGTTCTA 


TATTTTTCAG 


ATAACGTTGG 


TAAACTCCGT 


900 


AAGCACGATT 


GCTAATGTTC 


CCCTCTTCAT 


AGGCCTGTTC 


CAAACCATCA 


CTTTCAATAC 


960 


TAAGAATCAA 


GAGTTTCAAA 


GCAGCCCAGT 


CTTCTTGATC 


ATCCTGGTTT 


TCTTGCCTTA 


1020 


AAATGAGATT 


TTCAATACGT 


CCATGATAAT 


TGTCAATAGC 


CGCATAGAGG 


GGAAGTTTAT 


1080 


TTCTGGTGTC 


TTCCAACTCT 


TTTTCCAACT 


CTAGCCTTAC 


TTCATTCAAA 


ATGGCGATAT 


1140 


GCATAAGATA 


ATCCTTGCTT 


TCTTCCTCTT 


CATCAGAAAG 


ATGAGGCAAG 


ACCAAGAGAC 


1200 


CTGTTAAAAA 


GCTAACAAGC 


GTCACACCTG 


CAACAAGGAA 


AAGCAAAAGA 


GGATACTCCT 


1260 


GTTCTAGATT 


ACTTGGTATC 


AAGAGAATCG 


TAGCAATCGA 


CACCGTTCCC 


TTAACACCTG 


1320 


AAAAGGTCAA 


GAGAAACATG 


TCCTTCATAT 


ACTTATTTAG 


CTTTTTCTTG 


AGGCGTCGGG 


1380 


TTCTATAGGC 


ATAATAGCCA 


TAGATCATAA 


TAAAACGAAT 


GACAAAAAGG 


ACAAAGGTAA 


1440 


GGGCGATAAG 


AGATAGCAAT 


AAAAGTAGAG 


GATTATAGAT 


TGGATTGGTC 


AAGATAGGTT 


1500 


CTGCTATCAT 


TTCCAACTCC 


ATCCCTAAAA 


TCACAAAGAC 


AGAACCGTTG 


AGCATAAAGG 


1560 


TCACTGTATG 


CCAGACCGTC 


TCGGTCACCG 


TATCCACTTG 


GGCTTCGAGG 


AGCGTGATTT 


1620 


TCTTGAAGCG 


ACTTGCCTTT 


AAAATTCCAG 


CAACTACGAC 


GGCAATAATA 


CCTGAAACAT 


1680 


GAACTTCTTC 


TGCCAGAAAG 


AAGGTCACTA 


GAGGCAAACT 


CAATTCTAAT 


AAAAGTTCAC 


1740 
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TGGCAATATC 


V. 1 L VJV. V^V- 


APAPTTAGPA 
fti_AV_ X X nUV.A 


API APfSTATf^ 
nuAAuu lAlu 


PAGGAAGPGP 


I I\»GTlATGG 


i ann 


CTCTTAAAAA 


TTPAHTTaAA 
l V. Unrt 1 lAAn 


A A APPPPPT A 


rtp awe A AAA 
obAl luAAAA 


rtATPAPPPa a 
un 1 VjALL LiAA 


L 1\»lTAGL 1 1 


t of n 

looU 


GCCCCACAGA 


AAAAGPTPPA 


PTTPJTPP A A 


PT(7TPA AAPP 


TAPPTPA A a A 
1 ALL XuAAAA 


<~*r*r* jv /^/** iv a a/* 
GLLALLAAAL 




CAGAAGCATC 


A ft X wwiUnU L 


PfTTPWPPP 
L«p X 1 LuV>vv 1 


1 AAvjAA X A 1 1 


nn a p a pppr^/" 

uviALALuLGL 


I 1AGGAAAGL 




TAAAACGCTP 


PGAAAGAGAG 




PPA APTPPfIT 
LUUUi X LLvi 1 


AVjLAL L AAliG 


GlTGLLLLAA 


ZU40 


CAGCCAAGTA 


AGTTGrT A AG 
AV*V. 1 V)LL AAV 


GG A AGGPTVi A 


APP A A Afi AAf^ 
A\— L/UUtunnu 


ATPPPPPa a/* 
A l\jViOLLAAG 


LLALLLAAAL 




TCAGGGTCGA 


G ATA AAA ATP 


APTGGAAATA 


Tf^A/^ATA A AP 
X UnVJrt X AAAL 


& ATPaTTWV* 
AA I LvA J 1 V_ljL 


L ALf lTj 1 X 1 1 A 




AAATAGPPGT 


nA^n XV— 1 vtL 1 


TPTTP IfiPPT 
X L X X L A\jLL i 


PTPl^PA A A AP 


laagggtll o 


A T 1 TV IV r^"* a ^*fV 

AT AAll AGTG 


inn 


k» ^nnn/utVi AA 


V. 1 LLO 1 A 1 X A 


Auu 1 uAAAU 1 


LAO 1 A 1 x\»(jKj 


^ TV A IV IV IV^fV^IV 

TAAAAAGAGA 


ClAATlAlAA 


2280 




ftAH luLALL 


a a a/w 

AAAvsviuAJLjAiy 


ppa a a & h.fnf* 
L»LAAAAAbtAj 


CAGGAGCTTA 


TTGGTTGTAC 


2340 


1 X vAVf AL AA I 


LAAAAL LAU I 


AAAAATAWjA 


TGAGGTAAAT 


CAGTAATTCC 


ACGCACGTCC 


2400 




11 II 1 AlAAL 


AGGAT1LAAA 


TATCTCCTTC 


TCCTCTTTGA 


TTTTTTGGTC 


2460 


aatpttpp a & 


LAO XL I 1 i\j 1 


GL I LAAi III 


i L rL XtjOLAL 


CGTTCCATTT 


Ik fk Ik ^^^^ m Ifc 

CAAGAGCAAC 


2S20 


I A/\l lilt XA» 


ITuATTi I AA 


ocAri'iTrrr 


GCTTCATATGC 


GCTTGGTCTA 


GCACGCCCAT 


2580 


LuL XLvj X 1 Lvi 


TGO IaJGGI I G 


ATTlAACAAA 


ATTCTGGCGC 


ATGGCATCCA 


GC1 1 i TCGTG 


2640 


X AAv» 1 /V 1 lu X 


^P^W ft ^P^^ 1 ^^ n ^P^^^*^P 

1 I A 1 LLA Ivj 1 


V_ rt»i A1"L J~L I' 


uAArrrni 


AATCATCACT 


A A * ■ Ik 

AAAAACGGCG 


2700 


LA» i 1 vi I I L*AL 


^Ptfl^f^^^^^n^^^l'V* ft ft ft 

1 l\5Gl x 1 AAA 


ft ft ft 

GTTCGGTAAA 


TGGCAGCTGT 


GTACTCTTGT 


TGGTTCAACT 


2760 




* TV**^* A a/"*ir**a 

ATLlAAOALA 


GCATLTTtrrCT 


CGAGATQajLL 


TCCTTCATGA 


ft m il ft ft ft 

CCATAGTAAA 


2820 


x L *> x aa i nviv. 


AAl ILGxlLA 


rVTHTf* R ft IV 

LL 111 LjAL AA 


OTAAoLLALA 




AATGCCTCAA 


2880 


TPflTTPT ITf? 
1 LV» X lulLlu 


HjO I CviOk» X\» 


A i\j AL AG AL 1 


TATCAGCTGC 


CGGCAAATAG 


AAA A ^ Ik HIHI A ft 

CCCAGATTAA 


2940 


#\A/\ i L LL X V>L 


L I I AW- 1 I I T 


IV /TV* * » iv * fttn 

ATwAwVAACT 


GGTCCAGTGT 


LTlAT GGC CT 


PT^r^f^it ft ^ ft nun ft 

TGCAAGATTA 


3000 


APTGGGPATT 


T(WS A/TP A 
1 u 1 wAAo .1 LA 


vjLL 1 VjA 1 VjLA 


AALUL rL I 1 vj 


Gvj'1"L"1"'1"1"1*LL 


AAGGCTvGlT 




TCTGAATATC 


AAAGGCATAG 


ACTTGCTTGG 


CTAGCTTGGC 


TAAAAAAAGC 


GTGTCATGAC 


3120 


CATTTCCCAT 


AGTCGCATCC 


ACTACGACAT 


CCTCTTTTGT 


cacgacctca 


GCCAAAAAAT 


3180 


CATGTGCCAT 


CTCAAGTGGT 


CTTTTCATTT 


TCAAACTCCT 


gttttacagc 


CTTGCATCCT 


3240 


TGAACACTTC 


CACGACGTCG 


CATCTCCATC 


TCAATGCTGT 


tgaggacttc 


CCATTTATTG 


3300 


AGGCTCCACA 


TAGGACCAAG 


CAGCATATCC 


CTAGGCGCAT 


CTCCTGTAAT 


TCGATGGATG 


3360 


ACGATATGTT 


TGGGAATAAT 


TTCCAGTTGG 


TCACAGATGA 


CCCTGACATA 


TTCGTClTGA 


3420 


CTCATCAATT 


GTAAACGCCC 


CTCATGGTAA 


TCTCGTTGCA 


TACGAGTATT 


TGTCATAAGA 


3480 


TGGAGCAAAT 


GCAGTTTAAT 


CCCTTGAATA 


TCGTTATCCG 


TGACACAACG 


GCGGACATTT 


3540 
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TCAACCATCA 


TCTCATGGGT 


TTCACCAGGC 


AAACCATTGA 


TCAAATGGGA 


AACAATCTCA 


3600 


ATTTTTGGAT 


ACTTTCTCAA 


ACGCTTGACC 


GTTTCCACCT 


ACAATTCATA 


AGAATGCGCA 


3660 


CGGTTAATCA 


GGTCAGAGGT 


TGCTTCATAA 


GTAGTTTGCA 


AGCCCAATTC 


AACCGTCACA 


3720 


TCCATGCACT 


CCGATAACTC 


AGCCAAATAT 


TCGATGGTTT 


CGTCTGGTAA 


ACAGTCTGGG 


3780 


CGCGTTCCAA 


TATTGATTCC 


TACCACACCT 


GGCTCATTGA 


TAGCCTGTTC 


ATAACGCTCT 


3840 


CGAATAACTT 


CCACCTTTTC 


ATGGGTGTTG 


GTAAAATTTT 


GAAAATAAAC 


CAGATACTTC 


3900 


CGAACATCCG 


GCCACTTGCG 


GTGCATAAAG 


TCAATTTCCT 


TATAAAATTG 


CTCACGGATA 


3960 


GGCGCATCCG 


GTGCCACAAT 


GGCATCTCCA 


GAACCAGAAA 


CCGTACAAAA 


AGTACAGCCC 


4020 


CCATGAGCCA 


CAGTCCCATC 


ACGATTGGGA 


CAATCAAATC 


CCGCATCAAT 


AGGGACTTTA 


4080 


AAAGTCTTTT 


CTCCAAAGAG 


TTTTCGATAA 


TAATCATTCA 


AGGTATTATA 


AGATTTCATG 


4140 


ACTTTCATTA 


TAACAAAAAT 


CACCCACAAT 


CTCAAAAGCC 


TGACTTTCCT 


ATAAATTCCT 


4200 


CTGTTTCTCG 


TTTCCATTAG 


CCTTTTTTTA 


TGATACAATA 


TGGGTATGAT 


TTTAATGAAA 


4260 


TTAGCATCTA 


TTTTATTATT 


GATACTGACC 


TTAGTCGTCT 


GCATTATCCT 


AACCAAACTT 


4320 


TTTAGATTAA 


AAAAACTAGG 


ACGAAACTTT 


GCGGATTTGG 


CTTTTCCAGT 


CTTGGTATTT 


4380 


GAGTATTACT 


TGATTACAGC 


TAAAACCTTT 


ACCCATAATT 


TCCTCCCTAG 


ACTGGGGCTA 


4440 


GCCCTCTCGA 


TCCTAGCCAT 


TATTCTCCTC 


iwrrcrrcc 


TTTTGAAAAA 


ACGCAGCTTT 


4500 


TACTACCCTA 


AATTTATCAA 


ATTCTTCTGG 


CGTGCAGCAT 


TCTTATTAAC 


CCTTATCATG 


4560 


TATATAGAAA 


TGATTGTTGA 


ATTGTTCTTA 


ATGAAATAGT 


CGAATCCCTA 


AGCATTTTCT 


4620 


AGGGATTTTT 


GCTTTCTCTA 


CAAAATAGTA 


TAGACAATAA 


CACTATACAA 


TTTTATACAA 


4680 


AGAAAAGAGT 


CTGGGACAAT 


AGTCTCTTAT 


ATCCAAAAAG 


GCAACGGATT 


TGCCGTTGCT 


4740 


TTTTTGGATG 


GTTACGATAG TCTTGGTAAA 


ATAGAATTGC 


CCAATAAACC 


ATTTAGAAAG 


4800 


GCTATCCCAT 


GCATATTCAC 


TATAACACAA 


ATCAAACAAC 


TTTACCACTA 


GAAATCAGTT 


4860 


CCTTCTTACC 


ACAAGATCAT CTCGTTTTTA CTATTGAAAA AGTGGTGAAT 


ACCTTGGAGG 


4920 


AACGTCACTT 


CTACACCTCC 


TATCATGCCT 


TTGATCGCCC 


GTCTTATCAC 


CCTAAAATGC 


4980 


TTGTATCTAC 


TCTTCTATTT 


GCCTATTCAC 


AAGGGATTTT 


CTCTGGTCGA 


AAAATTGAAA 


5040 


AATGGAAGAG 


TTAGTGACCT TAGATTGTTT GTTTATTGAC 


AGAACTAAGA 


TTGAAGCCAA 


5100 


TGCCAACAAG 


TATAGTTTTG 


TGTGGAAGAA 


AACGACAGAG 


AAATTCTCCG 


CCAAACTTCA 


5160 


AGAACAGATA 


CAGGTCTATT 


TTCAAGAAGA 


AATCACTCCC 


CTTCTGATTA 


AATATGCCAT 


5220 


GTTTGATAAG 


AAACAAAAGA 


GAGGGTATAA 


AGAGTCAGCT 


AAAAACTTAG 


CGAATTGGCA 


5280 
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CTATAATGAC AAGGAGGATA GCTACACACA TCCTGATGGC TGGTATTATC GTTTTCACCA 5340 

TACCAAATAT CAGAAAACAC AGACAGACTT TCAACAAGAA ATCAAGGTTT ACTACGCCGA 5400 

CGAACCTGAA TCAGCCCCTC AAAAGGGACT GTATATGAAC GAACCCTATC AAAACTTGAA 54 60 

AGCTAAAGAA TGTCAGGCGC TTTTATCTCC CCAAGGTAGA CAGATTTTCG CTCAACGCAA 5520 

GATTGATGTG GAACCTGTCT TTGGGCAGAT AAAGGCTTCT TTGGGTTACA AGAGATGTAA 5580 

TCTGAGAGGG AAGCGTCAAG TGAGAATTGA CATGGGATTG GTACTTATGG CCAATAACCT 5640 

CCTAAAATAT AGTAAAATGA AATAAGAACA GGACAAATCG ATAAGGACAA TCAAATCGAT 5700 

TTCTAACAAT GTTTTAGAAG TAAAAGTGTA CTATTCTAGT TTCAATCTAC TATACAATAA 5760 

GAGAATGACT CAAAATTAAA AAGCTAGAGT TCCACAATTG GAAATATCTA GCTTTTTTGT 5820 

GGTTGAGAAC TATTTTGTCT CAGGCTCTTT ATCTTCTATT TAGGACAAGA GTTTTTCTTT 5880 

GGTCTTTAAT GATAAAGAAG GTATCAAAAT TTCTAGTCTT CTTTTTTACC TTTAGTAACT 5940 

ACTAATCCTG CACTCAAACC TAGAAGAGTT AAACCTGCTG CTACTGCTGC TTGGCTTGCC 6000 

GCACTACCTG TACTTGGTAA CTGGGCTTTA TTAGTTTGAC TAGCTTCACT TGAATCAATT 6060 

GGTTTTGTAT CTGCTTTTTC TGACACTTGT GCTTTTTTAG CTTCTTGAGC TACTGGTTTG 6120 

GTTCCAACCA AGACGATGCG GTCTGTCGGA ACTTCTACCA CTTCACGGAG TTTTTCTTCC 6180 

TTACTTCCAT CAGGATTAAT CGCTGTAAAG ATACGTTCTT TTCCAACTTT TCCTTCTTGT 6240 

TCTACACGAG TTTCACCTAG ATACAGTGTT GAATCTTTTT TCTCAACTGT CTTGTATGCC 6300 

AAATCTTTTT CAACAAATTC GATTTTTGGA AGATCTTCTT GTACAGCAGC AACTGTCTTC 6360 

TCAGAAACTG GTTTTTCCTT AGTCAAGTGG ATACGGTATT CCTTGACTTG TTTTCCACTT 6420 

TCTGAAACGA GGCGAACAAG TACTGGAAAG CTATCTTCTC CACTATCTAC CACAGTTGAA 6480 

GCTACTTGAT TGTTTTCTTC AACTGAGACT TTTGGCCGTT GACCTTTATA GGTAATTTGA 6540 

TAGTCTTGAC GATTTTCAGC GAAATCAGCA AGTTCTTTTC CATCTACAAG AATCTTTGAT 6600 

TGAGTGCTTT CTTGAGGCAA TTCACTTGGT GCAAGGAAGG TCATCTCAAT CATCGCAACA 6660 

CCGCTCTTAT CTG C TTTACG CTCCATACGC CATCTCATAG CTTTGGCTTT GATAGCTTTA 6720 

AATGTTACGT TGATTTCATC ACCACCTGCA ATGTCTTTAT CCGCACGATA AGGAACAGCT 6780 

TCCCAATTTT CTGGATTGTT GAATGGATGG TCTGCGTCGT AGGCTTGGTA GTTTGAATAC 6840 

TAGGTTGGCA CTTCAAACTC TGGACCGACA TAGCCTTCTA AAACGAGTTT AGATGGTGCA 6900 

TCCGTACCAC TATCTGCAAA GAACTGAACT TTTCCTTGTG TAACAGTCCG TTCTACAATC 6960 

TTACCATTTT CACGGAAAAT CACACCCGCT GATACTTCTG GATTAGAAGA TGGTGTTGGT 7020 

GACCAGTTTG TCCAACGACG ATTTTCTGAA TGATCTCCGT CATTGAGATA GTCAACGCGG 7080 
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TCATGAGAGT 
TCATAGTTAG 
TCAGCAACAA 
AGATTTTCCC 
TTACCTGTGA 
AAGCGAGCTT 
AAGTGCAGAC 
GGCTCACCTT 
GTGACTGCCG 
AAGTTTGCTA 
ACTTGTTTCG 
ACCATGCGAA 
CCAGCCTTGA 
TTAAAGACAT 
TTACCAACAT 
AATCCGACAC 
ACACCGACTG 
TTATTGTAGG 
GAAGCAAAGG 
TCAGTTTGAG 
CCATTAACTG 
ACCTTAGCTG 
GGTGCTTCTG 
GAGACTTTTG 
TGATAGGAGT 
AAACCTTTGT 
GTCACTACAT 
GTGAGTTCAA 
TTATGCAACT 



TTTTGTCAAT 
GGTTATCTGA 
GGTTACTACC 
CTGGAACTTC 
AGTAAACTGG 
GTTTAACCGC 
GGTATTCTCC 
CACGAACGCT 
GAACTTTTCC 
AGTCTTTGCC 
CAAAGATTTG 
TACGAACAGC 
GTTGAGCAGG 
GGTCCTCATT 
AATACTCAAT 
TTAGATTATC 
AAGCTTCTGG 
AAATGAGCTT 
CAAGTGGCAA 
CAGATACGCG 
TAAAGACACC 
ATGAAACGTG 
CGATTGGAGT 
CACGCGCTTC 
CTAGTTTTTC 
CATACTCAAC 
TTACAGGACG 
CTTGGTCTTT 
CAAGCATTCC 



ATCATTGGTT 
AAGAGTCTCA 
AAGGACACGG 
TTCCCATTCA 
AACCTTAGTC 
AGCAACTGGT 
TAAGATGTCG 
TGGAACGACG 
ATCTACAGAC 
GTCAACTTGG 
TACCTCTGTG 
ATAGGTTTCA 
GGCTTTTAGA 
ACCAACAAAA 
CACATAAGAC 
AACGGAGCGT 
ATTAGTACGA 
CTCATTAACA 
TTCTGAACCG 
AACATGAAGT 
TTCCTTAGCG 
ACCATTTGAA 
TGTCACACTG 
AAGGTCAATT 
TTTCGGAATA 
TGTTACTGTT 
GATGGATTGC 
AGCTCCCTCA 
TTTACGAATT 



GCTGAAGCAA 
CCAAGTTTGT 
CCTCGAACAG 
ACTGTCAGGT 
GGCAATTCAA 
TTATGAGAAA 
CCATTTTCAG 
GTAGCGAGAC 
TCAAGGTAGT 
ATTCTTGTTT 
ATAGACGTTC 
ACTTTATCAA 
TTAGTAACTG 
CTAGGGTTTT 
TTCGGTACAC 
TTGCTCAAGA 
TTCCAGTTTG 
TTTGAAACTG 
GTCCATTGGT 
TTAGTTGTTA 
TATTGCTCTG 
TCATATCTCC 
ACTTCTTCAA 
CCTTCAACTT 
GCTTGCCAAG 
GCTGGAAGAC 
GCAATCTTCT 
TATTCAGCGT 
GCGACTTCCC 



AGGCCTGGTT 
CTGTCACTCG 
TAAATTGACC 
CTTTTGTTTC 
GTGCTTGACC 
GTAAGCTCTT 
CTTTCGCGAT 
CATTGTPGCT 
AGTCTGTCAA 
GTCCTTGCTT 
CACGCTTGTT 
AGCTAAAGTG 
GTTTCCAGTT 
TAGGAGCTGT 
CAACTCCATG 
TACCTGAATC 
TCCAACGATT 
GGTCGCTTGG 
CAGAAATGTT 
ATTGCGTACC 
GACGAATCGC 
GAACACTTTC 
CTGAAACGAT 
TACCTAGTAC 
TGACTTTATG 
TTGGTTCCTG 
TCTCAGTATT 
TCAGAGTGAC 
CTTCACCACT 



ACTGTTTTCA 
TACAGTGATC 
TGCTTTTGTC 
GTAGCCGTCT 
TACTTGTAGC 
ATCCTTAGTG 
GACACGAACT 
AACACTTGCT 
ATCAGGGTTG 
GGCTGCCGCA 
ATCTGCTTTA 
GTTCATTTCT 
GGCAGAATCA 
TGGGACAGTC 
GTCTTCATGG 
TCCAAACAGA 
GCCTGGTTGG 
ATTTGAGTCT 
TGCACCTTGC 
TTCTAAGCGA 
ATCCCATGCA 
TGGTAATTGT 
ACCTTCTACA 
TTCAAATGTT 
AGTTTTAGGG 
ATGCAAATCT 
GGCTTGGATA 
TGCTCCTGGC 
TGTAGAGAAG 



7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7900 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 
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GTTACTTTAT CAGCTGGTAA TACAGCTTGC GTTCCATCTT GATAGTGAGC TCGAACCGAC 8880 

AATTTGACAG TTTGGTCTTC TTTGAGACTG TCAGCTTTTT CCACTTGCAA GCTCAAGTGA 8940 

GCAATTTTTG GCGCTTCTTC AAGGAATTGA ATTGCATAGG TTTGAAGAGG GCCACCATCT 9000 

TTAGGCTGAA TAAAGATGCT CGCACGCATG CCGTTTGCTG CGCTTGCTTG AAGAACTGTA 9060 

ACAGCTGCAT TTTTAGCACT TGCTGTGACT TCTGGCAACT TAGCTCCATA ACCAAGAGTG 9120 

CGGTATTGCA TTGGTTTTTG ACTAGTAAGA CCTGTTACTG CCTCACCACC AACCGTTACA 9180 

GTTGGTACTG CAGGTGCCGC AGGATTGCCT TCTTCTACCA CAAGGGTTGC ATGAATTGGT 9240 

TGACCTTCTA AATAACCGCT CGCTTGAATA CGAGAACCTG GAATTGCTAA CTTAGCTTTA 9300 

TCTTCTTCGG CAATCTCCCA CTTGTCCACT TCATACTCTT CAACACTTCC ATCAATCAAA 9360 

ACATAGGAAA CAGATTTGTC TACAGAATTC AAGTCAGTAT TTGGAGCAAT ACGTTTCACA 9420 

ACTGGTAGCT CTGATTTAAG AGCAATCACT TCTACACGAG CTTCTACTTC TCGTCCGTCA 9480 

GCCATACCTT TCACCGTTAC AATACCAGGC TTGCTCACAT CTACTGAAGA CCAGGTTACA 9540 

GGACGTTCTG CACGGCTACC ATCACTGTAT ACAAACGGAA CAGTGGTAGG CATTTCAGGT 9600 

GCCTCTCCAA TAATGGTCTG TACTTTTGGC ACTTCTGTCC CCAAAACAGT CTTCTCTTGT 9660 

CCTTCTTTCT TACCAGTAAA GACAGTGACT TGGTTCGATT TCAAGAGATC AGAGTGGGCA 9720 

GTCAGGGTGA ATTTCCCTGC TTGTTCAGTT GATTTGACAA TGGCAACACC TTTACCATTA 9780 

AATGCTTTAC GAATCCAAGA ACCATCTGCT TGCGCCTTAT AGCGTTCACG GCTGGCTTGT 9840 

TCTCCGTTAT CTACACCGAC CAGTTGACCT TCGCCATGCA ATTGGAAGCG AACCAGATTA 9900 

TTAGCAGTTG GAACCACATT CCCCTGGCTG TCAACAATTT CATACTAGAT GTAAGTCAAG 9960 

TCTTTTCCAT CTGCTGCAAT CGCATGGTCT TCCTTAATAA GACGAACTGC CGCTGGCTTA 10020 

CCAGCAGTCG TAATCTTATC TCGAGCAATT TCCTTGCCAG ATTCATCACG AGCAATTGCT 10080 

TCCAAGGTAC CTGGTTGATA GGCAACTTTC CATTCAAGAT AAAGTTCATT AGCATTTGCA 10140 

CCTTCTTGGT AAGTCCGCCC ATCGCTGGTT TGTTTTTTAT TGAAAGTCTT AAGACCAAGA 10200 

GATTTTCCAT TCAAGAACAA TTCTACACTA GAAGCATTCG AATAAGCACG AACTGGAATC 10260 

TTACCTTCTG AGTCAGCTAC TTTGGATGCT AATTCTTTGT TTTCCCAGTT CCAGTGAGGA 10320 

AGAAGGTCTA CCATCGGTTT CTTCTTAACA GAAACCCATT GGCTTTGGTA GAGATAGAAG 10380 

TCATGTTTTG GAATGCCCGC TGTATCTACG ATACCAAAGT AAGAGCTCTT AACAGGAGTT 10440 

TGATTTTGGT TGTGCCATGG TGTAGGTTCA CCAATATAGT CCGTACCTGT CCAGATAAAC 10500 

TGTCCAGCAT AGCCAGCGTT GTCACGGTCA AAAGTCCATG AAGCGGTTGC TGTTTTCCCC 10560 

CAACCCACAC GATCATTTCC ATAATCTGAC TGTTCATAAT TACGCTCAGG TCCATTGCTA 10620 
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TGTTTCAATT CACGTTCAGG GCGATAGTAA CTTCCACGTC TACGGGTAGC TGAAGATGTT 10680 

TCTGATCCAT AAATCAACCA TTTTGGATGC TTAGCTCTAA GGGCTTTGTA ATTATCTTCA 10740 

GAATAGTTAA ATCCAACAGC ATCGAGTTCA TCAGCAATTT TCTCATCCCC TCCGCTACCA 10800 

TTACCGAAAC GGAATTTATC TGCTCCCATG GTAACATAGC GAGTCTTATC AACATCCTTG 10860 

ATAACCTTAA CCAAACGTTT AACAGTTGCT AAAGAGTGGG CATCACCATT AGCTTCACCT 10920 

ATTTCATTAC CAATTGACCA CATGAAGATA GCAGGGTTGT TTTTGCCTCT TTCGACCATG 10980 

GTACGTAGGT CAAAATCAGA CCATTTTTCA CCTTTTCGAG CTTCTGGGTG AGTGGCATCT 11040 

TTTTCAAAGA AACGTCCATA GTCATAAGGT TTCTTGCCAC CATACCACGT ATCAAAGGCC 11100 

TCTTCCTGAA CGAGTAAACC TAGTTCTGCT GCGATTTGCA AGGTTTGCTC ACTAGCAGGG 11160 

TTGTGGGTTG TACGGATGGA GTTAACTCCC ATCTCCTTCA TTTGTTTGAG ACGGCGATAT 1X220 

TCTGCTTTAT AGTTTTCTTC TGCTCCAAGC CCCCCATGGT CGTGGTGCAA GGATACTCCA U280 

TGCAATTTAA TACGTTCACC ATTCAAAGAG AAACCTTCAT TTGGAGTCCA GTGATAGTAA 11340 

CGGTAACCAA ACAAATCCTT CTTAGCATCA ACCAATTGAC CGTCACGGTA AACACGCGTA 11400 

ATCAATTCGT ACAAGGCAGG TTTGTCATTT AAAACAGTCC AGAGTTTTGG TCTTTCAACT 11460 

TCTAAAATCG CATCTAGGCT TGTTGATTCA TGTGCTTTTA AGGTACGACT CGCTGTACGA 11520 

ACTAAGCCTG TTACAGCATG ACCACCTCGT TCAACGATTT GATATTCGGC TACAAGTTCA 11580 

TGGTCTTTGT CGTCCGTATT GACGATTTTG CTGGTCACAT GAGTTTCAAC CTTGCCATGT 11640 

TGTTGTTCTT CAAGTTTTGG TGTTAAAATA GTTGTCCCAT TTTTCTCAAC ATGCACCTTA 11700 

TCTGTCACTT GTAAAGTCAC ATCACGATAG ATACCACTTC CTGAATACCA ACGGCTACTT 11760 

GGCTGTTTGT TGACTGCATG GACAGCAATC ACATTCTCAC GACCATCTTT TTGAAGGTAT 11B20 

TTGGTGATAT CATATGAGAA CTGGTTATAA CCATTTGGAT AATGCCCCAC TAACTGACCA 11880 

TTGACATAAA CTTGAGAATC CATGTAGACG CCATCAAAAG TAAGGCGAAC ATTTTTCTTG 11940 

AGGTCTTTTT CATCTAGTTT GAAAGTCTTG CGATACCAAG CTTCCCCACC GTTGAGCTGT 12000 

CCACCTTCAT TTTGTGCAGG AGATTCATGA TCGAAATCGT TAAAGATACT CCAGTCATAC 12060 

GGTAAATCTA ATTTTTTCCA CGTAGATACG TCTGCATCAG GTTTAATGGC TTCCTTAGAA 12120 

TTTGCATTGA GTTTAAAGTA CCAATTTTGA TTAAAATCCA CTTTCCTGTC TTCAATCATT 12180 

TGATTCACTT CTTCATTTGT TACAGCTTTA GCATCTTCCT TGAGCGGTTT TTCTTGATTT 12240 

GAAGCTTGTG ATTCTATCCT TGGAGCTTTT TCTTCCGGTT TAGCAGACAC TTTTTCCTCT 12300 

TTTGGAGTTA CGGCTTCATC TTCTTTCTTC TCAGATGCAA TAGCCTCAGT TGAACTAGGT 12360 
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TCACTTTGTT 


CTGTCCTTTC AACTATATTT 


TTAGTTTCCA 


AAGCTTTATC 


AGCCTTTTCT 


12420 


TCTACTATCA 


TTTTTTCCTC TTTAGGTTTC 


TCAGCAGTAT GAGTAATAAG 


TGTTTCATCC 


12480 


GCATAAACTA 


CAGATTCTCC AGCTATATTT 


CCTCCTAATA 


AAACTGCACA 


AGTCCCAATC 


12540 


ATTACTGAGC 


AAGCTCCCAC AGCAAACTTA 


CGAATGCTAT 


AAACTCTTTT 


CCGATTCCAA 


12600 


TGGCCTTTCC 


CCATAAAACC CTCCTTATAT 


TATATTTAGT 


GCAGTTAGCT 


ACTACCAAAG 


12660 


CCCAAGTGGT 


ATACATGGTA TGACAACCTA 


CTTTCAACAA 


TTTACACTCT 


GCGAAAATCC 


12720 


AATTCAAACT 


TCGTCAGTGT CGCCTTGCCG TAGATATGAT TACTGACTTC GTCAGTTTCA 


12780 


TCTACAACCT 


CAAAACCATG TTTTGAGCTG 


ACTTCGTCAG 


TTTCATCTAC 


AACCTCAAAA 


12840 


CCATGTTTTG 


AGCTGACTTC GTCAGTTTCA 


TCTACAACCT 


CAAAACCATG 


TTTTGAGCTG 


12900 


ACTTCGTCAG 


TCTTATCTAC AACCTCAAAA 


CTGTGTTTTG 


AGCAACCTGC 


CGCTACCTTC 


12960 


CTAGTTTGCT 


CTTTGATTTT CATTGAGTTT 


ATATTTTATA 


GGAGCGCATT 


ATTTTGCTTT 


13020 


TGCTGCGTAC 


TCTTCGTTAC GTTTGATCAT 


TTGTTTTCTG 


TACCAAGCAA 


AGATACCGAT 


13080 


ATAGAATACA 


AGGAAGACTA CTGCACCAAG 


GATTGCTTTG 


ATATCACCAC 


TTGTAGTGTT 


13140 


ACCAATTGTC 


CAACCAAGAA GTTTTTCGAT 


TGGTCCTTCA 


AGAGTAGAGT 


GAGTAATCAA 


13200 


TTGAGTTTGG 


CTCACACCTT CTGGGAAGGC 


ACCTACACCT 


TTAGCAAGTT 


CTCTTGCAAA 


13260 


TGCTGCAATA 


AGTGTACCTG AAAGAAGGAA 


GAGTGGCAAC 


AAGAGTGTTC 


CGAAGATAAT 


13320 


CATACGGAGC 


AATTTACCAC GAGTTACAAC 


CAAGAGAGCT 


GGAGTAACAC 


CCATAGCGAT 


13380 


GATACCTGCA 


AGTGGCAAGA TACCATTTCC 


AACTTTTGAA 


AGAAGCACTG 


CTTCAATCAA 


13440 


CATGATTGGT 


GCAAGTACGT TGGCACAAGC 


CCAGATTTCA GCACGACCAG 


CGATGAATGG 


13S00 


CCACTCAAGA 


CCGATATTGA ATTTACGTCC 


TTGAAGACGT 


TTAGTAGCAA 


CGTTTGTAAT 


13560 


ACCTTGTGAT 


AGTGGTTCTA CGGCTGCGAT 


GAACCATGAA 


CCGATAAGTG 


AGAAGAGTTC 


13620 


CAAAGATACA 


CCGGCAGTCA AACCAAGAGA 


CAACCATCCT 


TTGATAACAA 


GACGCCATTT 


13680 


ATCTGCATCT 


GCAACACCTG CAATTGGATG 


TGGAGTTCCC 


ATAATACCGA 


TAACGATACC 


13740 


AAGGATGAAA 


CCGATGAAGA ATTTAGATCC 


CCAGAAACCG 


ATTTTCTTGT 


TCAATTTAGC 


13800 


AGCATCAAAG 


TCATATTTAT CAAGGCCTGG 


GAAGAATTTT 


TCAAAAATCT 


TATCCAAAAC 


13860 


CATGATAACT 


GGGTTCATCA TGTAGTTCAT 


GTGAGTTGAT 


GTCATTGGTG 


ATGAACTTGG 


13920 


GGCGTTAAGA 


AGGTCATCAA ATGTAGGTTT 


CATCAAGTCA 


GAGTTGATAA 


TTTTCAACAC 


13980 


ACCCACAAGG 


ACGATAGCTG CTGTAGCAAT 


AAACAGTGAA 


ACCCCTTGAC 


TCACACCATT 


14040 


GTTATCAGCA 


TACCATTTAA TCAAGAGACC 


TGTGATAGAC 


AAGTGCCAGA 


TATCAAAGAT 


14100 


ATCGACATCA 


AGTGTATCTG TTTTCTTCAT 


AGCTAGCATC 


ACTATGTTGA 


CAATCAACAT 


14160 
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GATGAGCAAG 


AAGTATAGTG 


TCCAAGCAGA 


ACCCCAAGTG 






1 A "5 O fl 


ACCAACGTCG 


GTAATACTCA 


ATTGGATACC 

*» A> 4 \J"tam • oW 


Av 4 * 4 4 I Wf\ 




V* 1 AO 1\*A luL 


l j! inn 


TGAGAAAGCA 


GTGTTTAGCA 


TACCGATYIAT 

A a»V* \*i VJ«» A \jr\ A 


AGCACCGATA 




tbA AAu 


1 J* 1 >t ft 


TTTGATACCA 


CCTTCAAGCG 


CTTTGGACAA 




aaaavj 1 AAAu 


LuAATAuiVi 


14400 


CAAAATGATT 

aa aai V- 9 Xa>al ■ 4 A 


AACATGATGA 


CAGGTCCACC 


f* ATT^Pf^T A A 


A TYWI A *PTV* A 
A i \r<&*rt\ I I \»A 


AAALV.TTTCC 


144 oO 


GATTAGGTCA 

**** * 4 *»\/^J A V* aa 


AAGATTGCAT 






1 1 V»A 1\j 1 I AT 


ATGAATGTTA 


14320 


ACAAATTAGA 




X \»l*V) l\» 1 1 U 1 


1 1 AA 1 /iV>^ 1 w 


<- 1 f A 1 1 


GTCAAATACT 


14580 


GGAGCGCTCA 




nt_VJVlAA 1 /W\\J 


A 1 1 uol^vl* Au 


v. 1 IVIjATAAL 


TGGGATACCT 


14640 


CGTTCAAAAC 

* A ^#wwv\^ 






Vjvj L u 1 ftAnun 


1 ATCCTAACC 


TTTCATAAGG 


14700 


TCTTCGTTTA 


CATCTTTCAC 






v. ATCATAACC 


ACGGTTTGAA 


14760 


AGTTCTTCTT 


CTAGAGCACT 


1 I 1AA1 1 t\iu 


1VAW IT, tiA(iT 


TAACACCTGC 


ACCGCAGGCA 


14820 


GCAAGAATTT 


TAATCATTTA 




t*A I 111 A ill 


TTTAATAGAC 


AAGATTAAGC 


14880 


GGTTGCTTCA 


GCAATGTAAG 


TATAAAGGGC 


TTCTGGTTCA 


GAAATrrrit; 


ATAGGTCTTC 


14940 


AAGATGACCA 


TTTCCTGTGA 


AGAAGTCCAT 


TAACTGAGCA 


AGAATGTTCG 


TTTGACTTGA 


15000 


ACTTGAATTA 


TTAATGATAA 


AGAAGAGTAG 


GGATACTTCT 


ACTTCCTTAT 


CAGGAGCTAT 


15060 


CATATTGTGA 


AAAGTTATTG 


GTTTTTCTAA 


TCGAACAACC 


ACCACTTTCT 


CAGCTAGATT 


15120 


ATGAACAATA TCTGTGTGAG 


GAATCGCTAC 


ATTTGGCAAG 


TCCTTTCCTA 


GAAATTCCAT 


15180 


ATCTAAACCA 


GTTGGAAATG 


ACTTTTCACG 


CGTGATCAAG 


GCTTCACGAT 


AAGTTGGAGT 


15240 


GACAATTTCT 


CGTTCTTCCA 


ATAAAGTTGC 


AACCTGATCA 


AAGAGTTGTT 


CTTGACTATC 


15300 


CGCTTCTAAG 


CAAAACACAA 


GGTTTTTGTC 


AAAGAAATAA 


TCTAATACCA 


TAAGTTTTTC 


15360 


CGG 












15363 



(2) INFORMATION FOR SEQ ID NO: 140: 

U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 28882 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 
TAAGACTATT TAATAGTGGA GTGAAATAGG ATACGAACAA ATTGATTAGG AAAATCAAAT 60 
GAATTTATAG AAATCTTTTA GCAGTTATGT TATCCTATTC TAGTTTCAAA ACGCTATAGA 120 
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AGCAGCATTG TGCTAGTCkA GATTCAGTTT 


ACTATACTAA 


AACGAGTAGC 


TTGAAATCAA 


180 


AAAACCCACC CTCACAGGCA GGTTTTATCT 


GTATTATTCA 


GCTAGATTAT 


GCTTTACCTT 


240 


CTGAACCGAA TACGTCGATA CGTTCTTCAA 


CCGATCCTTG 


GATAGCTTTT 


ACACCGTCAG 


300 


CCAAGAATTT ACGTGGCTCG AAGAGTTTTT 


TCTTGTCGTA 


TTCTOJTTCU 


TTTGCTTCCT 


360 


AGTCACGAGC AAATTTACGA GTTGCGTTAG 


CGAATGCGAT 


TTGGCATTCT 


GTGTTAACGT 


420 


TAACTTTGGC AACACCAAGT TTGATAGCTG 


CTTGGATTTG 


CTCATCAGGA 


ATACCTGATC 


480 


CACCGTGCAA TACGATTGGG AATCCTGGAA 


GAGCTTCTGT 


CAATTTTTGC 


AAGTGGTCAA 


540 


GGTCAAGACC TTCCCAGTTT ACTGGGTAAG 


GACCGTGGAT 


GTTACCGATA 


CCAGCTGCCA 


600 


AGAAGTCGAT ACCAGTTTCA ACCATTGCTT 


TAGCGTCTTC 


GATTGGAGCC 


AATTCACCTT 


660 


TACCGATGAT TCCATCTTCT TCACCACCGA 


TAGTACCAAC 


TTCAGCTTCT 


. ACTGAGATAC 


720 


CTTTAGCGTG TGCTTTTTCA ACAACTTCTT 


TAGCCAATTT 


AACGTTTTCT 


TCAACTGGAA 


780 


GGTGTGAACC GTCAAACATG ATTGAAGTAT 


AACCAACTTC 


GATACACTCA 


AGTGCATCTT 


840 


CGTAGTGACC GTGGTCAAGG TGGATAGCTA 


CTGGTACAGT 


GATACCCATT 


GATTCAACAA 


900 


GGTPAGCGAT CAAGTTGCGA GCAACTTTGT 


AACCACCCAT 


GTATTTAGCA 


GCACCCATTG 


960 


AAGTTTGGAT CAAAACTGGA GCTTTTTTAG 


CTTCTGCTGC 


GCGCAAGATA 


GCTTGAGTCC 


1020 


ACTCAAGGTT GTTTGTGTTA AATCCACCAA 


CTGCATAACC 


GTTGTCACGG 


GCTGCTTGGA 


1080 


CAAATTTTTC TGCTGAAACG ATTGCCATTT 


TATCAGGCCT 


CCTGTATATT 


TTTATGGGTC 


1140 


ATCCCATTTA CATTGTTCAT TTTATCACTT 


TTTGCCAAAA 


AAATCTAGTT 


TTTCCCGCAG 


1200 


TTTCGATTGA TTTTCTTCTA ACTCCATCTA 


TGTAAACCCT 


TTCTCTCCCT 


AGTCTTGGAC 


1260 


GACTTTTGGA AAATCTATAA AGAAGGTTAA 


ACTATTCTCC 


TCCATCTCGA 


AACGATAAGC 


1320 


TAATTTTTCA TGTTCTAATA GACTCTTAAC 


CACAAAGAGC 


CCCATACCAG 


ACCCCTTGAC 


1360 


CTTGCGACTG GCATTGTCAG AAAAAGACTG 


GGCTAGTTTT 


TCTTGTTCCT 


CTGAGCTACA 


1440 


GCTATTTTCG ATAAAAAGTT CTCCTTCTCT 


TTCTCCAATT 


CGAACTAAGC 


CACCTGGAAC 


1500 


AGAGTGCTTA ATGGCATTGC TGATGAGATT 


AGAAAGAATC 


AACTTCATAA 


CTGATGGGTT 


1560 


TAGATAAGCC TGCTCATGGC TCAAACTATT 


GTCTATCTGG 


AGCTCTCTTT 


CCTTGGCTAG 


1620 


CAAGGCATAA TCTTTGACCA GATTTTGCGT CATCTGGAGG AGGTCAATTG TTTCCCTATC 


1680 


ATCTCGCAAT TCCTGCACAG AAGAGAGGGA 


AAGTATCTGC 


AGAACATGGT 


GATTGAGTTC 


1740 


ATCCACAATC CCCAAGGCAA CTCCCAGATA 


CTGGTCTCTA 


TCCTTATAAC 


GACCGATATT 


1800 


CTCTCTCATA TTTTCGATTA GGATTTTCAA ACTAGCCAGC GGTGTTTTCA ATTCATGAGA 


1860 


AGCTCCTCGT AGGAATTCGA CCTTCATCTT CTCCAGCTGG AGAATGGCTT CATTCTTTTC 


1920 
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ATGCAAGTCC CCAATAACAG TCAAGAGATG CTGGTAGACG CTATTGATTT GTTCCTTGAG 1980 

ATTACCTATC TCATCCTTAG AATCCACGCG CAATCGCACT TGGGAATCCA GGTCCATCAT 2040 

CCGACGGGTC ACCCGCTTGA TTTCCAAAAT CGGTGCAACA ATAGTCCGAG CGTAGATGTA 2100 

GGCCACCAAA AGGGAAATCA CAAAGGACGC CAGCAACGTA TAGGGAAGAA ACTGGAGACT 2160 

GATTTGCTCC GCTTCCTTTT GTAAATCCAT GGAAGCTAGA AACTGGAGAA TCATAGTACC 2220 

ACCCTCTTGC GTTTTCACCT CGCGCTCCTC AATAAAGAGA GAGGTTGTCT GGCGGTCTGT 2260 

GTCCAGAGGA AGACTGTCCT TGACTTCTAA CTTCTCCTCG GTCATCTCAC CTTTGACGGT 2340 

CCCCTTGATA TCACTAGTCT GGGAATACAA GTCTAACACT TGCTCGATAC TCTGCCTATC 2400 

TTTCCCTTCT AGGGACTGGG CAATGGCTGT TGCCTTTTGA CCAATGGTTT CCTCACGATC 2460 

ACTCAGATAA GTCGAAGGAA AAAGAAAATA AATAGCTAAA TGAAGGCAGA TAACCAGAAC 2520 

ACTAAATATC GAGAAGGTAT AGATAAATAT CTTTGCAAAT AAACCTGTTC GTTTCATTTT 2580 

CGCTCCAATT TATAACCAAC ATTGCGCACA GTGAGGATAC AATCCAAGTC TAGCTTTTTC 2640 

CGCAATTCCT TGATATAAAC ATCAATAACA CGGTCAAAGG GAACCTCATC TGTCGCTTTC 2700 

CAGACGGCAT CGATAATCTG AGATCGAGTC AAGCCCCGGC CTTCATTTTT CACTAGATAG 2760 

TCCAGAATTT CCAACTCTTT GCCATTGATA GGCACTTCTT GACCTGCGAG GCTTGCACTG 2820 

TAGCTTTCAA AGTCCACCTT GGTATCCTTG TAAGAAAAGA TTCGTCCTGT ATCGTAGTAG 2880 

CGCTTGAAAA TCGCGTCCAC CCTCACTTTT AAAAGGGAGA GGGAGAAAGG TTTTTCCAGA 2940 

TAGCCATCTG CCAAAGAGGC AAACCCACTC ATCTTGTATT CCTCATCTTG AAAAGCTGTC 3000 

AACATCAAGA CAGGAACCTG ACTGGTTTTA CGAATCTCAG CTAGGACTTC TAAGCCGTTG 3060 

AGCTTGGGCA TCTGGATATC CAGTAAAACC AGGGCCACCT CATAGCTAGA AAATTGCTCC 3120 

AGAGCTTCCT GACCGTCCGC TGCCTCAATA GTTTCATAGC CACAATCCGT CAAATAATCA 3180 

CTGACCCCCT CACGGATCAT CTCTTCATCT TCTACAATTA AAATTTTCAT ACTTTAACTG 3240 

CTCTCTATTT TTTATTTTTC TTAGAATAAA TACCTACCCT ATTTTCTATT ATAGTCTCTT 3300 

GCTGGCCTTT TGTCTGCAAG CAACTGACCA CTAGATAAAA CGTTGTGAAA TTCCTTTCTC 3360 

ATAAATTCCA TAACTTTAGT ATATTATATT TAAGCACTAA AGTACAAAGA AAGCAACTGA 3420 

AAGCAATGAT TTTCACCACT GCTTTCGGAT TTATTTTGAA TTGTTAAATA GCCATTCCTA 3480 

TCCACTATTC TTGAATAGAA ACACAAGATG CAATCTTTAT TCTAGACTCA TTTTTTCAAA 3540 

TTTATTCACC ATCCAGCAAG AGCTCTTTTG GTTGTTTTCT AAGGAGATTG CTTGAAGCAA 3600 

GCGCCATAAC GAGAACCACT AGAACCAAGG CAAGGACAAA AATGATGATA AAGTCTGATG 3660 
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TCTCAATCGA AATGTCTAGG CTCGACAAGG TCTTGCTAAA 


, GCCATCTACT 


TCTGCACCAC 


3720 


CACCAAGGTT AGAGGCTTGA GCCGCCTTAC TAGCCTGTTT 


GGCAACACCT 


GAAGTCACAT 


3780 


TGGCAAGGAC AGTGTTTCCA ATTGCACGG6 CAGTGTAATT 


AGCTAGGAAG 


TAAGCAGAAA 


3840 


CTAGAGCAGG GATAGCAATC AAGATAGATT CGGTGATGAA 


TTGACCCAAG 


ATACTTGCCT 


3900 


GCTTGAGGCC GATAGAGAGG AGAATTCCCA CTTCCTTGCG 


ACGGGCGTTG 


ATCCAAAGGC 


3960 


TGAGCAAGAG GGCAAGGAGG AGAACTGAGA AGCTCAAGCT 


ACCCCAGAAG 


AGGAGGTTGG 


4020 


CCATCTTGTA CATACCAGAG ATAGATTGCT CAAGAGCTGG 


GTAGTTAGAG 


GAGCTCTTGA 


4080 


CGAGTGTGTA GCTCTTCCAG TTGATACCAC TGATGCCATT 


CAACTCTTTC 


ATAACATCAT 


4140 


CCAAGTTCTT GTCTGCTGTT ACAAAGAAGG TTGCGTCCCC 


ATAAATGGCT 


GTGTCTTCTC 


4200 


TGTATCCATA AAGTTTTGCA GCAGTGTGAA TGTCTGTAAT 


AGCTGTGTTT 


TCGTAAAGTT 


4260 


CTTGTGAGTA GGTTACTGCT GACTTATTAT GACCATCAAA 


GAGTCCCTTG 


ATTGTCACTT 


4320 


CAACTGTTTC CTTGGCTCCT TTTTCATTAT CTGCATCGTA 


GATATTAGAG 


TCCAGTTTAA 


4380 


CCTTGTCCCC TACTTTCCAG CCGTGTTTGG CTGCCAAGTC 


CTTGTGCAAG 


AGGATTTTAT 


4440 


CCTTGTCGTC GTTGGTTAAG TGCTCTCCTT CGACTAGTTT 


ATAAGAACCA 


GAGACAAACT 


4500 


TGTCTTCTTT AGAGGAGTCA TTGACACCTG TAATCATCAA 


GCTACTTCCA 


AAACGCTTGG 


4560 


CACGATCAGC AGTGAGATTC TTCTTGGTTT CTGGCGTTTC 


AATCAGGTCA 


TATCCAGTCA 


4620 


AATCTCCGAT AGCGTTGATA CGTTTGACAT AAGACTCAAT 


GGCCTTGTTT 


TCGGTGATTr 


4680 


TTTTGATGTC TTCACCCTTG ATATTCCCAG CACCACGAGG 


CGTTCCTTGG 


TTGACGCGAC 


4740 


GATTGATTTG CATGGAGAAG CTATTGGTGA TATTTTTAAA 


GGTCTCCTGA 


GAAGCCTTGG 


4800 


CACTAGCTCC CTTGATTGAC AAGCCGACCA AACTCAAGCT 


CGCCATGAGG 


AGAATAATCA 


4860 


GGAAGATGAC AATCGATTTG AAAAACTTCC TTGTAACATA 


GGCAAATGCG 


TTGTGTAACA 


4920 


TACATTCCCT TTCTAGATTT TGTTTTAATC ATTCTATTAA 


AATAAGCTCA 


AATTATTTAC 


4980 


TAGTATTGCG CGTTTCAGTC AGTTTCTTAT CCTTTAATTC 


AAGTGTAATA 






GTGCCACTTC TTTACTGTGA GTTACGACAA TCACACATTT 


ACCTGTTTTC 


TGGGCAAGTG 


5100 


ATTTGAGTAG TTCGACAATA TCTCCAGCAG TTTTAGGATC 


CAGATTTCCT 


GTTGGCTCAT 


5160 


CAGCTAGAAT AACTGGAGCT TCTGAGACCA AACTGCGAGC 


AATGGCAACA 


CGTTGCTGTT 


5220 


GACCACCTGA TAACTGGAGA ACATTCCGCT TGATCTGGCT 


TTCATCCAAA 


CCAAGCTCAA 


5280 


GAAGTGTATT CTTGCTTGCC TTTTTUTTtiA OCAATCGGAT 


ATTTTCCAGC 


GGAGAAAGAT 


5340 


AATCTATCAA GTTATAATTT TGAAAGACCA GGGAAATATG 


GTGCATGCGA 


TGGTAAGAAT 


5400 


AGCCCTTCTT ACGAATATCC TCTCCTTGAA AAAGGATAGA 


ACCTTCAACA 


GGACTATCTA 


5460 
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GACCAGCAAG TAGGGACAAG AGTGTGGATT TTCCTGCTCC TGACTCCCCA ATAATACTGT 5520 

AAAATTTTCC GGGTTCAAAA TTATAATTGA TCTGATATAC GACTGCTTCA GCAGTATTCT 5580 

TATAACGGTA GGTAACATCT TGTAATTGTA ATAAAGTCAT GATTTCTCCT TCTTAACTAA 5640 

TAGATGATAA AATTTCTTTC GGTGATTTTC TAAATAAGAA TAGGAAACAA AGGGCTACAG 5700 

ATAAGCAACT AAGCAGAACT AGAAAAACAT AGGATTCTGC AAAAGATAAG ATGCTAGTTG 5760 

ATAAACTGCT TGCTTTGGCT AGTGTATCTT GTAAGCTTGC CTGATCTCCA CTTGCTAGTA 5B20 

GAGTTTGGAG TAGGTAAGTT GTGATTGCGT TTCCTGCAAC AAATGCTGGA ACCAAAGCTC 5880 

CAAGAGATAC CAAAACTACC TCTAAACAGA ATTGTAGGAA GATCGAGCTC TTGCCTTTTC 5940 

CAAGTGCAAG TAAAATCCCC ACTTCATAGA CCCGTTCTCT CAACCAGAGA GACAAAACCA 6000 

GAATTAAGGC TCCAGCTCCT GCTATCAACA TCCCATAAAG GAAGATGGTC AGGAAGGTTT 6060 

GGAAAGTTGC AACTGAGTCT TTGATTTGTT CAAAAGCCTT GTTTTCCTTT TCGACTTGGT 6120 

AGCCTTGATT TTCCAAGGCC AAGTTTTCTA CCTGCTTCAT GAGTCCGTCC ATTTCCTTAG 6180 

GATTTTCTAC ATAGAAGCGT GCTGCACTGA CTTGAGCTTC ACTATTGCCC AAAAGGGTTT 6240 

GGCTACTTTC ATAGTCTCTA AAGACTTGAT TTTCACTGAA GTCAGAAGAC AAGCCTGTGA 6300 

ATTTCTCTTG TTTTTTACCA GAAAAGATGC CGATAATCTC AAACTCTACT GTTTGTCCTT 6360 

TTCCAGATTC AGACTGACCA GCATCCAAGC CAATCTTGTC ATGAAGCGAA AGACCGTTCT 6420 

TCTTAGCCAA TTCTTCGTGG ATAAGGATTT TCTTGGAATC CCCTTTTTGA AGGTGTCGCC 6480 

CTTCTTTTAC ATTGAAAGCC GAACTGGTAA AGGTTACATC CTTGGATGAA TCCTCAAGAG 6540 

CCGTTAAGCT AACCAAGTTA TTGTCTGCAG CTGATAAATC ATCACGCTCC ACGCTCTGCT 6600 

CGCCAGTCAC TGCTTCCTTG TCTTTTAGTT TTGCGACCGT CTCAAGTTCA GGAGAGACAT 6660 

TTTCCAGCCC CTTAATCTTG CTTACAGATG CTAGGTCTGA CAACTTGAAT GTCTGACCAT 6720 

TCTCTATCTT CTTAATAGAA AAAGATGTAT TGAGTGATTT ATAAAGATTG CTTTCTACX3 67 BO 

TTTTGTTGGA CTTCATCAGA GTCAAACAGG CTGAAATTCC GGCCAATAAG ACCAATAAAA 6S40 

TCAGAAATAA AATAAAACTT CTCAGTCGCT TTCTGCTGAC ATAAGCCCAA GATCTTTGGA 6900 

TTGGATTCAT TTGTCACCTC CATATTTGTA AGACTATTAT AAAACCCAAA TATGAAATAT 6960 

TTATGAAATA CGAAAAAAAA ATATCGAGTA GGGGATAATC TCTAGCCCCT CTCACACCAC 7020 

CATACGTGCC GTTCGGCATA CGGCGGTTCA ACTAACTTTT AACGCATCTC GTTCAAGGTA 7080 

ATAATCCAAA CACGAAACCA GTCCACGTTT TTCAAGGACT GGTTTTGATA TAGCACGTTT 7140 

AAGTACCGAC TTCTGAGCTA CTATAGTAGA TTGAAACTAG AATAGTACAC CTCTACTTCT 7200 
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AAAATATTGT TAGAAATCGA TTTGACTGTC CTGAACAATT CGTCCTATTC TTATTTCATT 
TTACTATAAT TGATAGTGCT CGCCCCAGCC AGATACCTTA TCTGCTATCC ATTTAGGAAC 
CCCTAACTTA AGCAATCCCC ATAATCGTCT CGATTTCTTC TTCCATTGCT TCCAGATAAT 
CACTCGTAGG CGAGTACCCA AGCGCTCATC TATGCTAGTG ACTATACTTT TCATATTTAT 
AATTCATTCC TTTCGTTTCA CTCAACCCAC AACACAGAAT GAAAAACTGT TGTGATCTTT 
ATTTTGTTTT ATAATAATAG TGAGAAAACC TATCACTACT ACAAATCACG GGGAGGTGAA 
TAAGTGAGTG GTACAGCCAC TACCTCGCAT ATTTTGTCAC ATCATTTAAC GCTACATAAT 
AAGTTGTACC ATCTGAATAA GTTGCTACAA TATCATTTGC ATGCTCTCCT TCACCTTTAG 
CAAAGGTTGG AGCTCCTGCT GGATGATTTT TATTTGCCTC TTTCAATTTT TCAATAATGG 
CATTTTTTCT GTATCTTTTA TATTATCAGG ATTTTTCACT AAGATTTTGT CTGGATATGT 



CGGTTTAGCA GAAACAATTT TTACTGTTAC TTCTTTTTTA 
TCCAGCATTA TCTTTAGCAT TTAATTTTAC AGTAATTCCT 



TTCGAAGCAC TTGTCCAGTT 
GAACTAGGAA CTTCAGTAGC 
AGGTTGATTA TCAACATTAT TCAACTTTAA TTTCAAAAGA GCTGTTGCAT CAGACGTTTT 



ATCAATCGTT ATATATAATG ATGAATTGTT ATTATAAACA 
TTGTGAGCTA CTTGAAACAG AACTGAAATT ATACCCACTA 
GCTTACGTCT AAATGAACTT CCCCACTATT ATTTGGCTTA 
ATAACATAAA ATTTGCATAA ATAGATTAGG GAAATCAAAG 
GCAGTCACAG TGTACTTTCC CAGCATCAAG CCACTATAAC 
AGATGGCAAT CCTCTTCTCC AAATATTAAC TTCTTTACAA 



GTTCCTTCAT ATTTAGCTGT 
CCTCCCTGAT TATCTTCAAT 
GCAACAACTG TTATAGTAAA 
CAGCTTCTAG GAATGTTTTA 
TCTGCACATA AAAATGGAGA 
ACCAACTATA GTTGACAAAG 
AACCTAAAAT CAATTGATAA CACAAGGTCA GGTCGGTCAA CTCTTTCAAC TGAAGCCCTG 
TCAACTCTTC CCATTTATCA ATCTTGTATT GGAGAGAATT GCGGTGCAGA TAGAGTTGCT 
GGGCTGTTTT AGTGAGAACA GCACTATTTT CCCAAAGAGA GAGAATGATT TCCTGAATCT 
GATCTTGATC CAAAATCATC TGGTGTAGAC ATTCCTTGAT TGGCTTCAAG TCCACGAGTC 



TTTCTCCCAT ACTCCAAAGA TAGAGCTGAG AAAAAGTATG 
GCCACCATGT CTTGAACAAA TCCCGCTCAG CTTTGATTAA 
CCGTCTGAGA CCAAACCTGA CCCAACATGA TAGAAAGACG 
CCGCTTCAAT CGTATCACTT AAAATATCTC TTACAGAAGT 
CGAAAACATA ATCCTGAGCT CCGACCTGTA GCACTGTCTG 
GCATCATATC TAGCCAAGAA GCCAGATTTT CCTGCTGAAA 



AACACCTTGG TGACCCTGAC 
GTCTGATAGG GCTTGATGTC 
AAGTCCAAAG TCATACTCAA 
GTATTTGTCT TGTTGAAGCA 
ACAATTCGGA AAAAGAGTCC 
ATAAGAAAGA TGGCAATAAA 
CCAACTGAAT CTTTTTAAAA ACTTGCGGTG CCTGTCCCTT GCCCTCAACC AGATAGGAAT 
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ACCAAGGGTT TAGCGAACGA GCCTGCTCCT GCTGGGTCAA AACGCCAACC AACTGCTTTT 9060 

CACGCTCGCT GAGCCCAGCT TCCTCCAGCA AAATCCACTG CTGACAAGCT AAAGGGAGCG 9120 

TGAGATAGCC CTCTTTCTCT ACTGGTTGGT CTGAAATCCG AGCCTCAGGA AACCAGTCTT 9180 

GTAGTTCTTT TGCCCTCATG TTCTAGCCCT CCACTTTTTG GATGCACCAT GAAACCAAAC 9240 

TCTCAAGACG TTCCAGATTC TCAGTCATAT GGAGATAGCC CATAACCGCT TCAAATCCCG 9300 

TGGACATACG ATAAGTCACG ACATCTGCAT TTTTAGCCTT TGTGTGGCTA TTGGTATTGC 9360 

GGCCACGTTT GTAGATTTCT TCTTCTTTTT CCGTTAGGAC CTGCTCCTCC AACATGAGAG 9420 

CAATCAGGCG AGCCTGAGCC TTGGCTGACA CGTACTTAGT TGCTTCTTGA TGGAGTTTAT 9480 

TGGGTTTGGT CATACCTTTG AGGATGAGGT GACGGCGAAT ATACATAGAA TACACCGCAT 9540 

CCCCCTCAAA GGCTAGCGCA ATCCCCTTAA TGAGATTGAC ATCAATCACG TGTCCACCTC 9600 

ACTCCATCCT TGGTATCAAC GAGCTTAATT CCTTGAGTAA CCAATTGGTC ACGGATTTGG 9660 

TCTGCTGTCG CAAAGTCACG ATTGGCACGC GCCTCTTGGC GTTTTTGAAT CAAGTCTTCA 9720 

ATCTCTGCAT CCAAAACTTC CTCAACAAAG ACAATTCCAA AAATTTCTAA CATATCTGCA 9780 

AGAGCTTGCT TGACACTTGC ATCATAGTTC CCTGAGTTGA TCCATTTGGC CATTTCAAAG 9840 

ACAACTGTGA TACCGTTGGC AGCATTAAAA TCTTCATCCA TAGCTGCTAC AAACTTATCT 9900 

TTAAAGTTTT GTAACTCTTG GGCATCCACA TTTCCTGTAA ATGGTTGTTC GTAAGTATTC 9960 

TTGAGATACT TGAGATTGGT CTCGGCATCC CGAACTGCCT TTTCCGTGAA GTTGATAGGC 10020 

TTACGGTAGT GCTGGGTCGC AAAGAAGAAA CGAAGTACTT GCCCATCAAG ACTTTTAAGG 10080 

GCATCGTGTA CCGTAATGAA GTTACCCAAG GACTTAGACA TTTTGACATT GTCGATATTG 10140 

ACAAAGCCAT TGTGCATCCA GTAGTTAGCA AAAGCCTTGC CTGTTTTAGC TTCAGACTGG 10200 

GCAATTTCAT TGGTGTGGTG TGGAAACTCT AGGTCAGCTC CACCACCGTG GATATCAATG 10260 

GTATCACCTA AAATCTCTGT CGACATGACT GAACACTCAA TATGCCAACC CGGACGTCCA 10320 

GGTCCCCAAG GACTATCCCA AGAAATCTCA CCTGGTTTGG AAGATTTCCA TAGAGCAAAG 10380 

TCTACAGGAT TTTCCTTACG AGCCGTTTCT TCATCGGTAC GACCTGAAGC ACCTAGCTCC 10440 

AAATCTTCCA AGGTTTTATT AGCCAATTTA GCATAGTTGT GGGATTTTTC TACACGGAAA 10500 

TAGACATCCC CTTGACTCTC ATAGGCAAAG CCTTTCTCGA TCAAGTCTTC CACAAAACGG 10560 

ATGATGTCTG CCATAAACTC CACTACACGC GGATGGCGAG TCGCAGGTTT CACGCCCAAT 10620 

GCCGTCACAT CCTCACGAAA GGCAGCGATG TACTTATCCG CAACCTCCTG AGGCGTGATA 10680 

CCTTCTTCCC TGGCACGGTT GATAATCTTA TCATCCACAT CTGTAAAATT GGAAATATAG 10740 
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GCAACCTTAT 


ACCCACGGTA 


CTCAAAATAG 


CGACGAATCG 


TATCAAAAGC 


TACCGTCGAA 


10800 
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CTACGTGGAT 


ATAGTTGTAC 
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11160 


ACGAGGGATT 
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GTTCAGGTTC 
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AAACCGTAGA 


GGTAATTCGT 


ATTAGTCGAT 
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AATAACCTGA 


11280 


CCAGCTTTTA 


ATAAAATATA 


TAGTCGGCTT 


TCTGGATCTT 
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AGCGACATAT 


11340 


CTATCCACAA 


CTTCTCTCGA 


TTCATGTTCC 


TCTGAAAATG 


CCTGAAATTT 


TAATTGACTA 


11400 


ATTTGATCCT 


GATACGAACT 


ATCTGCTAAC 


AAAACTTCAA 


GATGGGAAAC 


ATTTGCTAAC 


11460 


GGATAAGGTC 


TTCTATCCTT 


ACCTAACCAA 


GTTTCTGTCT 
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11520 


CAGTTACTGG 


CAAAGTCAGG 


ATGATTCTCT 


AAAAAAATAC 
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GACCGAATGG 


GGAAAGAAGC 


TGTTTCTCTC 
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CCCTGACGGC 


GATGACCTGG 


ATGAACCAGT 


ATCGTCACTT 


CTACATCTTG 


GTCATCTGCA 
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TAGACAGTTA 


ATAAACCAAC 


AAGTTCGCCT 


TTTTCATAAT 


AAAGGAAAAA 


GGCGGGCATG 


11760 


TTTGGGTCAA 


AATTAAGCAT 


GTTAGAGAGA 


TAGGGATCGC 


GATAGGTACC 


GTCATAGTTT 


11820 


TGGCAACAGT 


TAATTACTTT 


TTTCGCCTCA 


GATAGCTCCT 


CTTGGCTTAA 


CTTGTTTCTT 


11880 


GCTTGAATCA 


TATAGGTATC 


CTCTACAAAC 


CAGACGATCT 


GTGACTGGCA 


TCTTTAGCCT 


11940 


GCTCGAGTTT 


ATTGACATAA 


TACTCTCGTT 


TTTCTTCGAC 


TTCGTGAATG 


ACAGGCTCAT 
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CTTTCTTACC 
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ACAATCTTGG 
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GTACATCTGC 
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TGCCACAGTC 


TTTCCCTGTT 


CCCCCGAGAG 


TCACTCCGTG 


ATAGAGAAGA 


acoctttttt 


12240 


CAACAATCGC 


TGTCTCTCCA 


ATCACCAGAC 


CAGAACCATG 


GTCAATAAAA 


ACACCTGAAT 


12300 


CAATCTGGGC TCCTGGATGA 


ATCTCAATCT 


GAGTCCAAAA 


GCGCCAAAAC 


TGACTGTACA 
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TACGAGCTAA 


TAGTTTGAAG 


CCGTGCTTCC 


AGAGAAAATG 


CGAGAGACGG 


TGGGCCGCCA 


12420 


AGGCCTTGAC 


ACCTGGATAA 


GTCAGCAAAA 


CCTCCAAAGT 


GGTGCGGGCC 


GCTGGATCAT 


12460 


TTTCTTTTAC 


AATATCAATG 


GTTTCGCGCC 


ACCACCCCAT 


ACATTTCTCC 


TTTTCTTATT 


12540 
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TTTCATCGAC 
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CCTTGTCTTC 


TGTAATTAAA 


CGTCGCACTT 


CTGCGTGTTC 


CATTTGTTCC 


AAGATTTCAG 


13860 


CCACATCACG 


CATAATACGG 


TCAAATTCTT 


CGTGGTCCGC 


ATATTTTTCT 


TCGTAAACGG 


13920 


CAGTCACTTG 


GTCTTTCACT 


ACTTGAGTCG 


CAGCTTCACG 
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TCTTCTACTT 


13980 


GAACTGCCTT 


TTGGAGGTCA 


CTGTTGTAGG 


CTGCAATGAT 


TTCAGCTTGC 


AATTCAGCAT 


14040 


CCACGTGAAG 


CAATTCCACT 


TCTGCTTTTT 


CTTTACCGAC 


AGCAGCAACG 


ATTTCTTCTT 


14100 


GGAAGGCAAT 


CAATTCTTTG 


ACAGCTTCGT 


GCCCTTTAAG 


GAGCGCTTCC 


AACATGATTT 
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CTTCTGACAA 
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CATCTACATA TCCCACTTGT ACCCCACCAA TTGGTCCGTC AAATGGAATA TCTGAAATAG 
ACAGTGCCAA AGATGAACCA AACATAGCAG CCATTGGTGC AGATGCATTT TCATCATAAG 
AAAGCACTGT ATTGATGACT TGGACTTCAT TACGGAAACC TTCCGCAAAC ATAGGACGAA 
TCGGACGGTC AATCAAACGC GCTGTCAAGG TCGCATCTGT TGAAGGACGT CCTTCACGTT 
TCATAAAGCC ACCAGGAAAC TTCCCAGCCG CATACATTTT TTCTTCGTAG TTGACTTGGA 
GTGGGAAGAA ATCCCCAGTT GCCATTTTCT TAGACATAAC GGCAGCAGTC AAGACAGTTG 
ACTCACCGTA ACGTACGACA ACAGATCCAT TTGCTTGCTT AGCAACCTGA CCAGTCTCTA 
CAATTAACTC ACGACCCGCA AAAGTCGTTT GAAACACTTG TTTTGCCATT TTAATCCCCT 
TTGGATTGAT GAAATTATAC GCCTTGCCTA CAAAGATCAA GATACCAAGG ACGTCAAAAG 
CAAAGTAAAA ATAGGAAACT GACGAAGTCT TCGATGAAGA CAAGACAGTT TATCTTTTTT 
ACACAGCTTT TCGGCCGTCT TCAATTACAC AAGATATTTT GGACGGTTCG GCTTGCCGAA 
CATTTCTGTA GAAAAATAGG AAGGTGACGT CGCACTCGAC GAGTGCTAGG AAGCTTATCT 
TTTTTCCTAA GAAATGAGAC CAAAATTCAA GTCATCAAGA TACCAACCCG TCAAGCAACT 
CAAAGGAAGA TAGGAAATCG AACGACGGAG CGACTACTCC TAGGGAGATT TATCTTTTTC 
CACAGAGTTG TAGGCAAGTT CAGTTTTCAA GATACATCAT TAGAAAGGTT TAATACTAAA 
GTATCTAAAG CTTTCACGCT AATCGCTATC GGGCGATTAG CTAAATGCTT TACTAACTCT 
CTCGTCAAAT AACATCGATT TGACTCACTC GTGTCGTTAA ATCTTACAGT TTAAATGCAT 
TGTATTATTT AATACCTTCA TCTTTGTATC AAGTACGTAC AGAATTTATT TTATCATATT 
TTTCTTAAAA AGTGAGGTCT TTACCATTAA AAAGGAACCA TTCCCCTCAC CTGAGAAGAA 
TGGTTTGCTT TTATTATCCT AGAGACTGCT GATTAAACAA GGCATGGGTT GCTTGATGGA 
TGTATTTTGC TGTATCAGCA TTATTCATCG TATAGAGATG CACACCGGCA ACATCCTGAG 
TTACCAAGTC CACGATTTGG TCCACTGCAT AGGCAAGTCC TGCTGCTCTG AGCCACTCAG 
GGTCATGCTC ATACTTGTCT AAGATGGCTT TAAATTTGCG TGGAAGATGG ATATTCTCAC 
AAGTCTTCAA GAGTCGGAGA GCCTGATTTC GATTCAGAAT TGGCATAATT CCTGCATGAA 
TGGGAACATC AATCCCAGCC AAGATACACT TGTCCTGAAA ATCATAGAAG CGCTCATTGT 
CAAAGAAGAG CTGAGTTACG AGGCTCGAAC AGCCTGCATC CACTTTCTTC TTAAGATTTT 
GAATATCTGA AATCTGATTT GGCGAATCTG GATCCCCTTC TGGATAGCAA GCTCCAATAA 
TATCAAAGTG AGGGGTTTGT TCCTTGATAA ACTCAATCAA CTCGGTTGCA TAGCGGAAAT 
CCTTTTGTGG TTCCACGTCT GGAATAATAT CCCCACGAAG AGCCAAGATT TTCTGCACCC 
CAACTTTGTC CAAGTCAGCA ATAGTTTCAG CAACCTTGTC CTTAGTTAGA TAAATAGCTG 
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GCAAGTGGGC AATGGTCGGA ATCGCCAAAT CATTTTGGAT AAAGTCAGCC AAACGAACCG 16140 

TCGTTTCCTT GATATTAAAT TTATTATTGC TCGCAGTTAC ACTGATAAAA TGGGGAGCCA 16200 

ACTCCTGCAT ATCCTGCAAG GCTGAAATAA TGTTATCATT ACCCACGGCT GGGTTTGGAG 16260 

GGAACACTTC AAATGAGAGT GACGGTGTTT GGCGTGACAT ATGTAATAAC CTTTTCTAGT 16320 

TGATTTCTTT TTGAACAACC ACTGTATGGA GAGAAATCCA ATCTTACAAT TTCTCACGCG 16380 

CAGCTTTAGC TGCTTCAACA AGGCGGATCA AGCTTTCTTT TGTTTCTGGG ATACCACGTG 16440 

rrrrcAAACC acagtcaggg ttgatccaaa ctttct tg ct tggcacttta gcaaggatgg lesoo 

CTTCGATTGT GTTGTCGATT TCGCCTTCAT TTGGTACACG AGGTGAGTGG ATATCGTAAA 16560 

CCCCAGCTCC CACTTCTGTT TGGAAGTTTT TCGCTTTGAG TTCGTCCAAG ATTTCAAGGT 16620 

TTGAACGGTT AGCTTCAAAG GAAATAACGT CTGCATCCAT GTTATCGATA GCTGGGATGA 16680 

TATCTGTAAA TTCTGAGTAA CACATGTGAG TGTGGATTTG TGTGTCTGGC GCTACTGTTG 16740 

AGTGTACCAA GCGGAAGGCA GGAATAGCCC AGTCAAGGTA GTCTTCGTAC CAGTCGCTAC 16800 

GGCGGAGTGG CAATTTTTCA CGAAGAGCAG CCTCGTCGAT TTGGATGATT TTCACACCAG 16860 

CAGCTTCAAG GTCAAGTACT TCATCCTTGA TAGCAAGGGC GATTTGGAGA GTTGAATCCT 16920 

TGATAGAGAT GTCTTCACGT GGGAATGACC AGTTAAGCAT GGTAACAGGT CCAGTCAACA 16980 

TACCTTTAAC AGGTTTGTTT GTACGACTTT GTGCATAGCT AGACCATTTA ACAGTGATAG 17040 

GCTTAAGACG AGTGACATCA CCCCAGATGA TTGGTGGTTT TACCCCACGC ATACCGTATG 17100 

ATTGTACCCA TCCATTTTTA GAGAAGAGGT ATCCTGACAA GTTTTGACCG AAGTACTCAA 17160 

CCATGTCATT ACGCTCAAAT TCACCGTGAA CAAGGACATC AAAGTCAATA TCTTCTTGCC 17220 

ACTTGATCCA TTCGTCAATC GTTTCAGCAA GGAAAGCGTC GTACTCTTTT TGAGACAATT 172B0 

CACCTTTACG GTAAGCCAAA CGTTTGGCAC GAACTTCTTT TGTTTGAGGG AATGAACCAA 17340 

TCGTTGTTGT TGGAAGAGCT GGAAGTTTGA AAGCTTCTTC TTGGATAGCT TCACGTTCTG 17400 

CAAAGGCTGG CAAACGAGTG TAGTCTGCGT CTGTCAAGCC AGCGATACGC GCACGAAGTT 17460 

CAGCATTTTC ACCAACACGC TCAGTCGCAA AGAGTTCTTT GTTGGCTGCA AGAGCTTCTG 17520 

AACCTTGACC ATTTCGGATA GCATCCAAGT CACGGATTTC ATCCAATTTT TCAACTCCAA 17580 

AGGCAAAGTG GTTCAAGAGT GCTGGTTCAA ATTCTTCATT AGCAGTTGTA AATGGCACAT 17640 

GAAGAAGTGA GCAAGAGCTT GTCAAAACGA TGTTTTCAGC TGGAATTTGC TCAAGAACAG 17700 

CCAAGCTCTT TTCGTAGTTG TTGCGCCAGA TGTTTTTACC ATTGACAATA CCTACATAGA 17760 

GAGTCTTGTC AGCTGGGAAG CCACCTTTAA CGAGTTCAAG ACTTTTCTTA CCTTCAACAA 17820 
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AGTCAAGACC 


GATAGCATCT 


ACTGGTAAGT 


TTACAAGGTC 


AGCGTATACG 


TCACGAACAT 


17880 


CACCGAAATA 


AGTTTGAAGC 


AAGACTTCAA 


GACCTTTTTT 


GTCACCCAAG 


AGTTTGTTGT 


17940 


AAAGGTTCAA 


GAAGAGAGCT 


TTTTCTTCAG 


CTGTCAAGTC 


TTTTACAAGA 


GCCGCTTCAT 


18000 


CCAATTGCAT GCGAGTCGCA CCAAGTTCAG 


CCAATTTAGC 


AAAAACTTCT 


TGGTAAGCAG 


18060 


CCACTAAGCT 


ATCTACGAAG 


TCGTCTGCTT 


TCACGCcrrc 


TTCAAAGTCT 


GACAATTGAA 


18120 


GGAAAGTGAA 


GGGACCTACA 


AGAACAGGAC 


GAGTGTTCAA 


TCCAAGTTCT 


TTGGCTTCTT 


18180 


GGAACTCATC 


GAAAATCTTG 


TGACCAGCCA 


ATTTTACTTG 


AGTGTCTTTT 


TCAAATTTAG 


18240 


GAACGATGTA 


GTGGTAGTTA 


GTGTTGAACC 


ATTTCTTCAT 


TGGAAGGGCG 


CGAACGTCCC 


18300 


CTTTTTCTCC 


CTGGTAACCA 


CGTCCCAAAG 


CGAAGTAGCG 


CTCAAGGTCA 


GACAAGTCCA 


18360 


AGTTTTGAAC 


GGATGCAGGC 


ACCACGTTGA 


AAAGGAAAGC 


CGCATCTAGG 


AAGTTATCAT 


18420 


AGTGAGAAAA 


GTCATTTGAT 


GGAATTTCAG 


TGATGCCTTT 


TTCTTTGACA 


ATGTTCCAGT 


18480 


GTTTAGCACG 


CAAGTCTTTT 


GCTGCTGCTA 


AAAGTTCTTC 


TTCTGAGATT 


TCTTTTCTAA 


18540 


AGTATTTTTC 


AGTTGTAAAT 


TTTAATTCAC 


GGAATTCGCC 


CAAACGAGGG 


AAACCGATGA 


18600 


TTGTAGTTGA 


CATGATGTGT 


CCTCCAAAAT 


TTGTTGTTGA 


AACTATCTTA 


ACAGAAAAGA 


18660 


AAGCGTCTGT 


ATAATTGTAA 


AAAATTAGGG 


TTTGATATAG 


TTTGAAACTA 


TATATCTGTT 


18720 


TCGGACAAAA 


GAAAAAGACT 


TGAAGCAAAC 


GTCTCAAATC 


CTTTGTAATT 


CTTACTTTAC 


18780 


AGCTATATTC 


CAATTAGAAT 


ACTAAAACAT 


GTTATPAGTA 


ATTCTTATAA 


GTGACTATGA 


18840 


CCTGTTATTA 


GAAAAGACTA 


TAACTGATTC 


TACTCAACTT 


TTTCCCTGTT 


CAAGTGGGAC 


18900 


GATTGCTAGT 


GTCTTTCCTA 


AACTGGCTAG 


GACTTTTAAG 


ACTGTATCCA 


ACTGAGGACT 


18960 


AGTCTTTCCT 


GTCTCCATCC 


TAGCTATGAC 


AGGCTGGCTT 


ATTCCACTGA 


CTTCTTCCAG 


19020 


CTTTTTCTGA 


CTGATTCCTT 


GTTCATACCT 


AGCCTCAATC 


AACTCGCTCA 


TGATAGCCAC 


19080 


TCGCATATCA 


CTTTCAAGGA 


TTTCCTCCTT 


GCTAAAGAGC 


TCAGATGGAC 


ATCCTTCCAA 


19140 


TTACTCCCAA 


TAGCACTATT 


CTTCATCACT 


TAACCCTCTT 


TTTTTTACGT 


CTATGTATTT 


19200 


TTAAAAAAAT 


GAGCGAATTA 


TGATTCGATA 


GATTGACCAG 


TGGGTTTAAA 


GTTGGTGCTA 


19260 


GCCTATTTCT 


TAAGCGATTT 


TCCTTTTCTA 


GGATAAAGCA 


GTTCCTGCTT 


GCTTAACCCC 


19320 


AATTTTCCAC 


GATGAATCCA 


ATAGTAAATG 


GTTGAAATTC 


CCACGTTAAC 


CCCTTTAGCC 


19380 


ATCACCATCA 


TTTCAGGCGA 


AAATTTTTGG 


TTATGTTTTT 


GGTTATGTAT 


AGTGGAGAAT 


19440 


CTTTTCCTTT 


AurnmAA 


GACTGTTGAG 


CGTAGTCGGC 


AGAATAAATC 


TCTTTGAAGC 


19500 


GCCCTTTTCC 


AAGACATTGT 


CGGACTGTCC 


CACGCTTGAT 


TTCAGTGTGG 


ATAGTTTGAG 


19S60 


GAGCTTTTCC 


AAGTAGAGAG 


GCAATTTCTC 


TATTTGATTT 


TCCTTCTTTT 


TTCCATCGTT 


19620 
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CGATTAAGCG 
TTTCTGTGCC 
AATCTATGAG 
CTCCGAGCAA 
CTTTCCGTCA 
CTTAACATTT 
ACTGCAACTA 
AATAAACAAA 
TTCGGAACAG 
ACTTGGAACG 
GACGCCGATA 
CAGAGAATCA 
ACTTCTCCAC 
CGTTTCTATA 
TCGAGAAAAC 
TTAATGGAAA 
CCAAAGGCTA 
ATAAGCTCTC 
AGCCAAAGTC 
TACTATGTTA 



ACGGCTATCG ATTGTCAAAT 



TTTTAATCAT TTCAATCTTA 
GAAGACAAGA AAAAGAATAT 
CCATTGCAAA TTGAGGTACT 
TTCTCTTTTC TATCCCTATC 
GGCTATCTAC CCAACCTATC 
TGGCTGCTAT TTTAAGTTTC 
TACGGTTTTA CTCTGGCATT 
ATAAAACCCT TTCTTCTGCA 
TCGCTAATCA AATAGAAGCA 
TGGCTATATT CCCTGAACTA 
AACTATTGTT TCATCAAGTT 
CTACCAATAG TGGAATAGCT 
CAGAAGCTAA AACTTTTCAT 
AAAATATACC AGATATCATT 
TCTGGAAGCA AGACTTAAAC 
TTATTGCAGG TGATTTTAAT 
ATAGGGACGC ATTAAATGCA 
CAAAACTTTT TAATGCAACA 



AAGATTTAGA CATTGTAACT 



GTTTGCCTTT 
AATTGGACTT 
CAATCAAGTA 
CACACAATGA 
ATAACTTATT 
TTGGCACAGA 
TTATTTTTTT 
TTGCTCTTAC 
TCAAATAAGA 
CAACATATTG 
GCTACCAATA 
GGACTTTCTA 
CCTGTGACTG 
ACAACACGGT 
GCCTTGCATA 
ATCATTCATA 
GCAACTATGC 
CTGCCACCTT 
ATAGATCATA 
TTTCAAAACT 



TGTAGTATAA 
TTTTTACTTG 
AAGTCACAAA 
TTAAAACATT 
CTTTTTTCCC 
TTTATGCCTT 
TCCTATCTTT 
TATCGCTCAT 
CTAAAACCTT 
AGCGAATTTT 
TCAGAGGTGA 
TGGCCAACTA 
TGATTGTCAA 
TCGGGACAAT 
CTGCGCCTCC 
ATCAATTGGC 
GTCATGCAGC 
TTGAAAGAGG 
TTTTATTGCC 
CTGATCATAG 



TTGTCTTGCA 
GGTTGTACTT 
GTCACATTAC 
TCTCTCTGCC 
ATCTTCTAAT 
CCCCTTAGCT 
TTACAAGAAA 
ATT ACT ATT A 
AAAATTAGTA 
TAGCCATTTT 
GCAAGAAAAC 
TGATATTTTC 
GAAAAGTTAT 
TGTATTACAT 
TCTGCCAGGT 
TTCAAAATAT 
ACTTGCAAAA 
AACTTGGAAT 
TAAAAACCAC 
ATGTATTTTT 



ACAGAAATCA CATTTTAATT ATTTTATATA AAATCACCCC 



GGGGGAATTT 

ACGCTTGGAT 

TTTGGAAGGT 

AGACGTCAAA 

CAACTTCTTG 

GGAATTTGGT 

GGTTAAGCTC 

CTTGGCCAAG 



GTATCCTACT ATCGTTTAAC GCACTTCTGC 
TTTTTCCATA TAGCGTGCGA CTTCTTCGTC 



CAAGCTATAA 
GAGTTTGATA 
GTGAGTCACT 
GATTTCCACA 
AGCTACATAC 
GAAACCAAGA 



GCCATTGACT 
TCTGTCAAAC 
TCTGCCTTGA 
AATGGAACAG 
GTTTCTGGAA 
ACTTGGTCAC 



TCATACCAAG 
GTTTCACGCC 
GGAGAAGGGC 
CAGGTTGGAG 
TATCGTAAGC 
CGAGTGAAAT 



TCTAATGTTC 

ATTGACTTTT 

CGTTAAGCTG 

TCCCAGTTTT 

GGCAGCTTGG 

AACGTCACGG 

CGCCCCTTCG 

CTTGGCAGTG 

CACGGCTGTA 



ATAAACTAGA 

TCTTCGAGAG 

TCTTCTGGAT 

TCACCTGAGA 

ATAGCATCTA 

CTGACTGCTG 

ATGGCTGAAA 

ACTGGATGCA 

CGACCTGGAT 



19680 

19740 

19800 

19860 

19920 

19980 

20040 

20100 

20160 

20220 

20280 

20340 

20400 

20460 

20520 

20580 

20640 

20700 

20760 

20820 

20880 

20940 

21000 

21060 

21120 

21180 

21240 

21300 

21360 
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GAAGGCTAAC 


GATTTCAGAT 


GTTGCTGTAT 


AGGTTACTTG 


GAGTCCCAAA 


CGAGTAAATA 


21420 


GGGCTTCAAG 


GATTCCCTTA 


GCATAGAAGA 


AATCAACTGG 


AACTGCTGCT 


GTTTGGAAAT 


21480 


CTTTTTCACC 


AACCAAGCCT 


GTCAAGGCAA 


AGGCAAAGCT 


GTTGATCTCA 


TTTGGAAGTT 


21540 


CTTCTTTTGG 


ATTACCTGTT 


TGTTCAAAGA 


CTTTTCCAAT 


CTCATAAAGG 


GCCAAGTTTT 


21600 


TATTCTTACG 


AGCCACGTTG 


TAGGCAACGG 


TATCAAGGAT 


CCCTGAAATC 


ATATTTTGAC 

* » * O A A m m Mil 


21660 


GGAGGACTGA 


ACGATCCACA 


GTCATTGGCC 


ACATGAGTTC 


AGTAAGGTTA 


CTTf5GTTf5Af» 


21720 


CTGTGAACTC 


AACTGCTTTT 


TCAGGAGTTG 


TCAGAGCATA 


GGTGATGATT 


TCTGTCAAAC 


21780 


CTGCTCCTTC 


AGCAATGGTA 


CGAACTTGAC 


GGCGGAGTTT 


TTGTATCACA 


GTCAATTCAC 




CACCTGTACC 


ATCGTCTTTT 


GGAAGGCTGG 


TTGGCAAGCG 


GTCATATCCA 


TAGATACCAfi 




CGATTTCTTC 


AAAGAGATCA 


GCTTCGATTG 


TGATATCCCA 


ACGACGACGT 


V3VJ x nww\> X \3f\ 


21 Qsn 

A470U 


CTGTAAAGCT 


GTCTGCATTT 


CCAGAAAGAC 


CAAAGCCAAG 


ACGACGGAAG 




2202ft 


CATCAGCATA 


AGACAGCTCA 


GTTCCGAGGA 


CACGGTTAAC 


ATCAGCAAGG 


RTTflAAfiAAA 




CTTCCACATC 


AGAGGTATCA 


AGCTCACCCG 


CTGAAACGAT 


ACCCTTXCGC 




22 1 Aft 


CTGCAAGCTC 


TGCAATCATG 


CTAGCTGCCG 


CATCAAGGGC 


TTCATTAAPT 






TAATTCCTTT 


TTCAAAGCGA 


GAAGATGACT 


CAGAACGAAG 


VIA WWUViUn 






TACGGATAGA 


TTTGCCATTA 


AAAACAGCAG 


CTTCAAGGAT 


AACACGACTA 


vHl 111 1 vAll 


22 12n 


AAATTTCTGT 


AGCCTGACCA 


CCCATAACAC 


CGGCAAGGGC 








TAATCACGAG 


GTCTGTCTCA 


GCCAAGTCTC 


GTTCTTCACC 




1 #w\ 11111 


A ****** v 


CACCATCACG 


CGCTTCACGC 


ACACGGATGT 


A V*. V*> * 4 


AAATGTGTCC 




46 JUU 


CATGCATAGG 


TTGACCAAAG 


TAGAGCAGGA 


4W4fnJ 4 * 4- \# * 


CACGTCTACA 


ACCTTATTfiA 


22*1 firt 
« A J o w 


TGGGACGGAT 


GCCTTCGTTC 


ATGAGAAGGT 


TTTGCAACCA 




GGTGfGATAG 


22fi2ft 


TCACATTGTC 


CAAGATACGA 


GCTGCATAGT 


AAGGCGCCTT 




ATGCTGACAH 


22 fiftO 

44DOU 


AAAGGGCATC 


TGCCGCAGCT 


TCATTAGTTT 


CTGTTAGAGT 


AAATT1T1TA 


AAGTTGACTG 


22740 


CCTTGTCATA 


GATGGCTGCC 


ACTTCGTGAG 


CCACTCCACA 


CATAGAAAGG 


GCATCTGCAC 


22800 


GGTTTGGTGT 


GATGGAAAGT 


TCGATGATTT 


CATCATCCAA 


GTCTAGGTAA 


GAAAAGACTT 


22860 


CCTCACCTGG 


CACGGCATCT 


TCAGGCAAGA 


TTTGGATGCC 


ATCTGCGAAT 


TCCTTAGGCA 


22920 


CAACTGAGTC 


AGAAATTCCC 


AATTCACCAA 


GTGAACAGAT 


CATTCCAAGT 


GACTCCAAAC 


22980 


CACGGATTTT 


tccttttttu 


ATTTTCTAGT TATCAGCGAT 


ACGAGCTCCT 


GGAAGAGCCA 


23040 


CCATGACCTT 


GATCCCAGCA 


CGCACATTTG 


GGGCACCACA 


AACGATCTGA 


CGCTCTTCTT 


23100 


CTTCGCCAAC 


GTTAATCTGA 


CAAACATGGA 


GGTGAGTCTC 


TGGCACATCT 


TCGCAAGACA 


23160 
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AGACCTCACC GACGACAATT TTTGAGAGAC CAGCAGCTGC TGATTCGACA 
CGATCCCTGT AGTTGACATT TTTTCAGCCA ACTCTTGTGA TGGCACATCA 
ATTCTTTTAA CCATTTATAA GATACAAGCA TAATTTAGTT CTCCAGAATG 
CTCTAGTTCT TTTCCTTTCC TATCATTTCA ATAGAAGAAT CCTCTTCTTA 
TTTCTCACTA ACCAATCCGT ATCTACTTTT TGACCAACCA TAAAATGATG 
TTTTCAAAAC CATATCGGTT ATAAAACGCT TGAGCTTTTG TATTATCCTC 



CCCTCTACCT 
ATGTCCACCA 
ACAGTTGTCA 
CCTTAATTTC 
TTGGCTAAAT 
CCAAACACCT 



AGCCAACCCC AAGAAAAACT ATTTTTTGTA GCAAGTTCAA GTGCGAATTC AAACAGTTGC 
TTACCTAGTC CAAATCCTTG GAATTTTTGT AGCACATAGA GACGTTGAAT TTCAAAAGCG 
TCCTCTAATT CTCTCTCAGT TTGAGCACTT CCCCAGTTGA CTTTGAGAAA ACCAGCTATC 
TCCTCCTCAT GCATAATGAA ATAGGTTTCA GAGTCAGGAT TTCCCAACTC AGTTGACAAA 
GTTTTCAGAC TATAAGCCTC TTCAAAGTAT TCCTGTAACT GCTCTTCCGT ATTATCATAC 
GCAAAGGTTT CACGAAAGGT TTGTTTGGCA ATTTTAGCCA ACACCTCAAC ATCTGCCATT 
TCTACTTTTC TAATCATTAT TTAAACTGTT CTGAGAAGCG GACATCTCCT TGGTAGAATC 
CACGGATATC GTTGATTCCA TAACGGAGCA TAGCTACACG CTCTTCTCCA AGACCAAAGG 



CAAAGCCAGA 

TACCGGCCCC 

CACACTTGAA 

GACGCAAACG 

CTTGAAGATC 

GGTGACTGTG 

TCAAAGGACC 

GGGTACGGAG 

GGTCTTTTGG 

CCACGACTTG 

AAACGTGACG 

TAGCCAGTTG 

CAGTCAAGAC 

AAACATCTTT 

CACGCAAATC 



GTATACAGTC 

CATAATTTCG 

GCAAGAAACA 

AATTTGACGC 

AGCCATAGAG 

GGTCGCATCG 

TTTAGAAAAA 

CAAGATTTCT 

AAGGTTCATA 

ATAACCCATA 

GTGACCAGTC 

AGCCGCGACT 

ATCACGAGCT 

CATCCCTTTG 

TTGCATCTCT 



GCATCGATAC CACTCATTTC AAGGACACGT GGGTGAACCA 
ATCCAACCTG TTTTCTTACA TACATTACAG CCTTCTCCAC 
TCCACCTCAA CAGATGGCTC TGTGAATGGG AAGTAAGATG 
TCTTCACCAA ACATTTTTTG GACAATCAAC TGAAGCGTTC 
ATATTTTTCC CAACTACCAA GCCTTCGATT TGGTGGAATT 
TCCGTATCGC GACGGAAGAC ACGCCCTGGC GAGATCATCT 
TCATGGGCAT CCATAGCACG CGCCTGAACT GCAGACGTGT 
TCAGTGATAT AGAAAGTATC CTGCATATCA CGAGCTGGGT 
CGTTCAAAGT TATAGTAGTC TTGCTCCACT TCAAAACCAT 
CCGATGAAGA TATCTTCGAT TTCTTCACTG GTTTGTGTCA 
GCAACTGGAC GACCTGGAAG CGTCACATCT ATACTCTCGC 
TTCTTTTCTT CCAAGAGCTT AGCTGTTTCT TCAAAAGCAG 
TCATTGACGT GTTTCCCGAT GATTGGACGC ATCTCAGCAG 
AGGATTTCAG TGAGCGAACC CTTTTTACCA AGGACAGAGA 
TTTTCATTTC CAGCAGTAAT CTGCTTCAAG CTAGCCAGCG 



23220 

23280 

23340 

23400 

23460 

23520 

23580 

23640 

23700 

23760 

23820 

23880 

23940 

24000 

24060 

24120 

24180 

24240 

24300 

24360 

24420 

24480 

24540 

24600 

24660 

24720 

24780 

24840 

24900 
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TTTCTTCGCG 


AAGCGCTTTT 


AATTGTTCTT 


CAATAGTTGA 


CATATTTCCT 


CCATCAGTCT 


24960 


CTCGTAGATA 


AAAAGAAAAC 


CACATGCCAA 


AAACTCCACT 


CGGAGCGTTG 


ACACGCGGTA 


25020 


CCATCCGTTT 


TCATCTGACA 


AGTCAGACCT 


TCATTTCTAA 


ATCCATGCGC 


AAGTGAATTC 


25090 


ACCCAGCTTT 


CATATAGAGA 


GCTTGCAGTC 


ACGGCTCTCC 


TCCCTGATAT 


ACTTCCCTTG 


25140 


GGCTACTAGT 


CTTTCAGATT 


CCTATTCAAT 


TACTACTTAG 


TTTATCAGAT 


TTTTACCATT 


25200 


CTTGCAAGAC 


CTATCTTACT 


TCTGCTTGTT 


AGCTTATTCT 


TATCTAAATT 


TATATAAACc 


25260 


TTATCTAAAT 


TAACTATTTA 


TAATTTTTGT 


AACAAAATTA 


AATTAATTGA 


CACTCCCCTA 


25320 


TAAAATAAAG 


AAGTTTAGAA 


TTTAATGTCT 


TCCAAACTTC 


TTTATTCCAT 


ATTTAATGAA 


25380 


ATGCCACCTT 


AACCGTGATA ATAGCTAGTC 


ATCAATAAAA 


AACTATTTGA 


ATAAGGATTC 


25440 


TCCATTTGAT 


TCAATCACTT 


CTTTATACCA 


AGTAAAAGAC 


ATTTTCI"rAT 


ATCGATTTAA 


25500 


TGTACCACTT 


CCATCATCGT 


TTCGATCAAC 


ATAAATGAGA 


CCGTACCTTT 


TAGAAAGTTG 


25560 


TGCAGTGGAC 


ATAGAAACAC 


AGTCAATACA 


TCCCCAAGAC 


GTATAGCCCA 


TAATTTCAAC 


25620 


ACCATCCTGT 


AGAGCTTCAG 


CAACTTGCAA 


TAAATGTTCT 


TTCATATACT 


GAATTCTATA 


25680 


ATCATCTTGG 


ACGGTTAAGT 


TATTAAGTTC 


ATCTTTTATT 


AGTTGATCTT 


TAGCACCTAA 


25740 


TCCATTTTCT 


ACTATAAATA 


ATGGGATTTG 


ATAACGGTCA 


TAATATCTAT 


TTAAAATTAT 


25800 


ACGTAGTCCA 


ATTGGATCAA 


TTTGCCATCC 


CCACTCTGAA 


GACTCTAAAT 


AAGGATTTAC 


25860 


TAAACCACCA 


ATAATATTCC 


CTTCTCCTGA 


ATTATACTGT 


GTTGGAAGAG 


CAGATTGAGT 


25920 


CACACTCATG 


TAATAGCTAA 


AGGATAAAAA 


ATCTACGGTA 


TAATTTTTTA 


ATAACTCTCC 


2S980 


ATCTTCAGCT 


GCAAACTCTA 


TGTTAATGTC 


ATTTTCCTTA 


AAATATCTTT 


TTGCATAATT 


26040 


CGGATAATAA 


CCTCTAACAT 


GCACATCTGA 


AAATAGATAA 


TTTAGATTCT 


CATACTCATG 


26100 


AGTCGCCCAT 


ACATCTTTTG 


GATTTGGAGT 


CATTGGATAA 


GCTGGCATAG 


CTAATACCAT 


26160 


ACATCCCACC 


TTAAACTCTG 


AATTAATCTC 


ACGAGCAATT 


TTTGTAACCA 


AACTTGAGGC 


26220 


GACTAATTCA 


TGATGTATAG 


CTTGATATAA 








t c *i on 


ATCTATTCCT 


CCACTAGTAA ATGGTAATTC 


CAAAACAGAG 


TTTACTTCGT 


TAAATGTAAG 


26340 


CCAATATTTA 


ACTTTATCTT 


TATACCTTTC 


TAAAACTGTT 


CGAGCAAATT 


TTTCATAAAA 


26400 


ATGAATCATT 


CTCCTATCAA 


CCCATCCATG 


ATATTTTCTT 


GCTAAATATA 


ATGGAGTCTC 


26460 


ATAGTGTGAA 


AGAGTTACAA 


GTGGTTCTAT 


CCCGTGAGCA 


TGTAGTTCAT 


CAAACAATTC 


26520 


ATCATAATAT 


TTCAACCCAG 


CTTCGTTAGG 


TTCTTCCTCA 


TCTCCTTTTG 


GAAAAATTCT 


26580 


ACTCCATGCA ATAGAAGTAC 


GAAAAACATT 


AAAGCCCATT 


TCAGAAAACA 


AGGATATATC 


26640 


TTCCTTATAT 


TTATGATAAA AATCAATACC 


TATCAATTTT 


AAGTTATCTT 


CTGTAGGATT 


26700 
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A A W 4 1 A 1 


1 v. 1 V.L lAAt L 




tn ft ft m a il im a 

TAAC ACATC C 


99%^^k ft ft ^■feB^fttfft ft ft 

TGAACTGATA 


AGCCCTTACC 


26760 


«*V»A 1 A In 






AGCTGCAACA 


GCTCCACCCC 


AAAGAAAATC 


26820 


ATfTfVlA & & & 


A 1 \AjTCAT AA 


CTTTCCTCCA 


TTATAATATT 


ACCACTAATT 


CCTTAGAATG 


26880 


PTrTATTrnT 


ftu 1 * k mm ft raw* ft ^ 

TGATTTATTAG 


ft ft fit ft mm ft ft 

GTAATACTAA 


TACATCTAGA 


AAATCATTGG 


TATTCGTTAC 


26940 


t\n 1 i JU» I VjO 1 


GTAAC 


CGTAGCCTTT 


AGTCTTGATT 


AAATTCAAGT 


CCATTTCAAA 


27000 




1" AAAA 


CTCTGTCTCC 


TTCTTCTACA 


TGACTAATAA 


AACCTTGACC 


27060 


I 1 i i AGCTCA 


ACAGTATCTA 


ATCCAATATG 


AATTAGTAAC 


TCAACACCCT 


CATCACTCTT 


27120 


L.AA 1 LUlAl 1 


GCGTGCTTAG 


■■l/BMM ft ft ft ft ft Jlh 

TCGGAAAAAT 


ATTTGTAATT 


TTCCCATCAA 


ATGGTGCATA 


27180 


AALtTrTACCT 


ft ^*w#mft^m #ft,^ft ft 

TCACTTGGGA 


TAATCGCTAC 


TCCGTCTCCA 


ATTAGTTTAT 


CTGAAAATGT 


27240 


TTTATCCTGG 


ACATCGCTTA 


ACGGAATGAT 


TTCTCCTGAT 


ATAGGAGAAA 


ATATCATTTT 


27300 


nTATTTGAA 


ACTCCAGCTT 


CAACTTCTAA 


ATTGCTAGAA 


CTCTCTTCTT 


CATCGATTCC 


27360 


mil m ft *n ft #f* m it 

AAATATATAA 


GCTAATACAA 


AGGTAATAAC 


AACCGAAATG 


ACCGCCACAA 


TTAAAGCATT 


27420 


TACAATATTT 


GATGGCACAT 


CAGAATAAAT 


AAATTGAGGC 


AACGCTATCA 


AAGATGGGAC 


27480 


ft lh ft ft ih ft m. 

AGCAAATAGA 


TATGCTTTAA 


CACTAGTAAG 


ACCTGCAAAT 


AATCCCGCTA 


ATCCACCACC 


27540 


ft ft Mft^* ft ft 

AATCATAGCT 


GCATAAAGCG 


WHWWATA 


TTTTAAAGTC 


ACACCATATA 


ATCCAGGTTC 


27600 


ft ft 

GGTAATCCCT 


GCAAGTAAGG 


CTGAGAAACC 


TGCTGCAAAA 


GCAATTTGTT 


TTGTATTATT 


27660 


A"i"i~i"i~i - ACTC 


TTTAATGCAA 


CAGCCATCGA 


AGCAGCCCCT 


TGAGCTAAGT 


TTGACCCTAA 


27720 


CATTGCTGGA 


AGAATTAATA 


CGTCTGGAGT 


AGCAATAGAT 


GCCGCCAAAA 


AAATAGGTGC 


27780 


AAAAGCCCAA 


TGCATTCCAG 


TCATAACAAT 


AAATGGCATA 


ATAGCACCAA 


GAATAGCTAA 


27840 


TGTAAGCCAT 


CCAGCTACAC 


CATACATTTG 


CCCAACTAGA 


TTTGATAATC 


CTTCACCAAC 


27900 


AATTACTCCA 


ATAGGTCCGA 


CTACAACTAA 


GGCAATACAG 


CTTGATACTA 


ATAATACTAG 


27960 


CGTAGGTTGC 


AAAAAACTCT 


TAGTAATAGC 


TAGTGTTAAT 


TTAGCAATTA 


TTTTTTCAAT 


28020 


ATATTTCATC 


AACCAAACCA 


TAATAAGAAT 


TGGAACGACT 


GATGAACCAT 


AACTAGCTGG 


28080 


TGTCACAGGT 


GCACCAAATA 


AACTAAGAGG 


ATTCCCTGAT 


TGCACCATTT 


GAACAAAATT 


28140 


TGGATGGAGA AGTACACCTG 


CTACAGACAT 


AGCTAATGTA 


GATGTTACTT 


TTAATTTTTG 


28200 


TGATGCAGAA 


TAAGCTAATA ACAGCGGTAA 


GAAATAATAT 


GGAGCATCCC 


CAAAAAATGT 


28260 


CAAAAAAGCA 


ATAGTCTGAG AATCTGATTG CAATATACCA AGCATTCCTA AAATGATTAC 


28320 


CAAGACTTTC 


AACATACCTC 


CCCCTAACAT TGCTGGAATG 


ATTGGAGTCA 


TGGAACCAGC 


28380 


GATATACTCA 


ATGATTCTTT 


CTAAAATATT 


CCCTTTGTGC 


CCTTGAACAA 


CTGAATCGGA 


28440 
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TTCAAAATTG CCAAGTTTAA CGAATTCTTT ATAATAATTA GCTACATCAT TACCAAGTAT 28500 

AATTTGATAT TGTCCATTCT TTTTCATAAT ACCTATTACA CCTCGTATCT TCTTCACATC 28560 

ATCATCATTG ACTAAATTTT CATCTTTTAA TTCTAATCTT AAACGTGTTA CACAATGGGT 28620 

AACTCTATTG ACATTTTTTT CACCTCCAAT TACATCGAGG ATTTTTTGTA CCGTATCTTT 28680 

ATAACTCATG GTATTCTCCT ATTCTATTAA TCTAAATTTT TTGTTAAGCG ACGAATATGA 28740 

GCCATCAAAT AAACTAATTC ACTAGAAGTC AGCAAATAAT TGTACTCCGT TTCTATAAAC 28800 

ATTGCTACCT GTTCACCACA TTCATATTCT CTAGGATATT TATTTTTCAT TAATGCTAAC 28860 

AAGTCTTCAT CATCATCGTC GG 28882 
(2) INFORMATION FOR SBQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12835 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(DJ TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

GCCTATGTCT TTTTCAAAAA AATGCTTGAC TTGAGACGGG AACTAGGGAA GTCTAAAGGC 60 

GGAAGGCATT GATTTATACT CTTCGAAAAT CTCTTCAAAC CACGTCAACG TCGCCTTGGA 120 

TTATATATGT AACTGACTTC GTCGATGCTT ATCTACAACC TCAAAGCAGT GCTTTGAGCA 180 

ACTTGCGGCT AGTTTCCTAG TTTGCTCTTT GATTTTCATT GAGTATTATA TTACTTTCTA 240 

i 

TTTGTAGGAG GTGGCTTATG AAGATTCCTC TCTTAACTTT TGCAAGGCAT AAATTTGTTT 300 

ATGTCTTGCT TACTTTGCTT TTTCTTGCTT TGGTTTATCG TGATGTTTTG ATGACTTATT 360 

TCTTTTTTGA TATTCATGCG CCCGATCTAG CTAAATTCGA TGGACAAGCA ATTAAAAATG 420 

ACTTATTAAA ATCAGCATTA GATTTTCGTA TTCTCCAGTT CAATCTAGGT TTTTATCAAT 4 B0 

CATTTATTAT TCCAATCATC ATTGTTTTGC TAGGTTTTCA ATATATTGAG CTGAAAAATA 540 

AAGTTTTACG ATTGAGTATT GGAAGAGAAG TGAGTTATCA AGGGTTAAAA AGAAAGTTGA 600 

CTTTGCAAGT TGCAAGTATC CCTTGTTTGA TATATTTAGT GACTGTGCTG ATAATTGCAA 660 

TTATAACCTA TTTCTTTGGG ACTTTTTCTC CTCTTGGATG GAATTCTCTA TTTTCTGATG 720 

GAAGTGGTTT ACAAAGACTC CTAGATGGAG AGATAAAAAG CTATTTGTTC TTTACTTGTG 780 

TCCTACTAAT CGGTATTTTC ATCAATGCAA TCTATTTTTT ACAAATAGTT GATTATGTGG 840 

GGAATGTGAC TCGTTCGGCA ATCACCTATT TGATGTTTCT TTGGCTTGGT TCTATGCTGC 900 

TTTATAGTGC CTTGCCTTAC TATATGGTTC CTATGACGAG TTTGATGCAA GCTAGCTATG 960 
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GGGATGTAAG TTTGATGAAA CTCTTTACTC CTTATATCCT 
TGCTTGAAAA ATATGAAGAT AATGTTTAAG AATTTTAACA 



ATTGTTTTAC 
GCGCTTCAAA 
TATAATGACT 
TTCATGGTAG 
CACTTGATTC 
ATTTCTAAAT 
ATTCATTTCA 
CTACTGTCTT 
TTTTTAGCAT 
CTCTTAAGTT 
TAAACGATGA 
AATCAGTTGA 
TCTGGCAAGA 
GTTCTTCAAG 
TTAATTGAAA 
AGGTATTTTT 
TTACAGGAAT 
GCCTTGATTC 
GCTTTGGATG 
GAAAATGGTC 
GATCTATTAG 
TGGCTTATGG 
GTTCTGCTGA 
AGGATTGACA 
ATGGCCTTGC 
CTTTGGATTA 
ATAAAAGTCG 



TACTTCGTAT 
ACCAGGATGC 
ATTCTGAAGC 
GATGGCTCTC 
GCTATCAATC 
TTTTTACTCA 
AAACAGTCGC 
ATCTGATAGC 
TTGTGGGACA 
TATTAGTTAT 
GACTTGAAAT 
ATTTGGTGTT 
CGGTTCTTTT 
ATGGTAAAGT 
AAGTCGAGTT 
CATCTAAAGT 
TTGAAGACAT 
AAGCCTTTAT 



AGTTCTGATG ATGATTTTGA 
TGTTATCTTT TTCAAGAGAG 
GAATTTAGAA ATCCCCAAAC 
TGTCATTTTA CTTGAAAGTG 
AAGCTCCTTT TTCGATTATA 
AGATTTGTTT GTCTGGTTTC 
ACTTTTCTTT TTACTTGCTC 
ACTGATTAGT GCGGGCGCTG 
AGAATGGATG ATGGATCATA 



TTATATTGTC CCTTACATGG 
ATATTTTGCT AAATAGAAAG 
TAAACCATCT ATTGTCAACA 
AATTGATTTC AATTTTTTCC 
TATTGTTAAA CCTTTCGCTT 
ATTTGGCAGA CCATTACCAT 
CAAGGAAACG ATTGGTTGTC 
TTCGTTTACT TCCTCTAGGA 
AGTTAATGAT GTTGTACTTA 
GTTTTTCCTT TTTTCTCTAT 
TTGTAACAGT GTATTTACTA 



GTTGATTGTT AGTCGCTTGG AAGAGAAATT TAAGAAAGGA 



TATAAATGGA CAGAAAATTT 
TCAATCAGGA AAAATTTATG 
AAAGATACTT GCTGGTTATA 
TTACGGGGTA AAAAATCATT 
TTTATCTCAT TTATCCCTGA 
TACGGAAAAA AGAATTGCCT 



TGAATACCGT CATTTATCCT 
TTCCTCTCCT TCTATACTCT 



ATGGGAAAAG ACCTATTTTA 
GACTTAAAGG TGATAATCGA 
TTAAGCTTGA CAAAGGAAAA 
ATATTCAGGA TGCAGGAATT 
GAGAAAATTT GGAACTGTTA 
ATTGGATTCA ATACTATGAT 
TAGGAACAAA GCAAAAAATG 
TTCTCGATGA ACCTATGAAT 



AGAAGAGTGT 
TGGTTATCCT 
TTGTCGAAAA 
CACATCTAAA 
TCTGGCTGTC 
TCTTCATCAT 
TGTCCAAGAA 
ACTTATTTAT 
GAGAGGTTAG 



GAGGTTAACC AAACAGGTCA TTTTATCTTA CCTGAAAAAA 
GACGTCGCAC ATATCGGAAG ATATTTCAGA CCTTTGTACA 



TGGACATATA CAAATGTAAA 
ATCATTTATT ACACGATATT 
TTTCTTCTTT ATCCCTTGGG 
ACAGAAAATC TTGCTAGCTT 
AGTCAGTCTC TTTGTTTTTG 
CACATTTGCC ATTTTGCCGA 
CTTGAAAACT AACCTCTTTT 



GGATATACAA TCCTAGGAGA 
CCAAGGTTTA TATTGGTTTA 
ATAAACCACT TCTGGGGATA 
TTGGAATTCT GTCCATTCTC 
GACTGATTTG CTGTCTTTCT 
TTTTTGGCAA TTAAACAGTC 
TCCTTTTCAA AATGGGGATT 



1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 
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CTTCCTTGAA AATAATCAGT AATTGTGCTA AAATTAAACG AACATTCTAA AATATTCGGA 
ATTTAAAGTA AGGAAAAACA TGGCTAATAT TTTAAAAACA ATTATCGAAA ATGATAAAGG 
AGAAATCCGT CGTCTGGAAA AGATGGCTGA CAAGGTTTTC AAATACGAAG ACCAAATCGC 
TGCTTTGACT GACGACCAAC TAAAAGCAAA AACAGTTGAA TTTAAGGAAC GTTATCAAAA 
TGGAGAATCA CTGGATTCAT TGCTTTACGA AGCATTTGCG GTTGTCCGTG AAGGTGCCAA 
ACGTGTCCTA GGTCTCTTCC CTTATAAGGT TCAGGTCATG GGGGGGATTG TTCTTCACCA 
TGGTGACGTG CCAGAGATGC GTACAGGGGA AGGGAAAACC TTGACTGCGA CCATGCCGGT 
ATACCTCAAT GCCCTTTCAG GTAAAGGGGT TCACGTAGTT ACGGTTAATG AATACCTGTC 
AGAACGTGAC GCGACTGAGA TGGGTGAATT GTACTCTTGG CTTGGTTTGT CAGTAGGGAT 
TAACTTGGCT ACCAAATCTC CAATGGAGAA AAAAGAAGCC TATGAGTGTG ATATTACTTA 
CTCAACTAAC TCAGAAATCG GATTTGACTA CCTTCGTGAC AACATGGTCG TTCGCGCCGA 
AAACATGGTA CAACGTCCGC TTAACTATGC CTTGGTCGAT GAGGTTGACT CTATCTTGAT 
TGACGAGGCT CGTACACCTT TGATTGTATC AGGTGCCAAT GCGGTTGAAA CCAGTCAGTT 



GTATCACATG GCAGACCACT ATGTAAAATC TTTGAACAAA GATGACTACA 
GCAGTCTAAG ACTATTGGTT TGTCTGATTC AGGGATTGAC AGGGCTGAAA 
ACTTGAAAAC CTCTATGACA TCGAAAACGT GGCTTTGACT CACTTTATCG 
TCGTGCCAAC TACATCATGC TTCTCGATAT TGACTATGTG GTGAGCGAAG 
CTTGATTGTC GACCAATTTA CAGGTCGTAC CATGGAAGGT CGTCGTTATT 
GCACCAAGCT ATTGAAGCCA AAGAAGGTGT GCCAATCCAG GATGAAACCA 
CTCAATCACG TACCAAAACC TCTTCCGTAT GTACAAGAAA TTGTCTGGTA 
AGGTAAGACT GAGGAAGAAG AATTCCGTGA AATCTACAAC ATTCGTGTTA 
AACAAACCGT CCTGTTCAAC GTATTGACCA CTCAGACCTP CTTTATGCAA 



TAAGTTTAAA GCGGTTGTCG AAGACGTTAA GGCTCGTTAC CAAAAGGGTC 
GGTTGGTACA GTAGCGGTTG AAACTAGTGA 
TGTTCCTCAC 
TGCTGGTCAA 

CAAGCTTOGT GAAGGTGTTC GTGAACTTGG AGGACTTTGT GTTATTGGTA 
TGAAAGTCGT 
TGAGTCACAA 
CTTGAAGGGA 



CTACATTTCT AAGAAATTGG 
GAAGTCTTGA ATGCCAAAAA CCACTATAGA GAAGCCCAAA 
CGTGGTGCCG TTACCATCGC AACCAACATG GCGGGTCGTG 

AGGACTTTGT GTTATTGGTA 
CGTATCGATA ACCAGCTTCG TGGACGTTCA GGTCGTCAAG 
TTCTACCTAT CTCTTGAAGA TGATTTGATG AAACGTTTTG 



ATCTTTGAAC GCTTGAACAT GTCTGAAGAG GCCATTGAGT 



TCATCGATGT 

GCTACTTCAA 

ATAACGCCCT 

AGCAAGAAAT 

CTGATGGATT 

AGACATCTGC 

TGACGGGTAC 

TTCCAATCCC 

GTATCGAATC 

AACCTGTCTT 

TTGCAGCTGG 

TCATCATGAA 

GTACCGACAT 

CAGAACGTCA 

GAGATCCAGG 

GTTCTGAACG 

CTCGCATGTT 



2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

34 80 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 
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GACGCGTCAG 

ACAAGTCCTT 

TTACGATGTC 

CACGATTGAA 

AATTTTGAAC 

GTCAGGCTTG 

CGATAGTCAC 

GATTCTACGA 

TAACGCGGTT 



GTTGAAGCAG 
CAATACGATG 
ATCACTGCAG 
CGTGTCGTTG 
TTTGCTAAGT 
TCTCATAAGG 
GTTTCAAAAC 
GTCGTCCATA 
GGACTTCGTG 



AGGTTTCCGT 

GATGAAAGCA 

AGCGACTCGC 

GATTGGACGC 

TAAAAGACAA 

TGGGTATCCG 

ATAAATATAG 

TCACAGCGAG 

GTAATCGGGC 

GCAGTCCTAC 



ATGTTTAATG 

CAAATTCATG 

AATATCGCTG 

AATGAACTTT 

TAAAATGAGA 

TTTGCTTTAT 

AAGAAGTTCG 

ATCAAGAGCT 

CATGCTCATC 

AAGAAGAAGT 



AAACCCCGTA CCAACGGAGA 
GCGCCTACTC TTATCAATGG 
GAAACAGGGA TGACAACTGC 
GATTTGATTT CTTACATGGC 



CTCAGAAACG 
ATGTCATGCG 
ATCGTGACTT 
ATGGTCATGC 
ACAACTTGCT 
CCATCAAGGA 
TACGCGATGA 
ACAAGTGGAC 
GCTATGCTCA 
ATATGATTGG 
AACAAGAAAG 
CTCACCAAGC 
GCCCATGTGG 
TAGTTTAGAG 
AAGGAGATGA 
TGCCTTGTCA 
AGAAGCCATT 
TGACAACGAA 
GGCAGATCGT 
TGGCTATAAG 
AATCAAAGCC 



TGTCGAAGGA 
TGAACAACGT 
GGCACCTGAA 
GCGTGCCAAA 
TCCTCAAGAT 
AGAGCTTTTC 
AGAAGCAGTT 
AGATCATATC 
GAACAACCCT 
TTCGATTGAG 
ACCACAGGCA 
AAGTATGCCA 
TTCTGGTAAG 
GCGGATATCT 
GTTATGGTAT 
AAATTAGAAG 
ATACGTGGAG 
GAAGCTGTCC 
ATCTTTATGG 
GGCTTGATTC 
GTTCGCCATC 



AATAACTACG 
GAGATTATCT 
ATTCAGTCTA 
CAAGATGAAA 
TCTATTACGA 
CAACGTTCCT 
AAAGAATTCC 
GATGCCCTTG 
GTTGTTGAGT 
TTTGATGTGA 
GAACGTCATA 
GAAGATTTGG 
AAATTTAAAA 
TGTGAAAAGT 
TTACAGCAAA 
GTCAGGCTTT 
AAGACCAGCG 
TTGAATACGC 
TTATGCGTGT 
ACCAGCCTAA 
TTCACTATCG 



ATACCCGTAA 
ATGCTCAACG 
TGATCAAACG 
AACTAGAGGC 
TGGAAGACTT 
TGAAGGTTTA 
AAAAAGTTTT 
ATCAATTGCG 
ATCAGGCAGA 
CACGCTTGAT 
TCAGTACAAC 
ATTTGAGCCA 
ACTGTCACGG 
AAATTTTTAC 
AAGCTCTAAA 
GGAGAGGAAA 
AATTCTCTTG 
TAAGCGTTTG 
TTATACTGCC 
CGCGACAGAA 
TGTCATCACA 



GTGGCAAGTG 
CTCATGTTTA 
GAAGTAGAAA 
TATGGAAAAA 
GAGAAAATGG 
AAGCAGTATA 



GGGCAGGATT 
ATGGGATTTA 
CAACTGGGAA 
ATATTCCCAA 
GCTTGGAAAA 
TTGAACAGAT 



TGATGAAATG 
AGTTGGTGCC 
TTCTACTGGT 
TGCTGCTCAA 
CCCGCTTTCA 
CTACTATTAT 



CTTTATCCTG 
CGTTCAGTTG 
TTTAAAAATC 
AACAAACAAA 
CACGCTATTC 
GACAATTTAA 



AAAACCTTCC 
AAGACCAGCA 
CAACCTCTGG 
GTTTCCTTTT 
TTCGTGGTGC 
TTGATACCAT 



GCTTGTACAT 
ACACCGCTTT 
AAATCTCAAT 
CTTAGGAAAA 
TCTTAATGAG 
TGCCCAGTAT 



TCCTTTTATC ATCATTGATA CCAATCATGA CAATTCTGGT 
CCGAATTGTC CCCCAGACCT TGATTAACCG TGCTTGGAAT 



4560 

4620 

4680 

4740 

4800 

4860 

4920 

49B0 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 
6240 
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GAAAAAATTA 


AGCAGTTCGT 


TCGTGGTTTT 


ATGATTGAGT 


CTTATCTGGA 


AGATGGTCGA 


6300 


CAAAATGAGC 


CAGAAGTATT 


TGGTAAGTCT 


ATCACAGACC 


CTTGCCTGGG 


TTGGGATAAC 


W J U V 


ACAGAAGCTC 


TTGTCAGAGA 


AATTTACAAA 


ACGTTAGGAG 


AATAAGATGG 


CATTTATTGA 


6420 

v V 4 W 


AAAAGGTCAA GAAATCGATA 


TGGAAGTCAT 


CAAGGCTGAA 


ACCCAATTGT 


CTGCGGAAGC 


U i O \J 


CTTGAGACTC 


AAGGAAAGCC 


GTGACAGGGA 


ATTGGCAGAT 


ATTATTTCAG 


GGGAAGATGA 




CCGTATTCTC 


TTGGTGATTG 


GTCCTTGCTC 


TTCTGATAAT 


GAAGAGGCGG 


TTTTGGAATA 


A Ann 


TGCTCGCCGT 


TTATCTGCCT 


TGCAAAAGAA CGTAGCGGAT 


AAGATTTTCA 

M mm IV**» A. A A A Mm 


TGCTCATCTG 


QQQU 


CGTGTATACT 


GCTAAGCCTC 


GTACCAATGG 


AGACGGCTAT 


AAAGGATTAf! 




kt^ n 


AGATACTTCT 


AAGGCTCCAA 


GCCTGATTAA 


TGGCTTGCAG 


Vf V. 1UI VJV.VJV.V 




D / DU 


CCGCGTGATT 


ACAGAGACTG 


GTTTGACAAC 


GGCAGATGAG 


A*TW*T*fI*A'Pf* 

n » VJ\» L L k J\ 1 V* 




DOIv 


GATCTTGGTG 


GATGACTTGG 


TCAGCTACCA 


TGCCGTTGGA 


VA« A v. vi I IVi i Va 


1 uuAAuAL LA 


OjUU 


AGAGCACCGC 


TTTGTGGCTT 


CTGGGATTGA 


TGCACCAGTA 




A X LLAALL i \_ 


COCA 


AGGAAATTTG 


GGTGTTATGT 


TTAACGCCAT 


CTATGCTGCT 


V>fUUVtvnnu\f 


fc * » rPTTrt^P 
/Vr\i\L,L, I 1 1 


/ U«U 


TTATCATGGG 


CAGGAAGTTG 


AGACATCAGG 


TAATCCTTTG 


uLLV.Alui in 




/you 


AGCAGTCAAC 


GAGTATGGCA 


ATTATATGCC 


GAATTACTAC 


ip a TV & a a a^v* 
1 A 1 1mu\AA 1 ^ 






CATTGAACGC 


TATGAAACCA 


TGGGACTTGA 


AAATCCTTTT 








TGATAACTCA 


GGCAAGCAAT 


ATATGGAGCA GATTCGAATT 








TCGTGATTGG 


AATGAGAAAA 


TTAAAAAGAC 


GGTTCGAGGA 


i 1 IniuAl I\J 


Art Jul 1 AV»s> 1 




AGCAGATGGT 


CGTCAAAACC 


AACCAGAGAT 


CTTTGGTTGC 








AGGTTGGGAA 


AATACAGAGG 


CCTTGGTAGA 


AGAGATTTAT 




PAAAATAAGT 
lanAnAlnnu 1 




GAAAAGGATG 


GAGTTGGGGA 


ATCTCAACTC 


CTTTTGATGA 


GAATGATAGT 


TGHAPAPRGA 


7500 


ATTGACATCG 


AAGAATTGGC 


TTCGATAGAA 


AGCGCAGTTA 


CACGACATGA 


AfSG A' f ■ I TV"If""*P 

A\JV3tr\ 4 * 1 Uv I 


/ J o u 


AAGCGTGTAC 


TGACCGCTCA 


GGAAATGGAG 


CGCTTCACCA 


GTCTCAAAGG 


ACGCAGGCAA 


7620 


ATAGAATATT 


TAGCTGGTOG 


CTGGTCGGCT 


AAGGAGGCCT 


TTTCCAAGGC 


TATGGGAACG 


7680 


GGCATTAGCA 


AGCTCGGTTT 


TCAGGATTTG 


GAAGTCTTGA 


ACAATGAACG 


TGGGGCGCCT 


7740 


TATTTTAGTC 


AGGCACCATT 


TTCAGGAAAG 


ATTTGGCTGT 


CTATCAGCCA 


CACCGATCAG 


7800 


TTTGTGACAG 


CCAGTGTCAT 


TTTGGAGGAA 


AATCATGAAA 


GCTAGTCCAC 


ATAGACCAAC 


7860 


CAAGGCTCTG ATTCATCTGG 


GAGCTATTCG 


ACAAAATATT 


CAGCAAATGG 


GGGCTCATAT 


7920 


CCCTCAAGGA 


ACGCTCAAGT 


TGGCTGTGGT 


TAAGGCCAAT 


GCTTATGGTC 


ATGGAGCTGT 


7980 


TGCCGTTGCC 


AAGGCAATTC 


AAGATGATGT 


TGATGGCTTT 


TGCGTTTCCA 


ATATCGATGA 


8040 
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AGCCATTGAA CTCAGACAAG CTGGACTCAG CAAGCCAATC CTCATTTTAG GAGTTTCTGA 8100 
AATCGAAGCT GTTGCTCTAG CTAAAGAATA TGACTTCACC TTGACAGTGG CTGGACTGGA 8160 
GTGGATTCAA GCACTCTTAG ATAAGGAAGT GGACCTAACT CGATTGACAG TCCACCTCAA 8220 
GATTGATTCA GGGATGGGAC GGATTGGTTT TAGAGAGGCA AGTGAGGTTG AGCAGGCTCA 8280 

AGATTTGCTC CAACAACACG GTGTTTGTGT TGAAGGAATC TTTACCCACT TTGCTACTGC 8340 

TGATGAGGAA TCAGATGACT ATTTTAATGC CCAGTTAGAA CGGTTTAAAA CTATTTTAGC 8400 

TAGTATGAAG GAAGTTCCAG AGCTGGTTCA TGCTAGCAAT TCTGCAACGA CTCTTTGGCA 8460 

TGTAGAGACT ATTTTCAATG CGGTTCGTAT GGGAGATGCC ATGTATGGCC TCAATCCAAG 8520 

TGGAGCGCTC TTGGATTTGC CTTATGATTT GATACCGGCC TTGACCTTGG ACTCTGCTCT 8580 

GGTTCATGTC AAGACAGTTC CAGCTGGAGC TTGCATGGGC TATGGAGCAA CTTATCAAGC 8640 

GGATAGCGAG CAAGTCATCG CGACCGTGCC AATCGGGTAT GCAGATGGAT GGACAAGAGA 8700 

CATGCAAAAT TTCTCTGTCT TGGTAGATGG CCAAGCTTGC CCAATTGTCG GCAGGGTTTC 8760 

GATGGACCAA ATCACTATTC GATTGCCTAA GCTTTATCCG CTAGGAACCA AGGTAACCTT 8820 

GATTGGCTCC AATGGGGATA AGGAAATCAC TGCAACTCAG GTAGCGACCT ACCGCGTAAC 8880 

CATTAACTAT GAGGTGGTTT GCCTCCTCAG CGACCGTATT CCGAGAGAAT ATTATTAGAA 8940 

AAGAAAGGAG TGGAGCATGA ATCTACATCA ACCCTTGCAT GTCTTGCCTG GTGTGGGACC 9000 

AAAGTCAGCA GAAAAATACG CCAAACTAGG AA7TGAAAAC TTGCAAGATC TCTTGCTCTA 9060 

CTTTCCTTTC CGTTATGAAG ACTTCAAAAC CAAGCAGGTG CTGGAGCTGG AAGACGGTGA 9120 

GAAGGCAGTT CTTTCTGGTC AGGTAGTGAC TCCTGCTAGT GTCCAGTATT ATGGTTTCAA 9180 

GCGCAATCGC CTGCGTTTTA GTCTCAAGCA GGGAGAGGTC G TTTTTGCGG TGAATTTCTT 9240 

TAACCAGCCC TATCTGGCTG ATAAAATAGA GTTGGGAGCA ACCCTTGCTG TCTTTGGAAA 9300 

ATGGGACCGC GCTAAGGCTA GTCTGACTGG GATGAAGGTT CTGGCTCACG TAGAACATGA 9360 

CCTCCAGCCT GTCTATCGTC TGGCTCAGGG AATCAGTCAG GCCAGTCTGG TCAAGGTCAT 9420 

CAAGACGGCT TTTGATCAGG GACTGGACCT CTTGATAGAA GAAAATCTGC CCCAGTCTTT 9480 

ACTAGACAAA TACAAACTCA TGTCCCGTTG TCAGGCAGTC CGTGCTATGC ATTTTCCAAA 9540 

GTATTTGGCA GAATACAAGC AGGCTCTTCG CCGTATAAAG TTTGAGGAAC TCTTTTATTT 9600 

CCAAATGCAG CTGCAGATGC TCAAGTCTGA AAATAGAGTT CAGGGAAGTG GTCTGGTTCT 9660 

GAATTGGTCT CAGGAAAAAG TGACAGCAGT TAAAGTAAGT CTTCCTTTTG CCCTGACCCA 9720 

AGCTCAGGAA AAGAGTTTGC AGGAAATTTT AACTGATATG AAGTCCGACC ACCACATGAA 9780 
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TCCTCTCCTA CAAGGGGATG TGGGGAGTGG AAAAACGGTA GTCGCTGCCT TGGCCATGTT 9840 

TGCGGCAGTG ACAGCAGGTT ATCAGGCTGC CCTAATGGTA CCAACAGAAA TCCTCGCAGA 9900 

GCAACACTTT GAGAGTTTAC AGAACCTTTT TCCCAATTTG AAACTGGCTC TCTTGACAGG 9960 

TTCCTTGAAA GCTGCAGAAA AGAGAGAAGT CTTGGAGACC ATTGCCAAGG GTGAGGCTGA 10020 

TTTGATTATA GGAACTCACG CTCTGATACA AGATGGGGTG GAGTATGCTC GTCTTGGTTT 10080 

GATTATTATC GATGAGCAGC ACCGTTTTGG TGTAGGGCAA AGCCCTATTT TACGGGAAAA 10140 

AGGTGACAAT CCAGATGTCC TCATGATGAC GGCGACTCCC ATTCCACGGA CCCTTGCCAT 10200 

CACAGCCTTT GGAGATATGG ATGTTTCCAT TATCGACCAG ATGCCAGCAG GTCGGAAGCC 10260 

TATTGTGACG CGCTGGATCA AACATGAGCA ACTACCTCAG GTCTTGACTT GGTTAGAGGG 10320 

GGAAATTCAA AAAGGTTCCC AAGTCTATGT CATCTCTCCT TTGATTGAAG AATCAGAAGC 10380 

TCTAGATTTG AAAAATCCCA TTGCCTTATC AGAGGAGTTG ACGACTCATT TTGCAGGCAA 10440 

GGCAGAGGTG GCTCTTCTAC ATGGTAGGAT GAAGAGTGAC GAAAAAGACC AGATCATGCA 10500 

GGATTTCAAG GAGAGAAAGA CGGATATTCT GGTTTCGACG ACGGTTATTG AGGTTGGGGT 10560 

CAACGTTCCC AATGCGACTG TCATGATTAT CATGGATGCC GATCGCTTCG GTCTCAGTCA 10620 

ACTTCACCAG CTTAGAGGTC GTGTCGGTCG GGGGGACAAG CAGTCCTACG CTGTTCTCGT 10680 

TGCTAATCCC AAGACGGATT CTGGGAAAGA CCGCATGCGT ATCATGACAG AAACGACCAA 10740 

TGGATTTGTC CTTGCGGAGG AAGATTTGAA AATGCGTGGT TCTGGTCAGA TTTTTGGAAC 10800 

CAGACAGTCA GGACTTCCAG AGTTCCAAGT GGCTGATATT ATCGAAGATT TTCCGATTTT 10860 

AGAAGAAGCA AGAAAGGTTG CTAGCTACAT TAGTTCTATA GAAGCTTGGC AAGAAGATCC 10920 

AGAGTGGCGC ATGATTGCCC TTCATCTGGA AAAGAAAGAA CATCTGGATT AAGCTTTCTC 10980 

TAAGGAAAAC TTATACTCAA TGAAAATCAA AGAGCAAACT AGGAAGCTAA CCGCAGGTTG 11040 

CTCAAAACAC TCTTTTGAGG TTGTGGATGA AACTGACGAA GTCAGCTCAA AACACCGTTT 11100 

TGAGGTGGCA GATAGAACTG ACGAAGTCAG TAACATATAT ATACGGTAAG GCGACGCTGA 11160 

CGTGGTTTGA AGAGATTTTC GAAGAGTATT AAGCTAGTTT TTAGGTTTGG CTCTTATACT 11220 

AGAGTCATCA AAAAGAAACG AGGACTCTCA TATGACAGTA ACGATTAAAG TAAATTACCA 112 BO 

AACCACTTTC CAAAAGAAGG AAGCAAAAAA CTAGTATAAA CAGAAGAGAG AGCGAAATGC 11340 

TCTTTTTTCG TTTCTAAAAC TACTTTCAGC CCATCATCCT AAAAGTAAAG AATCTAAATT 11400 

CACTTTCTAT TTACCCTTCT TTCTTGCATT GATTACATAG ATATGCTACA GTTGTGGTAA 11460 

CGATTACAAA ATAAAAGGAG CATGCTATGA AAAATCCAGC TTTGCTAGAA GAAATTAAGA 11520 

CCTATACAGG AAGGGATGAG GTTCCGGAAG ACTTTGATGA TTTCTGGGAT GGGGAAGTGA 11580 
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AAAATCTTTC CACGCTTCCA TCCTACCACT TGGAGGAAAG AGATTTCCAC ATTCCTCAAG 
TCAAGTGCTA TCAGTTAACA TTTGAAGGAA GCAAGGAAGG AAAGGTCTAT GCACGCATTG 
TTCTTCCAAA GAGTGAGGAG AACGTCCCAT TAATCTTCCA TTTTCATGGT TATATGGGAC 
GTGGCTGGGA CTGGGCCGAC ATGCTGGGCT TCACCGTAGC TGGTTACGGT GTTGTTTCCA 
TGGATGTGCG GGGCCAGTCA GGTTACTCAC AAGACGGCTT GCGTTCTCCT TTAGGAAATA 
CCGTGAAGGG GCATATTATC CGTGGTGCTG TGGAAGGTCG GGACCACCTC TTTTATAAGG 
ATGTTTATCT GGATATTTAC CAGTTGGTCG AAATTGTTGC TACTCTGTCT CAGGTTGATG 
AGAAGCGTCT TTCTAGCTAT GGTGCCTCAC AAGGAGGGGC TCTAGCTCTA GTTGCAGCAG 
CGCTCAATCC TCGAATTCAG AAAACAGTTG CCATTTATCC CTTCTTGTCA GACTTCAGAC 
GGGTGATTGA GATTGGTAAT ACTAGCGAGG CTTACGACGA ACTTTTCCGT TATTTCAAGT 
TTCACGACCC CTTCCATGAA ACAGAGGAGG AAATCATGGC GACCCTTGCC TATATCGATG 
TCAAAAATCT TGCCCATCGT ATCCAAGGTG AGGTTAAGAT GATTACGGGC TTGGACGACG 
ATGTTTGCTA TCCCATTACC CAGTTTGCGA TTTATAATCG TCTGACCTGC GATAAAACCT 
ATCGCATCAT GCCTGAGTAT GCTCACGAAG CCATGAATGT ATTTGTCAAT GACCAAGTCT 
ACAACTGGCT CTGTGGAAGT GAGATTCCTT TTAAATATCT AAAATAAGGA GTCGACTCTA 
AGCACAAAAT CTTAAAAATT ACAAACACGC ATAGTATCAG GGGATTAAGA AAACTTTATA 
CTATGCGTTT TATCATGCAA ATATAGTAAA ATGAAATAAG AACAGGACAA ATCGATCAGG 
ACAGTCAAAT CGATTTCTAA CAATGTTTTA GAAACAAATG TGTACTATTC TAGTGTCAAT 
CT ATT AT A TP TATAGAATTT TTTGTTGCTA GATTTGTCAA ATTGCTTAAA ATAATTTTTT 
TCAGAAAGCA AAAGCCGATA CCTATCGACT AGGGTAGTTC TTGCTATCGT CAGGCTTGTC 
TGTAGGTGTT AATACTTTTC AAAAATCTCT TCAAACCACG TCAGCTTCGC CTTGC 
(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5020 base pairs 

(B) TYPE; nucleic acid 
<C> STRANDEDNBSS : double 
(D) TOPOLOGY: linear 



11640 

11700 

11760 

11820 

11680 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

127B0 

12835 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 
GGGGATATGA AGAACAAAAG AATATTTAAA GACTTCCAAG CTTCAAAAAT GAGTTTAAAC 60 
ATTTACACAA GCCCCTTGTT AGCCTTTGTT TTTGTCTTCA TAGGAGAGTT TGTGGCTTTT 120 
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ACTTTGTATG 


GTATTGGCTT 


GTTAGCTCTC 


ATCGGACTTG 


CTAGAAATTT 


TGGAGAGGCT 


180 


GGTCAAAATC 


TTGCAAGCTA 


CTTGCAGACC 


TTGCATCAGA 


GCTTGACGGA 


TAAAACAAGT 


240 


GACTTTCGTT 


TAATTTTAGG 


ATTACTGGCC 


nTGGTTATT 


CTTAACACTG 


TGTTCAGATG 


300 


GACAAGAAAA 


GTTGAGAAAA 


GACCTATTCG 


AACCTTGGGA 


TTTTATAGAG 


AGAATTTCCT 


360 


CAGCAATCTT 


CTGAAAGGAT 


TTAGTCTAGG 


CCTGGCACTT 


TTTCTTCTGA 


CCTTGTTAGG 


420 


TTTAGTGGTC 


TTAGGTCAAT 


ATCGTTTGGA 


ATCCATTCAC 


TTGAATCCTT 


ATTCTCTTGC 


480 


CTTTGTCGTC 


TTTACTATCC 


CATTTTGGAT 


TTTACAGGGG 


ACAGCAGAAG 


AAGTGGTGGC 


540 


CCGTGCTTGG 


CTACTTCCTC 


AATTGGCCTC 


AAGAACCAAT 


CTAAAACTAG 


CTATTCTTAT 


600 


ATCTAGCCTG 


TTCTTTACCC 


TGCTTCATAT 


CGGCAATTCT 


GGTCTCACCC 


CTCTATCTCT 


660 


AGTAAATCTC 


TTTTTATTCG 


GAGTTGCCAT 


GGCTCTTTAC 


CTTCTCAAAA 


CTGATACAGT 


720 


TTGGGGTGTT 


GCAGGTATTC 


ATGGTGCTTG 


GAATTTTGCT 


CAGGGTAATC 


TCTTTGGGAT 


780 


TTTAGTTAGT 


GGTCAACCGT 


CAGAACGTCT 


CTGATGACCT 


TTTTACCACA 


AGGCAATCAA 


840 


GATTGGCTAT 


CAGGTGGTTC 


TTTTGGCATA 


GAAGGTTCCA 


TTATGACAAG 


TCTGGTATTA 


900 


CTACTGCTGA 


TTGTCTATCT 


TGCTAATAAA 


TTAAAGAAAG 


AAAATGAAAG 


GATGTGACTT 


960 


CGGTCCGTCC 


TTTTCTTCGT 


GAAAATACTA 


TAAGTATGCT 


AAAATAGGAA 


TAGCACATGG 


1020 


AGAGAGGATT 


CTTATGATCA 


ATCACATTAC 


AGATAATCAA 


TTTAAACTAG 


TATCAAAATA 


1030 


TCAACCATCA 


GGAGATCAAC 


CCCAAGCTAT 


CGAGCAGTTG 


GTGGATAACA 


TTGAGGGGGG 


1140 


AGAAAAAGCT 


CAGATTCTGA 


TGGGGGCGAC 


TGGAACAGGG 


AAGACCTATA 


CTATGAGTCA 


1200 


GGTCATTTCT 


AAAGTCAATA 


AACCAACTCT 


GGTTATTGCC 


CACAATAAAA 


CTCTGGCTGG 


1260 


TCAGCTCTAT 


GGGGAGTTTA 


AGGAATTTTT 


CCCTGAAAAT 


GCAGTTGAGT 


ATTTCGTATC 


1320 


CTACTATGAT 


TATTACCAGC 


CAGAGGCCTA 


TGTCCCTTCT 


AGCGATACCT 


ATATTGAGAA 


1380 


GGATAGTTCT 


GTCAATGACG 


AGATTGACAA 


GCTTCGCCAC 


TCAGCTACCT 


CAGCCCTTTT 


1440 








Ma I \- rt i rGT 


ATCTATGGTT 


TGGGTTCGCC 


1500 


CAAGGAATAC 


GCTGATAGTG 


TCGTTAGTCT 


CCGTCCTGGT 


CTAGAGATTT 


CTCGTGATAA 


1560 


ACTCTTGAAT 


GACTTGGTCG 


ATATTCAGTT 


TGAACGTAAT 


GATATTGATT 


TCCAACGCGG 


1620 


AAGATTTCGC 


GTTCGTGGGG 


ATGTGGTAGA 


GATTTTCCCA 


GCTTCCCGAG 


ATGAACATGC 


1680 


CTTTCGAGTA 


GAATTTTTTG 


GAGACGAAAT 


TGACCGTATT 


CGTGAAGTTG 


AGGCTCTGAC 


1740 


AGGTCAGGTG 


TTGGGAGAAG 


TGGATCATTT 


AGCGATTTTC 


CCAGCGACAC 


ACITTUTUAC 


1800 


CAATGACGAC 


CACATGGAAG 


TTGCCATTGC 


AAAGATTCAG 


GCCGAGTTGG 


AAGAACAATT 


1860 


AGCTGTCTTT 


GAAAAGGAAG 


GTAAACTGCT 


TGAAGCCCAG 


CGTTTGAAAC 


AGCGGACAGA 


1920 
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GTATGATATC 


gaaatgttgc 


GTGAGATGGG 


CTATACCAAT 


GGGGTTGAAA ATTATTCTCG 


1980 


CCACATGGAT 


GGACGGAGCG 


AAGGAGAGCC 


TCCTTATACG 


CTTCTCGACT TCTTCCCAGA 


2040 


TGATTTCTTG 


ATTATGATTG 


ACGAGAGTCA 


TATGACCATA 


GGGCAAATCA AGGGCATGTA 


2100 


CAATGGAGAC 


CGTTCGCGTA 


AAGAAATGCT 


GGTTAATTAT 


GGTTTCCGTT TGCCGTCTGC 


2160 


TTTGGACAAT 


CGTCCTCTCC 


GTCGGGAGGA 


GTTTGAGAGT 


CACGTTCATC AGATTGTTTA 


2220 


CGTTTCAGCG 


ACACCTGGTG 


ACTATGAAAA 


TGAACAGACC 


GAGACAGTGA TTGAGCAAAT 


2280 


CATTCGTCCA 


ACGGGACTCT 


TGGATCCAGA 


GGTGGAAGTC 


CGTCCGACTA TGGGACAGAT 


2340 


TGATGACCTC 


TTGGGTGAAA 


TCAATGCCCG 


CGTTGAAAAA 


AATGAGCGTA CCTTTATCAC 


2400 


AACTTTGACC 


AAGAAAATGG 


CAGAGGATTT 


GACCGACTAC 


TTCAAGGAAA TGGGTATCAA 


2460 


GGTCAACTAC ATGCACTCGG ATATCAAGAC CTTGGAACGG ACGGAGATTA TCCGTGACCT 


2520 


GCGCTTGGGT 


GTCTTTGATG 


TCTTGGTCGG 


AATTAACCTG 


CTCCGTGAAG GAATTGACGT 


2580 


TCCTGAAGTG 


AGCCTCGTAG 


CTATTCTCGA 


TGCTGACAAG 


GAAGGTTTCC TTCGCAACGA 


2640 


ACGTGGACTC 


ATCCAGACCA 


TTGGACGTGC 


TGCACGTAAT 


AGCGAAGGTC ATGTTATCAT 


2700 


GTATGCGGAC 


ACGGTTACCC 


AGTCTATGCA 


ACGTGCTATC 


GATGAAACTG CCCGCCGTCG 


2760 


CAAAATCCAG 


ATGGCCTATA 


ATGAAGAACA 


TGGTATCGTT 


CCACAAACCA TCAAGAAAGA 


2820 


AATCCGTGAC TTGATTGCTG TGACCAAGGC AGTTGCTAAG GAAGAAGACA AGGAAGTCGA 


2880 


TATCAATAGC 


CTCAACAAAC 


AAGAGCGCAA 


AGAACTAGTC 


AAAAAGCTTG AGAAACAAAT 


2940 


GCAAGAAGCA 


GTTGAAGTGC 


TTGACTTTGA 


ACTAGCAGCT 


CAGATTCGTG ATATGATGCT 


3000 


GGAAGTCAAG 


GCCTTGGATT 


AGGGGAATAG 


TATGATTTAT 


TTAAGAAAGT TAAAGAAAGA 


3060 


AGATTTGATG 


TCTTTATGGG 


AAATGGCTTA 


TTCACAGCTT 


AATCCAGTTT GGAAACAGTA 


3120 


TGATGCTCCC 


TATTATGATG 


ATTATCAGTA 


TTTTTCAAAT 


TTTAAAGAAT TCGAACTACA 


3180 


AAAATCAGAA 


TCCATTTTAA 


GCAACTCAAA 


TCGCCTTGGT 


ATTTTTUTTG ATGATAAACT 


3240 


AGTTGGGACT 


GTTTCGCGTT 


ATTGGGTATG 


TAAAGAAACA AGATGGATGG AATTGGGAAT 


3300 


TGGTATTTAT 


GATAAAAAAT 


TCTGGAACAC 


TGGTATTGGG 


AAAGTTGCTA TGTTGCAGTG 


3360 


GATAGATAGG 


ACGTTTCAGG 


ATTACTTGGA 


GTTGGAGCAT 


CTGGGTTTGA CAACTTCGTC 


3420 


AGGAAATATT 


GGTATGATGA 


AACTTGCTGA 


AAAATTAAGA 


ATGAAAAAAG AAGCTCATAT 


3480 


TCCAAAAGTT 


CGTTATTATC 


AAGGTAAATA 


TTTTGACAGT 


ATTAAATATG GTATTTTGAG 


3540 


AGAAGACTGG GAGAAAATAA ATGACGGTTA TTATCAAATC AATGGAAACT CCTGAAGAGA 


3600 


TAGAAGGTAA ATCCTTCGTT CACTGGCAAA CGTGGAGAGA GGCTTATGAT GACCTTTTGC 


3660 
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CTGCGGAATT TCAGGAGACA ATGACATTAG AAAGATGTCG ACTCTTTAGT CAAAAGTATC 3720 
CAGAAAATAC ATTGATTGCG ATGGATGGTG TGAAGATAGT TGGTTTTATA AGTTATGGCA 3780 
ACTGTCGTGA TGAGACTATT CAAGCTGGTG AAATTATTGC TTTATATGTT TTAAAAGACT 3840 
ATTATGGAAA AGGAATCGCA CAAAAGTTAG TGAAAGCAGC TTTGACTGAT CTTAATCATT 3900 
TTTCTGAAAT TTTCTTATGG GTATTGAAAG ATAACAAGCG CGCCATTGCT TTCTATCAAA 3960 

AAATGGGTTT TACTTTTGAT GGACAAGAAA AAATACTTGA ACTTGGAAAG CCTATAAAGG 4020 

AAAAACGGAT GGTATTCTAT TCTAAATAAT TCTCAAAAGT AAAAGCTAAT ATGGTACCAA 4080 

GTCTGAAAAT TTAATAAATT AGAAAGCGAG TAAATTTATG TCCCGTTCCC AATTAACAAT 4140 

TTTAACAAAT ATCTGTCTGA TTGAAGACCT CGAAACTCAG CGCGTGGTGA TGCAGTATCG 4200 

CGCCCCTGAA AACAATCGCT GGTCTGGTTA TGCCTTTCCT GGAGGTCATC TAGAAAATGA 4260 

TGAGGCTTTT GCGGAGTCTG TCATTCGTGA AATCTACGAA GAAACAGGGT TGACTATCCA 4320 

AAATCCTCAA CTTGTCGGCA TTAAAAATTG GCCACTAGAT ACAGGTGGGC GCTATATTGT 4380 

CATTTGTTAT AAGGCGACTG AGTTCTCTGG TACCCTTCAA TCTTCAGAAG AGGGAGAAGT 4440 

TTCTTGGGTG CAAAAAGACC AGATTCCAAA CTTAAATCTG GCCTATGATA TGCTACCATT 4500 

GATGGAAATG ATGGAAGCTC CCGACAAGTC AGAGTTTTTC TACCCTCGCC GTACAGAAGA 4560 

CGATTGGGAA AAGAAAATCT TCTAGTCTTT TACTAAATAA CCTAGCTGAT CCAAGGCCTC 4620 

CTCGATATAG TGGAGGTCTT GTTCTCTCTC GGCTTCAACT AGGTGATAAT GAATACCATC 4680 

TGTTAACTCA GAAATTGGCT TAAAGTCAGA ACGTTCAACT TGTTCTAGAA AATGTTGCAC 4740 

GTCGCGGCGA CAGGTCAGTT TTAGTAAGGT TTCAATCTCT CCATAAACAG GATGATCAAT 4800 

CAAGATATTT TGAACGCGAC CACCATTATC TACGATAGCA AGTAATTCTC GTCCAATTTC 4860 

TTCAACTTCA TGCTTGACCT TAAATAATTT GTGATGATAA GTATTTGCAT TAGCATCTTT 4920 

ATAGATATAA CCACGATTGG TAGATAGAAT TGGAGATCCA TCACCTCTTA AAATTGCAAT 4980 

ATCTTGAACA ATAACTTGTC GAGTGACATG AAAGTGCTCA 5020 
(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4965 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
AAAAAGTGGC AATCCATTGA TTGGCCACTT CATTTAGAGA ATTATCGTCT CGCCCTTGAA 60 1 

* 
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GAAGAAGCTC GTGTAGTACT TGAGTTACTG CTATCGCTAG AACTACTACT TGAACTGCTG 120 

GAGCTGGATG GAGTTGGTAG ACTCCCCACA ATACTAGACC AAGCATTCTG ATAATCCGCA 180 

TCACTTCCGC CAATAGCAAA GCGATAACTT GTCGCTGGCG CTCCTGACTT ATTAGCCCAA 240 

TAGCTGGTAA CAGTCGAACC TGTGACCTCT ACTTCTTTTC CTTCAACAGA AACCTTCTCT 300 

GGTTTTTGAC CTGTTGATTT CAAGACTTCC GATTTCACTA CACTAGGATC TAAAGCAAAG 360 

CGCTCXTPTCC CCCAAATCCT TGGGGAAGCT TGCTGAATCG CATTTACCAG ATGAGCCATG 420 

TAATTAGAGT TATTAGAATA ACCTGCTCTA CGTGACAATG AATGATTATC ATCATGCCCA 480 

ATCCAGCCAC CTAGGGTTAA TCTAGGTGTC GAAAGCATGA GCCACATATT TTCGTCTTGG S40 

TTGGTTGTAC CAGTCTTCCC AATCCAATCT GCATTAGCCA GAGTAGGATT TAAAGAAGTC 600 

AGGTPAGACT TGAAGGTTGT TGTCACACGA GAGGATAGAA CTTCTCGTAG CAATCCCTGC 660 

ATAATCGTCG CAGTAGCTTT TGAATAGACT TGAACCGGTT TATCCTGATA CTCATACACC 720 

ACTCTACCAT CTGCTGCTTC AATCTTTGAA ATCACATGCT TCTGATGATA AACTCCATTA 780 

TTAGCTAAGG TCTGATAGCC ATTGGTATGC TGGGCAACTG TGACTTCAAT ACCACCACCC 840 

ATTGGCAAGC TCTCAATACC GTACTCAGGA ATCTCGTAAC CCATCTTTTC CATATAACCC 900 

TTGACATCAA CACCCTTTTC ACGGAGCATA CGATAGGTCC AGTAAGCAGG GATATTCCAT 960 

GAATAGTTCA GAGCTTCTCC CAAGGTCATC ATTCCTGTTC CCTTGCTATT AGCATACATA 1020 

ATCGGATTGC CATTAGCAAA GTTTGTTGGA TAGTTAGATA GAATCGTTTC ACTTCCCATC 1080 

AAGCCCTGGT CAATAGCAAT ACCGTAGGCC AGCAAGGGCT TGGTAGTAGA AGCTGGCGAA 1140 

CGTTTGGTAT CAAAGGCATG ATTATTTTGA TTTTCTTGAT AATTACGACC ACCTACAAAG 1200 

CCTAGAATAG CACCTGTTTG GTTATCCATC AAGACATTCC CTACTTCTAC ACGACCTGTT 1260 

CCATCGTCTA AAAGATAGCC ATAATCAGCA ACCGCACTTT GCATGGCAGA ATGAATTTTC 1320 

TGATCTATGG TACTAGTAAT CTTATAACCA CCATTTTCAA TTTCCTTGGC TGCCAAATCT 1380 

CGATAAAACT TCTGAGTTCC CTCATTTTTC AACTCCTTAG CGGAGACATT C3TCTCTCT<SA 1440 

GCTAGATAGT CATACATACG TTCTTGAGCT TCTGCCAAAG TTGTAAAGTA TAAATAGTCT 1500 

CGTGAAATTC CTGTAACCGT GCCCGATGGT AAAAAGTCCT GTTTAAGGTC ATAATCCTTG 1560 

TACTGAGAAT ACTCGTCTTT GCTTAATGCA CCTGTACGAT ACATACTGTA AAGAACTGCC 1620 

TTAGCCCGTC TTAAGCCAAT TTCTAGGTCT TCATCACTCT TCAACTCCCC AGTATTTTCA 1680 

TAAGGAGAGT AAGTAATGGG ACTCTGTGGA AGTCCTGCTA AAAATCCTGC TTGAGGAACA 1740 

GTCAACTGAC TGGCATCTAC ACCGAAAATT CCCTCAGCTG CTTGCCGAGC CCCTGCAATA 1800 
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TTCTGTCCCT TATTATTTCG GCCAAAGGGA GCCACATTGA GATAGGTCGT TAAAATCTCA 
TCTTTATTCA TGGCGCGTTC CAAGGCAAGA GCATCCACAA TCTCTGCCGC CTTACGAGCC 
AAGGTCGGCG CATCCCCAAC CACCTGCTGT TTAATTAGTT GCTGGGTCAA GGTTGAACCC 
CCACTAGAGC AACCCAAACC TACAAATTTC CCCAAGGTCG CACGAATCAC CGCCTTGGGT 
ACTACACCCT TATGTTCTTT AAAGTGTTCA TCTTCTGTCG CAATGATAGC CTTCTTCAGA 
TTTTCCGAAA TTTGCTCAGA TGAGATAGAA GTGCGCAACA AATCACTCTC TATGGAAGCA 
ATCACCGTCC CGTCCGAATA GGTAATCTCT GAAATAGAAG AGATGTCCTT GACCTGATTC 
ACCAATTCTT CTCTCTGAGG CACCCGAACC TTGTCAAATA AGGCCACTCC GTATCCCAAA 
GCAATCCCAG CTCCCAACAT TCCTCCTAGA AAACCGAGTA CAAAGAGTAA GTTAAATAAG 
GCTTTTATAC TCAGTAAAAT AGCTGGGAAA ATGACTGACT TATCTAAGGT TTTAGATTTT 
TTGGTACTTG AACCTTTCTT GCCAGGTCTA GCTGATTTTT TATTTTTTTG TTTTTCCTGG 
AAAAATTCCA GCATTTTTCG TTTTAATTCA TTTAATTGAT TTTGCATGGA TTTCCTCACT 
TTATCTATTA TACCACAAAA GGGAAATTTT CAATAAAATA GCCACTTTCT TCCCTATTCT 
GCTAGGCTAT TGCCCAAGTT TGTGATACAA TAGGTAGAAA CAATAATTTT AAAAAGGAGA 
AAAAACACAT GCACATTTTT GATGAGCTAA AAGAGCGTGG TTTGATATTT CAAACGACTG 
ATGAAGAAGC TTTGCGTAAA GCCCTAGAAG AAGGTCAAGT TTCTTATTAT ACTGGCTACG 
ATCCAACTGC TGACAGCCTT CACCTAGGCC ACCTTGTCGC AATCTTGACA AGTCGTCGCT 
TGCAACTAGC AGGTCACAAA CCTTATGCGC TCGTTGGCGG TGCTACAGGT CTCATCGGAG 
ATCCGTCCTT CAAAGATGCT GAACGTAGTC TCCAAACAAA AGACACAGTA GATCGCTGCG 
TCAAGTCTAT CCAAGGACAA CTTTCTCGTT TTCTTGACTT TGAAAATGGC GAAAACAAGG 
CTGTCATGGT CAACAACTAC GACTGGTTTG GCAGCATCAG CTTCATTGAC TTCCTCCGTG 
ATATTGGAAA ATACTTCACG GTCAACTACA TGATGAGTAA GGAATCTGTT AAAAAACGGA 
TCGAAACAGG AATTTCTTAC ACTGAGTTCG CTTACCAAAT CATGCAAGGG TATGACTTCT 
TCGTCCTTAA CCAAGACCAT AATGTCACTC TTCAAATCGG TCGTTCTGAC CAGTGGGGAA 
ATATGACAGC TGGTACCGAA TTGCTTCGTC GTAAGGCGGA CAAGACTGGT CACGTTATCA 
CTGTTCCACT AATCACAGAT GCAACTGGTA AGAAATTTGG TAAATCAGAA GGAAATGCCG 
TCTGGCTCAA TCCCGAAAAG ACTTCTCCAT ACGAAATGTA CCAATTCTGG ATGAACGTGA 
TGGACGCTGA CGCTGTTCGC TTCTTGAAAA TCTTTACTTT CTTGTCACTT GATGAGATTG 
AAGATATTCG TAAACAATTT GAAGCAGCGC CACAOGAACG CTTGGCTCAA AAAGTCTTGG 
CTCGTGAAGT TGTTACACTT GTTCACGGAG AAGAAGCCTA CAAAGAAGCA CTTAACATCA 
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2400 
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2700 
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2B20 
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CTGAGCAACT CTTTGCAGGA AACATCAAAA ACCTTTCTGT CAAAGAGCTC AAACAAGGAC 3660 

TTCGTGGTGT GCCCAACTAC CAAGTACAGG CAGACGAAAA CAACAATATC GTGGAACTGC 3720 

TCGTCTCATC TGGTATAGTT AACTCAAAAC GCCAAGCCCG TGAAGACGTC CAAAACGGAG 3780 

CCATCTACGT AAACGCCGAC CGCATCCAAG AGCTTGACTA TGTCTTGAGT GACGCTGATA 3840 

AGTTAGAGAA TGAACTGACT GTTATCCGTC GTGGGAAGAA AAAATACTTT GTATTGACTT 3900 

ACTAAACTAT TCAACATTTA TCTATAAACA AAGGAGTTAA CCTCGAGAAA GGTAACTCCT 3960 

TTTTGCTGTT AATAACTCTC ATCTATCTAT TTTTAATAGA CAGGCTACGC AGGACAATGC 4020 

GCAAGGTTGT TAGATTATGT AAGATAGAGA GATTTGAAGG ACTGAACCAA TTAAATAAGC 4080 

CAAAGCCAAT CAAACTACTA TTTACGACAA CGGTATCCTG AATATTTTTC TTGATCAGTG 4140 

TTTGCAAAGA TGATGATAAC GAATCCAACT CTTGGAAGAA ATCCAAACGA TTATCTAACA 4200 

ATAAGATATC ACTCATCTGC TTAGAAATAT CTGCACTCTC ATTCATCACC ACACCGATAT 4260 

CTGATAGAGT TAAAGCCGCT GAGTCATTCA ATCCATCTCC AACCATCAAA ATAGTGTGAC 4320 

CTGCTTTCTG CAGTTTCTCT ACTAACTCAA ATTTCCCATC AGGTTTCAAG TCTGTATAGA 4380 

CCTGATCAAA GGGCAAATCT TTGACTAATT CCTCTGTCCT AATCAAGGTG TCTCCTGTTG 4440 

CCAGAATCAA TTTTTTCCCC TGTGCCTTAA GTTTATCCAA GGCTGTTTTT GCTTCTTTTC 4500 

TCAAAGGAGT ATGAATGCAG AACATTCCAA TCAATTCATT TTGATAAGCC AAGAATAAGA 4S60 

GATTGTAGTG ACTCTTGTAC TCTTCAATTA AAGCATTTTG TTCTGAACTG ATATGAATCT 4620 

GCTCATCCTG CATCAAGACA TAATTCCCAA TAAGAACTGG TTGGCCATCT ATATGAGATT 4680 

TGATCCCCTT GCTTGCGATA TATTGGAGTT TCCCATGCAT TTCCTCATGT TCAATTCCCT 4740 

CTATCTCAGC TTGCTTGACG ATGGCATTAG CAATAGCATG ATAAATGTGT TCCTCAAGAC 4800 

AGGCACTGAT TCTGAGAATA TCTTCCTCAC TATAGTCTCC AAAAGGTAAC ACCTTTTCAA 4860 

CTATAGGATA ACTAGTTGTG ATTGTTCCTG TCTTATCAAA CAAGAAAGTA TCAACTTCCA 4920 

GATATTTCTC CCTGTTGTGG CCTCTGGCTG TCATCTCTGT GCTGG 4955 
(2) INFORMATION FOR SEQ ID NO: 144: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3232 base pairs 

(B) TYPE; nucleic acid 
(C> STRANDEDNESS : double 
(D> TOPOLOGY: linear 



(xij SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
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CAGGGGCGTA TTACGTGACA ATTCAATGTA GGCTGTCGCT ACTTGCGCCA AAACAAGGAT 
TCGATAATGT CGGATGATAC TAACGATTAA ACCGAGCAGA AAGGATCCCA AAATTCCCCA 
AACTGCAATA TGCAAGGTCA GAAAGAATGC CTTTTGATAT 
AATGGATCAA TCCAAAAATA GAACCTCCCA TCTAGAAATA 
AAAATCTTCT TTGGATAATA TCTATTTTTT ATTGCCGTTA 
ATAAAATTTC TGAAATTTCC AAACAAAATA TTTTAAAAGT 



AGTGGTAGAT ATTGTTCAAC 
ATACAGTTAT TGTAGCACTT 
TAAGGATTTT TATCATAGAC 
TTTGAAAAAG AGTTAAGATA 



TTTTTGTAAT ACACAAAGTA AACGCTTACT TATTAAGGAG GACATTTTAT GTCATACAAA 
ACAAGCAATG CAGAAGGTCA TGTAGATTTC ATCAATACCT ATGATTTGGA GCCAATGGCG 
CAACAAGTTA TTCCTAAAGC AGCATTTGGC TATATCGCTA GTGGGGCGGG AGATACTTTC 



ACTTCTTTCC AGTGATTTTA GCGTCAGGTT 
TTCTTGCTTA TTTATGATAA AATGGGAGTG 
TGAGTAAAAC TAGGAGGATC CCATGTCTAC 
GATCGTTCGC CGTGAAAAAA TGGCTGCGCT 
ACGTTTTGAA CGTACTGCAA ATTCACAAGA 
AGAACAATTA CACGATAAAA ACGAAACAGC 
TGGTAAAGGA AAAGTTGGTT TTGCCCACCT 
CGTTCGTAAG GATGCTGTCG GTGAAGAAAA 
TGACTTCCTT GGTGTCGAAG GTGAAGTGAT 
GGCAACCCAC ATCACACACT TGTCTAAGGC 



CTTTTTAGTT 

TCGCAAAAAA 

AGAACATATG 

CCGCGAACAA 

ATTAAAAGAT 

TACTATCGCA 

TCAAGACCGC 

CTACGAAATC 

GCGTACGGAT 

TCTTCGTCCT 



TTTAAAGATT 

TGACTCATCG 

GAAGAACTAA 

GGAATCGATC 

AAATATCCCA 

GGACGCTTGA 

GAAGGCCAGA 

TTCAAAAAAG 

ATGGGAGAAC 

CTTCCTGAGA 



ATCCGTGAAT 

TATTCAATTT 

ATGACCAGCA 

CTTTCGGAAA 

ACCTCGATAA 

TAACCAAACG 

TTCAGATCTA 

CAGACCTTGG 

TCTCTATCAA 

AATTCCATGG 



TTTGACAGAC GTTGAAACAA TTTACCGTAA ACGTTACCTT GACTTGATTT CTAATCGTGA 
AAGCTTTGAA CGCTTTGTCA CTCGTTCAAA AATCATCTCT GAAATCCGTC GTTACCTTGA 
CCAAAAAGGA TTCCTTGAAG TGGAAACACC TGTTCTTCAT AATGAAGCCG GTGGTGCTGC 
TGCCCGTCCA TTTATCACCC ACCACAATGC CCAAAACATT GACATGGTGC TTCGTATCGC 
GACTGAGCTT CACTTAAAAC GCCTTATCGT GGGTGGTATG GAACGTGTCT ATGAAATTGG 
CCGTATCTTC CGTAACGAAG GAATGGACGC TACTCATAAC CCTGAGTTCA CTTCTATCGA 
AGTTTACCAA GCTTATGCAG ACTTCCAAGA CATCATGGAC TTGACTGAAG GCATTATCCA 
ACACGCTGCT AAATCAGTCA AAGGTGATGG CCCAGTCAAC TACCAAGGTA CTGAAATCAA 
GATTAACGAA CCATTTAAGC GTGTTCATAT GGTGGATGCT ATCAGAGAAA TTACTGGTGT 
CGATTTCTGG CAAGACATGA CTTTGGAAGA AGCTAAAGCT ATCGCTGCTG AGAAGAAAGT 
TCCAGTTGAG AAACACTACA CTGAGGTTGG TCACATCATC AATGCCTTCT TTGAAGAGTT 



60 
120 
180 
240 
300 
360 
420 
480 
S40 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



WO 98/18931 



PCT/US97/19588 



971 

TGTTGAAGAA ACTTTAATCC AACCAACCTT TGTCTATGGA CATCCAGTAG CTGTATCTCC 1860 

ACTCCCTAAC AAAAATCCTG AAGACCAACG CTTTACTGAC CGTTTCGAGC TCTTTATCAT 1920 

GACTAAGGAG TACGGTAATG CCTTTACTGA GTTGAACGAC CCAATCGACC AACTTAGCCG 1980 

TTTTGAAGCC CAAGCTAAAG CCAAAGAACT TGGTGATGAT GAAGCGACAG GAATCGACTA 2040 

TGACTACATT GAAGCTCTTG AATACGCTAT GCCACCAACA GGTGGTTTGG GAATCGGTAT 2100 

CGACCGTCTC TGCATGCTCC TCACTGATAC AACAACTATC CGTGATGTAT TGCTCTTCCC 2160 

AACAATGAAA TAAATTCTTA TCCTCTGGGT CTTATCAGAG GATTTTTTGA TTCAAAAAGA 2220 

GACTGAATTT AAGGAGAAAA TGAAGTGTAG TATATTGAAA TTGAAATAGT ACACTTTGAT 2280 

TTCTAAGACA TTGTTAGAAA TTGGTTTAAA TTCCCTAAGC AATTTGTGCA TGTTTTATTT 2340 

CATTTTACGA TAGTACGCTG AAACTTTTCA AAAAGTACTA GAAATTGACT TGGATTCCCC 2400 

AATTGATTTG TTCAGATTCA CTATAAATAA AAAATTAATA ACTGGGATAG GAAGTTAGCG 2460 

TCAACTAGGA TAGTATCTTG CTTAAACAGT ATATATGGGA TTGATATAAG TCCATAGGTC 2520 

CTATTAGAGG ATGTTCTGGT GTCTTATTCA CTTGTTTTTr ATAGTATTAC TAGATAGAAT 2580 

CAGCAAATAA AAACCCAAAT CATTCATACC TCTCTCAACT AGATGTAACT T ACAAAACC C 2640 

CTGACCTCAT GAGCCACTTT CTTCCTCCTC ATGAGGTCAG TTTTACTTTC TGCTGTTCCA 2700 

GTATCGTTTT TCCTCGCTAG ATTTCCTCAA AAGGGCAGAC TCCTCCCTTG GTGCGTCACA 2760 

CGATTTTTTC ATCTCGACTG TTCTTTAATG CATCATTAAC GACGCTTTTC TTCTAGGTGG 2820 

TTCATAAGGA ACAGGAAGAT TCAGGTTGAC TTTTCTAATC CTAGAATAAA GTGCTGAAAA 2880 

CAATTCGGAA TAGGCATAGA GACTAGACAA TTTGAGGAGC TGCTTGCGTC CTGTTCGAAC 2940 

ACATTTTCCC ACCACGTGAA GAAAAAGATG GCGGAAGCGT TTGATTGTTA AAGTTTGGAA 3000 

GTCACCTCCA GCTAGATGTT TGAGAAAAAG ATAGAGATTG TAGGCGATAC AGCTCATCAT 3060 

CATACGAACT TCGTTTTTGA TTAACGTTGA ACTATCCGTT TTATCGCCAA AAAATCCCTC 3120 

CTTCATCTCC TTGATGAAAT TCTCGGCTTG ACCACGTCCA CGATAAAGCT GAAACTGGTC 3180 

TTGGCTTGTT CCACTCGTCA TATTTGTAAC GAGAGAAATA ACATCGTAGA AC 3232 

(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10711 base pairs 
IB) TYPE: nucleic acid 
fC) STRANDEDNESS: double 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 
CCGGAGAAAA TGATGAAAAG TTCAAAACTA TTTGCCCTTG CCGGCGTGAC ATTATTGGCG 
GCGACTACTT TAGCTCCATG CTCTGGATCA GGTTCAAGCA CTAAAGGTGA GAAGACATTC 
TCATACATTT ATGAGACAGA CCCTGATAAC CTCAACTATT TGACAACTGC TAAGGCTGCG 
ACACAAATAT TACCAGTAAC GTGGTTGATG GTTTGCTAGA ARATGATCGC TACGGGAACT 
TTGTGCCGTC TATGGCTGAG GATTGGTCTG TATCCAAGGA TGGATTGACT TACACTTATA 
CTATCCGTAA GGATGCAAAA TGGTATACTT CTGAAGGTGA AGAATACGCG GCAGTCAAAG 
CTCAAGACTT TGTAACAGGA TTAAAATATG CTGCTGATAA AAAATCAGAT GCTCTTTACC 
TTGTTCAAGA ATCAATCAAA GGGTTGGATG CCTATGTAAA AGGGGAAATC AAAGATTTCT 
CACAAGTAGG AATTAAGGCT CTGGATGAAC AGACAGTTCA GTACACTTTG AACAAACCAG 
AAAGCTTCTG GAATTCTAAG ACAACCATGG GTGTGCTTGC GCCAGTTAAT GAAGAGTTTT 
TGAATTCAAA AGGAGATGAT TTTGCCAAAG CTACGGATCC AAGTAGTCTC TTGTATAACG 
GTCCTTATTT GTTGAAATCC ATTGTGACCA AATCCTCTGT TGAATTTGCG AAAAATCCGA 
ACTACTGGGA TAAGGACAAT GTGCATGTTG ACAAAGTTAA ATTGTCATTC TGGGATGGTC 
AAGATACCAG CAAACCTGCA GAAAACTTTA AAGATGGTAG CCTTACAGCA CCTCGTCTCT 
ATCCAACAAG TGCAAGTTTC GCAGAACTTG AGAAGAGTAT GAAGGACAAT ATTGTCTATA 
CTCAACAAGA CTCTATTACG TATCTAGTTG GTACAAATAT TGACCGTCAG TCCTATAAAT 
ACACATCTAA GACCAGCGAC GAACAAAAGG CATCGACTAA AAAGGCTCTC TTAAACAAGG 
ATTTCCGTCA GGCTATTGCC TTTGGATTTG ACCGTACAGC CTATGCCTCT CAGTTGAATG 
GACAAACTGG AGCAAGTAAA ATCTTGCGTA ATCTCTTTGT GCCACCAACA TTTGTTCAAG 
CAGATGGTAA AAACTTTGGC GATATGGTCA AAGAGAAATT GGTCACTTAT GGGGATGAAT 
GGAAGGATGT TAATCTTGCA GATTCTCAGG ATGGTCTTTA CAATCCAGAA AAAGCCAAGG 
CTGAATTTGC TAAAGCTAAA TCAGCCTTAC AAGCAGAAGG AGTCCAATTC CCAATTCATT 
TGGATATGCC AGTTGACCAA ACAGCAACTA CAAAAGTTCA GCGCGTCCAA TCTATGAAAC 
AATCCTTGGA AGCAACTTTA GGAGCTGATA ATGTCATTAT TGATATTCAA CAACTACAAA 
AAGACGAAGT AAACAATATT ACATATTTTG CTGAAAATGC TGCTGGCGAA GACTGGGATT 
TATCAGATAA TCTCGGTTGG GGTCCAGACT TTGCCGATCC ATCAACCTAC CTTGATATTA 
TCAAACCTTC TGTAGGAGAA AGTACTAAAA CATATTTAGG GTTTGACTCA GGGGAAGATA 
ATGTAGCTGC TAAAAAAGTA GGTCTATATG ACTACGAAAA ATTGGTTACT GAGGCTGGTG 
ATGAGACTAC AGATGTTGCT AAACGCTATG ATAAATACGC TGCAGCCCAA GCTTGGTTGA 
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CAGATAGTGC TTTGATTATT CCAACTACAT CTCGTACAGG GCGTCCAATC TTGTCTAAGA 1800 

TGGTACCATT TACAATACCA TTTGCATTGT CAGGAAATAA AGGTACAAGT GAACCAGTCT 1860 

TGTATAAATA CTTGGAACTT CAAGACAAGG CAGTCACTGT AGATGAATAC CAAAAAGCTC 1920 

AGGAAAAATG GATGAAAGAA AAAGAAGAGT CTAATAAAAA GGCTCAAGAA GATCTCGCAA 1980 

AACATGTGAA ATAACTGTTG CAAAATATAA GAAAGGATTT AGTATTTCCC TTGAATGCTG 2040 

AATCCTTTTT TACATTTGTA AAGAAAGATT CTAAAATGTA CGGACCCCCA AAAGTTGGAG 2100 

CCTCTTTTTG TCAGAATAGA GAAAATTTTT GTTAATTTTA CTTGTTTCCT ATTGCTTTCT 2160 

CAGCTATTAT TTGTTATATT AAAAGTATAA TTATTTTTTA TTTATCAGAG TTAAGCATTG 2220 

CACTTTCAGA GGAAGGACTA TTTTTTAAAA AGAAAATGTA AACGTTTGCT CAAAAATGAA 2280 

AGGATTTAGA AGTTTATGAA TAAAGGATTA TTTGAAAAAC GTTGTAAATA TAGTATTCGG 2340 

AAATTTTCAT TAGGTGTTGC TTCTGTTATG ATTGGAGCTG CATTCTTTGG GACAAGTCCG 2400 

GTTCTTGCAG ATAGCGTGCA GTCTGGTTCC ACGGCGAACT TACCAGCTGA TTTAGCTACT 2460 

GCTCTTGCAA CAGCAAAAGA GAATGATGGG CGTGATTTTG AAGCGCCTAA GGTGGGAGAA 2520 

GACCAAGGTT CTCCAGAAGT TACAGATGGA CCTAAGACAG AAGAAGAACT ATTAGCACTT 2580 

GAAAAAGAAA AACCGGCTGA AGAAAAACCA AAAGAGGATA AACCTGCAGC TGCTAAACCT 2640 

GAAACACCTA AGACGGTAAC CCCTGAATGG CAAACGGTAG CGAATAAAGA GCAACAGGGA 2700 

ACAGTCACTA TCCGAGAAGA AAAAGGTGTC CGCTACAACC AACTATCCTC AACTGCTCAA 2760 

AATGATAACG CAGGCAAACC AGCCCTGTTT GAAAAGAAGG GCTTGACCGT TGATGCCAAT 2820 

GGAAATGCAA CTGTTGATTT AACCTTCAAA GATGATTCTG AAAAGGGCAA ATCACGCTTT 2880 

GGTGTCTTTT TGAAATTTAA AGATACCAAG AATAATGTTT TTGTCGGTTA TGACAAGGAT 2940 

GGCTGGTTCT GGGAGTATAA ATCTCCAACA ACTAGCACTT GGTATAGAGG TAGTCGTGTT 3000 

GCTGCTCCTG AAACAGGATC AACAAACCGT CTCTCTATCA CTCTCAAGTC AGACGGTCAG 3060 

CTAAATGCCA GCAATAATGA TGTCAATCTC TTTCACACAG TGACTCTACC AGCTGCGGTC 3120 

AATGACCATC TTAAAAATGA GAAGAAGATT CTTCTCAAGG CGGGCTCTTA TGACGATGAG 3180 

CGAACAGTTG TTAGCGTTAA AACGGATAAC CAAGAGGGGG TAAAAACAGA GGATACCCCT 3240 

GCTGAAAAAG AAACAGGTCC TGAAGTTGAT GATAGCAAGG TGACTTATGA CACGATTCAG 3300 

TCTAAGGTCC TCAAAGCAGT GATTGACCAA GCCTTCCCTC GTGTCAAGGA ATACAGCTTG 3360 

AACGGGCATA CTTTGCCAGG ACAGGTGCAA CAGTTCAACC AAGTCTTTAT CAATAACCAC 3420 

CGAATCACCC CTGAAGTCAC TTATAAGAAA ATCAATGAGA CAACAGCAGA GTACTTGATG 3480 
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AAGCTTCGCG ATGATCCTCA CTTAATCAAT GCGGAAATGA CAGTACGCTT GCAAGTTGTA 3540 

GACAATCAAT TGCACTTTGA TGTGACTAAG ATTGTCAACC ACAATCAAGT CACTCCAGGT 3600 

CAAAAGATTG ATGACGAAAG CAAACTACTT TCTTCTATTA GTTTCCTCGG CAATGCTTTA 3660 

GTCTCTGTTT CTAGTAATCA AACTGGTGCT AAGTTTGATG GGGCAACCAT GTCAAACAAT 3720 

ACGCATGTCA GCGGAGATGA TCATATCGAT CTAACCAATC CAATGAAGGA TTTGGCTAAG 3780 

GGTTACATGT ATGGATTTGT TTCTACAGAT AAGCTTGCTG CTGGTGTTTG GAGTAACTCT 3840 

CAAAACAGCT ATGGTGGTGG TTCGAATGAC TGGACTCCTT TGACAGCTTA TAAAGAAACA 3900 

GTCGGAAATG CCAACTATGT AGGAATCCAC AGCTCTGAAT GGCAATGGGA AAAAGCTTAT 3960 

AAGGGCATTG TTTTCCCAGA ATACACGAAG GAACTTCCAA GTGCTAAGCT TCTTATCACT 4020 

GAAGATGCCA ATGCAGACAA GAACGTTGAT TGGCAAGATC GTGCCATTGC TTATCGTAGC 4080 

ATTATGAACA ATCCTCAAGG TTGGGAAAAA GTTAAGGATA TCACAGCTTA CCGTATCGCG 4140 

ATGAACTTTG GTTCTCAAGC ACAAAACCCA TTCCTTATGA CCTTGGATGG TATCAAGAAA 4200 

ATCAATCTCC ATACAGATGG TCTTGGGCAA GGTGTTCTCC TTAAAGGATA TGGTAGCGAA 4260 

GGCCATGACT CTGGTCACTT GAACTATGCT GATATTGGTA AGCGTATCGG TGGTGTCCAA 4320 

GACTTCAAGA CCCTAATTGA GAAGGCTAAG AAATATGGAC CTCATCTAGG TATCCACGTT 4380 

AACGCTTCAG AAACTTATCC TGAGTCTAAA TACTTCAATG AAAAAATTCT CCGTAAGAAT 4440 

CCAGATGGAA GCTATAGCTA TGGTTGGAAC TGGCTAGATC AAGGTATCAA CATTCATGCT 4500 

GCCTATGACC TAGCTCATGG TCGTTTGGCA CGTTGGGAAG ATTTGAAGAA AAAACTTGGT 4560 

GACGGTCTCG ACTTTATCTA TGTGGACGTT TGGGGTAATG GTCAATCAGG TGATAACGGT 4620 

GCCTGGGCTA CCCACGTTCT TGCTAAAGAA ATTAACAAAC AAGGCTGGCG CTTTGCGATC 4680 

GAGTGGGGCC ATGGTGGTGA GTACGACTCT ACCTTCCATC ACTGGGCAGC TGACTTGACC 4740 

TACGGTGGCT ACACCAATAA AGGTATCAAC AGTGCCATCA CCCGCTTTAT CCGTAACCAC 4800 

CAAAAAGATG CTTGGCTAGG GGACTACAGA AGTTATGGTG GTGCAGCCAA CTATCCACTG 4860 

CTAGGTGGCT ACAGCATGAA AGACTTTGAA GGCTGGCAGG GAAGAAGTGA CTACAATGGC 4920 

TATGTAACCA ACTTATTTGC CCATGACGTC ATGACTAAGT ACTTCCAACA CTTCACTGTA 4980 

AGTAAATGGG AAAATGGTAC ACCGGTGACT ATGACCGATA ACGGTAGCAC CTATAAATGC 5040 

ACTCCAGAAA TGCGAGTGGA ATTGGTAGAT GCTGACAATA ATAAAGTAGT TGTAACTCGT 5100 

AAGTCAAATG ATGTCAATAG TCCACAATAT CGCGAACGTA CAGTAACGCT CAACGGACGT 5160 

GTCATCCAAG ATGGTTCAGC TTACTTGACT CCTTGGAACT GGGATGCAAA TGGTAAGAAA 5220 

CTTTCTACTG ATAAGGAAAA GATGTACTAC TTCAATACGC AGGCCGGTGC AACAACTTGG 5280 
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ACCCTTCCAA GCGATTGGGC AAAGAGCAAG GTTTACCTTT ACAAGCTAAC TGACCAAGGT 5340 
AAGACAGAAG AGCAAGAACT AACTGTAAAA GATGGTAAAA TTACCCTAGA TCTTCTAGCA 5400 
AATCAACCAT ACGTTCTCTA TCGTTCGAAA CAAACTAATC CTGAAATGTC ATGGAGTGAA 54 60 

GGCATGCACA TCTATGACCA AGGATTTAAT AGCGGTACCT TGAAACATTG GACCATTTCA 5520 
GGCGATGCTT CTAAGGCAGA AATTGTCAAG TCTCAAGGGG CAAACGATAT GCTTCGTATT 5580 
CAAGGAAACA AAGAAAAAGT TAGTCTCACT CAGAAATTAA CTGGCTTGAA ACCAAATACC 5640 
AAGTATGCCG TTTATGTTCG TGTAGATAAC CGTAGTAATG CCAAGGCAAG TATCACTGTG 5700 
AATACTGGTG AAAAAGAAGT GACTACTTAT ACCAATAAGT CTCTCGCGCT CAACTATGTT 5760 
AAGGCCTACG CCCACAATAC ACGTCGTGAC AATGCTACAC TTGACGATAC AAGTTACTTC 5820 
CAAAACATGT ACGCCTTCTT TACAACTGGA GCGGACGTCT CAAATGTTAC TCTGACATTG 5880 
AGTCGTGAAG CTGGTGATCA AGCAACTTAC TTTGATGAAA TTCGTACCTT TGAAAACAAT 5940 
TCAAGCATGT ACGGAGACAA GCATGATACA GGTAAAGGCA CCTTCAAGCA AGACTTTGAA 6000 

AATGTTGCTC AGGGTATCTT CCCATTTGTA GTGGGTGGTG TCGAAGGTGT TGAAGATAAC 6060 

CGCACTCACT TGTCTGAAAA ACACAATCCA TATACACAAC GTGGTTGGAA TGGTAAGAAA 6120 

GTCGATGATG TTATCGAAGG AAATTGGTCA CTCAAGACAA ATGGACTAGT GAGCCGTCGT 6180 

AACTTGGTTT ACCAAACCAT CCCACAAAAC TTCC G TTTTG AAGCAGGTAA GACCTACCGT 6240 

GTAACCTTTG AATACGAAGC AGGATCAGAC AATACCTATG CTTTTGTAGT CGGTAAGGGA 6300 

GAATTCCAGT CAGGTCGTCG TGGTACTCAA GCAAGCAACT TGGAAATGCA TGAATTGCCA 6360 

AATACTTGGA CAGATTCTAA GAAAGCCAAG AAGGCAACCT TCCTTGTGAC AGGTGCAGAA 6420 

ACAGGCGATA CTTGGGTAGG TATCTACTCA ACTGGAAATG CAAGTAATAC TCGTGGTGAT 6480 

TCTGGTGGAA ATGCCAACTT CCGTGGTTAT AACGACTTCA TGATGGATAA TCTTCAAATC 6540 

GAAGAAATTA CCCTAACAGG TAAGATGTTG ACAGAAAATG CTCTGAAGAA CTACTTCCCA 6600 

ACGGTTGCCA TGACTAACTA CACCAAAGAG TCTATGGATG CTTTGAAAGA GGCGGTCTTT 6660 

AACCTCAGTC AGGCCGATGA TGATATCAGT GTGGAAGAAG CGCGTGCAGA GATTGCCAAG 6720 

ATTGAAGCTT TGAAGAATGC TTTGGTTCAG AAGAAGACGG CTTTGGTAGC AGATGACTTT 6780 

GCAAGTCTTA CAGCTCCTGC TCAGGCTCAA GAAGGTCTTG CAAATGCCTT TGATGGCAAT 6840 

GTGTCTAGTC TATGGCATAC ATCTTGGAAT GGTGGAGATG TAGGCAAGCC TGCAACTATG 6900 

GTCTTGAAAG AACCAACTGA AATCACAGGA CTTCGCTATG TTCCGCGTGG ATCAGGTTCA 6960 

AATGGTAACT TGCGAGATCT GAAACTTGTT GTGACAGATG AGTCTGGCAA GGAGCATACC 7020 
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TTTACTGCAA CTGATTGGCC AAATAACAAC AAACCAAAAG ATATTGACTT TGGTAAGACA 7080 

ATCAAGGCTA AGAAAATTGT CCTTACTGGT ACCAAGACAT ACGCAGATGG TGGAGATAAA 7140 

TACCAATCTG CAGCGGAACT TATCTTTACT CGTCCACAGG TAGCAGAAAC ACCTCTTGAC 7200 

TTGTCAGGCT ATGAAGCAGC TTTGGTTAAG GCTCAGAAAT TAACAGACAA AGACAATCAA 7260 

GAGGAAGTAG CTAGCGTTCA GGCAAGCATG AAATATGCGA CGGATAACCA TCTCTTGACG 7320 

GAAAGAATGG TGGAATACTT TGCAGATTAT CTCAACCAAT TAAAAGATTC TGCTACGAAA 7380 

CCAGATGCTC CAACTGTAGA GAAACCTGAG TTTAAACTTA GATCTTTAGC TTCCGAGCAA 7440 

GGTAAGACGC CAGATTATAA GCAAGAAATA GCTAGACCAG AAACACCTGA ACAAATCTTG 7500 

CCAGCAACAG GTGAGAGTCA ATCTGACACA GCCCTCATCC TAGCAAGTGT TAGTCTAGCC 7560 

CTATCTGCTC TCTTTGTAGT AAAAACGAAG AAAGACTAGT ATTTAGTAAA ACCTCTTAAC 7620 

AAGATTACGG AAGCAGTCTC TATCTTTTCC AATGAGGTTT ATAGTACAGA AAAAGCCTGA 7680 

GAAGATGTCT TCTCAGGCTT TTGTTAAGCA CATAAATACA ATAGTGCTAT GACAAAATCA 7740 

CCCAGAAAAA TCTGGGTGAT AAATGTTATG GTTGTGCTGG TTGAGGATTC TGATTTTGTT 7800 

GATCAGGGGT TGTATTTGAT TGTTGCGTAT TATTGTTAGG ATTGGTAGTC GTACTATTAT 7860 

TTGTGCTTGG AGTGGTTGAG CTAGACTGTG AAGTTGAACT ATCTGATGAT GAGCTTGAAC 7920 

TTTCAGTTGA TGGGGGTTGT TGTGGAGCAG GTGAGTTCCA CGTAGAACGA GCACCATTTT 7980 

TAAATACGAA TTCTCCATTT CTGTAGAGCC CCTCTGGTAT ATTCCAATCT TCTGGATTGC 8040 

TTCCTTCAGA CAGGTAGGTC ATCATAGAGC GGTAAACTTT GGCAGCGACC GTAAGGCCAT 8100 

TGCCTACAAG TGGTGTCAGA CGGTTAGAAT AGCCTGTCCA TACAGCCATT GAATATTTAC 8160 

GCGTATAGCC AGCAAATAGT TCATCAGGTG CTACAAATTG AGAGCTCTTG ATGTGGTTTT 8220 

CAATTTCCTC GTCTGTATAC TTAGAGGTTC CTGTTTTACC AGCCTGAGGG AGCCAAGCAA 8280 

GATAGGCATT TCGTCCAGTT CCATAAGTCA AGACTGTTTT CATCATGTCG GTCATCATAT 8340 

AGGCTGTCGT TTCCTTCATG GCACGAGTTC CGACATTAGA GAACTCTTTT TCACTCCCAT 8400 

CACTAAAGAC GACTTTATGG ATATACATTG GTTTATAGTA AGTTCCACCA TTTGCAAAGG 8460 

CAGCGTAAGC AGCAGCCATC TTTTCACTAC TTGCTCCATA 1 TlT f 1V1V1 ' GATTCGGTTG 8520 

TGTTACTTGA AATGGCATTT GAGTACTGAA TACTTGGGTA GTCGATTCCT AGACCATTTA 8580 

GGAAAGTCTT GGCGCGGTTG AGTCCGACCT TGTTTAGAGT TTCCACGGCT GGGACGTTTC 8640 

GCGATTGTTG CAGGCCGTAT TGCAAGGTGA TGTTGCCAAA GTAGCCCCTA TCCCAGTTAT 8700 

AAACAGGAGT ATTTGTCCCA GGCTAGTTAT AGGGCTCATC GTGAACGATA GTAGCACTTG 8760 

AATCGTAGAC ACCGTACTCC AAGGCAGGAG CATACTCTCT GATCGGTTTC ATAGTTGATC 8820 
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CCCAGTCGCG GTTTGTTTCT ACTGCTTGGT TAATTCCGAA GGAAACATTA CTTGACTGAT 8880 

GGCGTGCTCC TAGCTGGGCA ATGACTTTAC CGTTAGAAAC ATCAACAATG GTAGAAGCGA 8940 

CTTGCAATTC ATCGTCTGGA TAGGCAACGT ATTCGTCTGT ATTGTAAATA TCCCACAGAT 9000 

GTTTTTGAGC TTCTTGGTCT ACATTTGTGT AGACATCCAT CCCAGTTGTG AGTAGGTTAT 9060 

AGCCTGTTTC TTCTTCAACT TGATTGATGA CTTCCTTGAG GTAATTATCC ATGTAACCAG 9120 

GGTAATTACT TGCTGATTTG AGACTTTGTA GTCCATCAGT AATTGGTGTA TTGACTGCTT 9180 

TCTCATACTG TTCAGCAGAG ATGTAGCCTT GATTTTTCAT TTCAGATAAG ACCAACTTTC 9240 

GGCGGTCTTG GGCTGCTTCT GGATGTGAAT AGGGGTCATA TTGGTTTGGT GCCTGAGGCA 9300 

TTCCAGCCAG CAAGGCTAAC TGAGGTAAAC TTAAATTATT GAGGTCTTTA CCATAGTAGT 9360 

TTTGAGCTGC TGTCTGCATT CCATAGTTCC CATTAGACAT GTAGACCTTA TTTATATAGT 9420 

AGGTCAAGAT TTCTTGCTTG GTTGCTTTTT GTTCTAACTG AATCGCTAAC CAAGCTTCCT 9480 

GAGCCTTACG AGAAATAGTC TGGTCGGAAG TCGAAGTTGA AAAGTAAGTC AACTTAATCA 9540 

ACTGTTGGGT GAGAGTTGAT CCACCTTGGA GGGAATTGCT TTGCAGATTG CGCAAGAAAG 9600 

CTCCCAGCAT ACGGATGGTA TCAATCCCCC TGTGGTCGAA GAAGCGATGG TCTTCGATAG 9660 

AAACGATTGC CTTAACCAAA TCTGTGGGAA TATCATTAGC TTGGGCATTG ACGCGGCGTT 9720 

CAGAACCCAA GTCAGCAATG AGTTGATTTT TATTGTCGTA GATTTTACTA GAAGTTGTTG 9780 

CAACTAGTTT ACTCTCGGAT AGGCTAGGAG CCTTGCTAAC GTAGTAGAAA AAAACTCCTC 9840 

CGCCTAAGAC AATGGCTGCG ATAACCAAGC TTAAGAAGCT AATGCTCAGA TACTTGATTA 9900 

6GCGCAGAAT CGTTGGTTTG TTCATCTTGT TTTACCACCT AATAAATGTT CTTTGATAAC 9960 

ATTGAGATAA GGAATTTGAG GGAAGGCACC AGCCTTGATT TCATATCCAT ATTCTCGAAT 10020 

ATATTCAAGT GGCATTGATT TTTGTCCCTT ATCTTGATGA TAGAAGCGAA TCAAATCGAA 10080 

TGCCGGCAAT AAGTACGTTT CTTGCTGAGA AGAAAAGTGA AGAAGGACAA AGCAGATTCC 10140 

TTGTTGGGCA AGGACTTGTT CCATATGCTG AATCTGATGT GGATGAAAAT TTTTCATCGG 10200 

AATCGCACGT TTTTGTTTTG TTTCCTTGAC TTCAAAGTCG ATGTAATATC CATTATAAAC 10260 

GCCAGAATAG TCCGTCGTTG AAGCTTGTCG AAAATAGGCT TCAACAATCT TGGCACGACT 10320 

TCGTTGTGGA TAGTCCACTT GTACGATTTG AATAGGAGTT GGTTTCTTAT GTATAACAGC 10380 

CAACCCCTGA GACAAATAGT ACTCGTTGGT AGCATTGATC ATCTTTTCAA AGCGTACCGA 10440 

GCTCGAATTC GTAATCATGT CATAGCTGTT TCCTGTGTGA AATTGTTATC CGCTCACAAT 10500 

TCCACACAAC ATAOGAGCCG GAAGCATAAA GTGTAAAGCC TGGGGTGCCT AATGAGTGAG 10560 
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CTAACTCACA TTAATTGCGT TGCGCTCACT GCCCGCTTTC CAGTCGGGAA ACCTGTCGTG 
CCAGCTGCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTCCGTA TTGGGCGCTC 
TTCCGCTTCC TCGCTCACTG ACTCGCTGCG C 
(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11887 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEONESS: double 

(D) TOPOLOGY: linear 



10620 
10680 
10711 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 
TACATTCATT CCATCGGCTA CTCCATAATA CTTAGATAAA ACCATAGCTG AAGTCGAATA 
CGGATACTGT AAAGTATTAT CAATTTTAAT CAAATCATCA TTACCGATAA TACTTCTGAT 



TGCTTTTGGT 
TCTGGACCTT 
CTCAAAGCAG 
CCCAATTTTT 
CTATTCTAAA 
CTCCATTATC 
AGAGGTACTT 
TAAAGCATGT 
CTTTATTGAT 
GAGTATTGGT 
TATTATCTGA 
GCCGCAGTTG 
ATTTTCGAAG 
TACTACTCGA 
TAGCTTAGGC 
CACATATTGG 
AATCTTTGTT 
CAATACCTAG 
AGTCTTTTGC 



AGTATGAACC ATACGTTGGT GAAATCTCAG ATAATGAAGA ATCATTAGAC 
TTTCTAGTGT CTCACTTACC TCATATTCTT CACCCTTACT AGAAATAACA 



ATACTGTCGA 
TATAAACAGT 
TTTCTTAATC 
ACATAAACAG 
CCTTGCTTCT 
TACCCAAGTG 
AAAAAATTCT 
CTTCTCTTTT 
TAGGCTCCAT 
CTCAAAACAC 
AGTATTAAGA 
TCTAGGATAT 
AAATCGGTTT 
TATTTTCTAG 
TTTTTCGCTT 
AAAAGAAAAT 
TGCTTCTAGC 



TAACTGGCTA GCCAATAAAG TACTCGCAAT AATTGAAATA 



TTTCTTCATT ATTGTATCCT 
TACTATAGAA TCAAGAAATC 



CCTAATGTAA TTATAGCGTA 
TACCACCTTC TTTAAATACC 



GTAAACTTTT CAATTAATGA CTGCGCTTTT CAATCACGCT 



TTGATACTAA GTTCAGCCAT 
GGATTCGTTT TGGAGTAGTC 
GTTTCGTTCA AGTTATGAAG 
CTTAACAACT GGTGGTCAAT 
TTTATACTCA ATGAAAATCA 



TCTTTCCTTG TTTTTCTCAA 
TCCCAGAGTC CAGCCAATGG 
GAGAATCTTT TCCATTAATT 
AGCGACACGT CTCAGCCAGA 
AAGAGCAAAC TAGGAAGCTA 



TGTTTTGAGG TTGCAGATAG AGCTGACGTG GTTTGAAGAC 



TTATTTCTTC TAGTTCAGGG 
CTACCGTGTC CCACAAGGAT 
CCTTTAGATA AGACTGCATT 
GATCCTTTTC CCAGCAAGTG 
CTGGAAAATA TTTTATAGAG 
TGATGGCGCA TATAGGCTTC 
TCCTCAAGTA AATCTGCTAA 



TGTTCATACA 
TTTGTCACGA 
GCTTTCAAAT 
TCTGCAAAAT 
TTTATTTCTT 
CATGGACCTT 
ACTCATCTAA 



CCAAACTCCC 
CTAACTGCTC 
AGTTAGCAGC 
CCCAATCCAT 
TCAGGCACCC 
GCTTTTTTAG 
AACTCCTCTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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CCCCCACCAA ATGGTGCTGA AAGGCATAGA CAGCCGCCTG GCTACGATCG CTGACTTCAA 1320 

GTTTGGCAAG AATATTGGAC ACGTGGCTCT TGACCGTCTT GAGAGAGATA AAGAGGTCAT 1380 

CTGCGATGCG CTGATTTTCG TAGCCCTTGG CGATGAGTTG GAGAACATCT CGCTCACGCG 1440 

CAGTCAATTC TTCATGAAGT TCCATATGAT TGCGGTGGTA TTCAACCTTC TTGCTAACCT 1500 

CTTGCTCAAT GGCCAGCTCG CCAGCAGCTA CCTTACTGAC GGCATGAAGC AATTCATCTG 1560 

CACTAGAAGT CTTGAGCATA TAGCCTTTGG CACCAGCATC TAAGACTGGC ATGATTTTTT 1620 

CATTGTCCAA ATAAGAGGTC ACAATCAAAA TCTTGGCTTC AGGCCATTCT TTAAGGATTG 1680 

CTAAGGTCGC GTCAATCCCA TTCATCTCAG CCATGACAAT ATCCATGACA ATGACATCTG 1740 

GACGCAGTTC CAAGGCCAAG TCAATCCCTT GAGACCCGTT GGACGCCTCA CCCACAACTT 1800 

CTACATCGTC TTGGAGGTCA AAGTAGCTTT TCAAGCCCAA TCGGACCATT TCATGGTCAT I860 

CTACTAGTAA AATTTTCATC TTTACTCCTT TATCATTCCT TATCTAACAG GGGAATACGG 1920 

ATATCAACCG CCAGCCCTTG CTTGGGAGCT GTCAAGAGTT GAACTGTTCC AGCCATATCT 1980 

TCAACCCGCT CCTTGATATT TCGCAGTCCA TAACTCAAGT CGTCTAAGCT CCCTAACTGG 2040 

AAACCAATCC CATTGTCCAC CACCTTCAGT TGCAATTCAA CATCTGTCTG ATAGAGGTAG 2100 

ACATCTAGGC AAGATGCCTG GGCATGGCGG AGGGTATTGC TAATCAACTC TTGCAGGATA 2160 

CGGAAGATAT GCTCCTCGAT TTTCTTAGGC AATTTCGTCA TATTCTGCTT GAGACTAACC 2220 

CTAAGATCAC TCTTGTCCTC AAGCTCTTTT AAAAGAATTT GAATCCCTTC TATCAAGCTC 2280 

TTCTGCTCCA GTTCAACTGG TCGCAAATGC AAGAGCAAAA CCCGCAAATC CTTCTGGGCT 2340 

GTTTCTAAAA TAGCTGTGAC ACTCTGCAAC TGGGTCTGCA TCTTTTCTCT ATCCAATTTC 2400 

AAAGCCTGCT GACTGATACC CGATAAAATC ATGTGGGCCG CAAACAACTC CTGACTGACT 2460 

GTATCGTGCA AATCCCGAGC AATTCCCTTC CGTTCCTTCT CGATGATTTC CTCTTCCTGA 2520 

GCAAGGCTCT GATTTTCAGC TTTTTGAAGA GCCTCTGTCA AAAGGTTAAG TTTACCTGAT 2580 

AAGGACTTGA AACTGGCATC CAAATCTGGA TCTGCAACCT GAACCACTTC TTGCCCTGCT 2640 

AATAAACGCT TGAGATTAGC CTGCATTTTT CTTAGAGAAA GCTCTTCGAT CCCTCGCCAA 2700 

AACAGGGCTA AGAGACAGGT CATGGACATG CTGAAAACCA ACAATAAAAA GACAAATTTT 2760 

TCTGTTTTTT CGACATCGTG CAAAAAGATA GACCAGTCAA AATCAAGTAT TTCCAGCAAG 2820 

CTGTGGGAGA AAAAAAAGAC AAATAGGAAG GAGGTGAGAG CAATAATGAC ATAGGCTTGT 2880 

TTTTTCATCC TCTAACCACC TCCACATCAC CAATCATAGT GGTCAAGAAA ATCTTGACAC 2940 

TCTTGTTACT CTTGAGATAG TCTTTTGTTT CTTGATGATA GTGTTCATTG CGGAGGGCTC 3000 
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GCTTGGGCTG GTTGAAAAAA ATCAAATCCC CATAGAGACA GTTAACGCTG AGACTCACTT 3060 

CCACATCTAC AGGTACGATG ATTTTGGTCC TTCCTACCAT CTTTCTGAGG ATAATGACAT 3120 

TGTCATGATT GGTTAAGATG ACCCTCTCCA GATGAATAGT GTCCTTGCCC ATGAAGCGAA 3180 

AGAGATTGAT ATCATCGAAT TGGCAAGTCT GGTAGCTTGA AAAATGATGA AGATTTCCAA 3240 

ACCAACGATT TTTCTCCTTC TTAACCGTCA CGACCTCTTC AAAAACCAAA TTGGTCTGCT 3300 

CTTTTTCCTG GTTCATCATC GGGTAAAGAA GAAAGAGGCT ATAGATAACC GCAACAAAAA 3360 

TAGCTAGAAT CACAAAAGGA TTGAGCATAA CGATGAAAAA GAAGAGAATG GTTGCCGCTA 3420 

CTAAAAGAAG ATTATTTCCC TCTTTACCAG TGTAGTAGCG AATCAAAAGC AAAAAGAGGA 3480 

ATAGTATCAG CAGAAAACGC GAAAAATGCT CTGATACCAT CAAAATCAGA GCTCCTGTCA 3540 

GAAGACAGGC TTCGATAAAT AAAAAGATTT TAAATTTTCT CATAGGTTCA TCCTCTCCCT 3600 

TCTATTTTAT CACAATTCAA AAAAGTCACC TCAGTCTCAG GATGGAAAAA AGGCGCTGGT 3660 

TACGCCTTTT TCATCTGATC CTTTGCTTCT TTTAATTTTC CATAAAGAAG ATAGTCTACT 3720 

TTTTGTAGAT CTGCTATGGT GGCACAGTTA AGGGAACACA TAATCAAGCG TAGATCTGCT 3780 

TTCCAGCCTT GGACAATGCC AATCACTTCT TCAACTGTGT AGGTTTCAAC CAATTCCAGA 3840 

ACGGTTCGTG ACAATCCCAC ACCCTTAGCA CCAAAAACCA AGCACTTAAT CATATCCAGC 3900 

GGATTCCGAA CCCCTCCACT AACCAAGAGT TCGACCTTAT CTTTCCATTC TTGGGCATTG 3960 

AGAAGGCCCT GCATGGTAGA CTGACCCCAT TGATTGAGGT AATCACGCTG GCCACTACGA 4020 

CGGTTTTCGA TATAGGCAAA GCTGGTGCCA CCACGACCCG ATAGGTCCAC TGTACGAACA 4080 

CCGAATTCAT AGGCTCTTTC GATTGTCTTG GCATCCATTC CAAAGCCCAC TTCCTTGAGG 4140 

ACAATAGGAA CGGGAATTTG CTTGCTATAA TCTGCTAGAT GCGATTGCCA GCTTCTAAAC 4200 

TTCCTTTCTC CCTCGGGCAT GAGTAATTCC TGCATGACAT TGACATGCAC TTGCAATAGA 4260 

ACAGGATTCA TCTCTTCTAC AGTCTGAAGT CCTAACTCGA CAGGCTTGTC CAATCCAATA 4320 

TTCCTTCCAA GGAGGAGATT GGGATGACTA GACTTGACAG AAAAAGAATC ATCCGTTGGA 4380 

TTTTTGAGGG CTGCGCTATA AGAACCCGTT ACAAATAAAA TACCACAGCA TTCCGCCACC 4440 

TGAGCCAGCT TTTGATTGAT TTCTCTTCCC TTATTACTTC CACCAGTCAT GGCATTGATA 4500 

TAAAAAGGAA AGTCCCACTT TCGACCAGCA AACTCTGTCG AAAGATCGAT TTCATCCAGA 4560 

TTGTAAAGAG GCAAGGAAGA ATGAATCAGC TCCACCTCAT CAAAGCTATT ATAGGAACTT 4620 

TTCTGCTCAA GGGCATAGAG GATATGCTCG TCCTTACGAT TTGTCGTCAT GTCCTATCCT 4680 

TTCTTGATAT AAGAGCTCAA TCCCCAGATC GGCCCAACGA TTTTTTAAGG TTTTGGTTGA 4740 

TTGCGCATCA AAACTCAGGG CGATGCCACA GTCACCACCA CCAGCACCAC TACTCTTGGC 4800 
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AACGGTCTGC AAATCTTGAC TGGCTTCTTT CAACTGTCTA AGCAAAGGCG TGTAAATATC 4860 

TGTACTCAAC CCTTCTAAAA GCTTGCTGGC TACTTCTACT TCATCGATAA TCTTTTCTGA 4920 

TTTCCCCTGT TCCAAGGCTT CTACCAGAGA AGTCACCGTT TCTTTTGAGG AAGTTAAAAA 4980 

ATTTTGATTG ATATTTTGCT TGATTTGCTG GACCATGTGA CTCGATACAG CCACTTCCTT 5040 

GGTCCATCCC ACTAAGAAAT CACATTCTAA AGTTGGTTTC ACTTGTGAAA TTGAAAAGCC 5100 

CCAATCACGC TCCAGAACTG TCGCCAAGTT TTCTTCTTCT AACCAAGCAG CCACCTTCTG 5160 

GCGATCAAAT GACTGGTAGA GAACCAAATC CTCTGCCACA ATACAGGCAA GGTCGCCCAT 5220 

GGAACCATTG TCTCCTCGCT TAAGCAAGAC AGCGCTAGTC AGCTTGAACA AGAGCTCCTG 5280 

ATCAACAGAA ACATCATACA GAGCCAGTAA AGCCTTGACA ACCAAGACAA CGACGCTGCC 5340 

ACTAGAACCT AGACCAAACT TTTTCCCTTC TCGTTCCATT TTGCCACACA TTTCTAGAGA 5400 

AAAAGGTCTT AAATTCTGAC CACGAACAGC GAGGAAGTCT CCCATCAAAG CAATCGTTTC 54 60 

TTGAATCAAG CTATAGTCAG GATTAGGCCT TAAGTCCACT CCGAAATCAA ACATATCTGA 5520 

ATAGATACGG TAGCTGTCAG AAAAAGCAAT CTCAGCCCTC ATATAGATGG GAATATCCTT 5580 

TATCAAAGCT AACTGCCCTG GCTCTAAAAT AGCATATTCA CCTGCCCAAT AGAGTTTTCC 5640 

GCAAGTTTTA ACAGCAATCA TCTTGACTCA AATCCTTTGT TTTTGACACA ATCAAGCGAT 5700 

AACGATGACC GAAAATTTCT GATAAATGCT CCAAGTCTTT CTCCTGACAG AAGACCTTAA 5760 

CATTGGGACC AGCATCCATG GTAAAGTAGC AGGCCTCTCC TTTCTCACGA AGCTGGCGAA 5820 

CAAAGGCCAT AGCCTCATAA GAGGCATCCG TCAGATAAGA AAAGGCTGGA CTAGCAGTCT 5880 

TTCTCGTAGC ATGCATAGCC AGGGCATTTT TCTCCGTTAA TTCTCCAATC TTGGCAAAAT 5940 

CATTTTCCTT GAGATAAATC AGCATATCCT GATAG T CCTT CTCAGACTGA CGAACCCAGT 6000 

CGTCGAAAGT CGTCGAGGTT TCCACACAAA GTTTCATCCC GTCACGGCTA GAGATTGGTT 6060 

TTTTCTTGTC CTCTAGCACC AACATAATCA TAGCTAGTTT CAAGTCTGTC TCTACAGGGT 6120 

AAATTTCTCC ACTATCCTTA TCCCAGGCTC CTAGTGGTCC ATAAAAACTC CGAGAAGAAG 6180 

AACCTGAGGC AAATTTGGCT TCCTGTGCCA ACTGACTTCT ATCCAATCCA AGCTTGAAAT 6240 

AAGCATTACA AGCCTTGACC AGGGCGGACA AACCACTAGA ACTTGAGGAC AGACCCGCTG 6300 

CCGTAGGCAT ATTGTTTTGA GTATCGATAC GGACAAAGCC CTCACCAGCT GGACGATAAC 6360 

GGTCAATAAT CTTACTCATC TTGGCATGCT CGACCTCATT TTGTAGCTGA CCATTGATGT 6420 

AAAATTCGTC AGCTGTTACA TTGGCTGGTA AAGGCGACAA GGTCGTCTCT GTATACATAT 6480 

TTTCCAAAGT TAGAGAAATA CTGCTAGTAG CAGGCACCAT CTCTTTTTCT TTTTTCTTTC 6540 
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CCCAATATTT GATAATAGCA ATATTTGCGT AGGAACGTAC TGTTACAGCC TCTCTATCCA 

TGTCTGAACA GCTCCTTTCT CTTCTAATCT TTCTGCTACT TCTTGTGCGT GTGTCAAATT 

GGTTACCAAG GCTATGATAC AACCTCCTAG CCCACCACCG CTCATCTTGG CACCCAGAGC 

ACCATGGCTA AGAGTCGTTT CAACCAAAAA GTCTGCCTCA GGGCTACTGA CTCCAATTTC 

TTTTAAATGT AAATGCGCTT GACTGAGGAT TTGTCCCAGT CCTTCAGCAT CTTTTTGTGA 

AATCGCAACT TCTGCTTGCT GGGTTAATTC TCCCAAGCCA TGCAAAAACG CTAGCCCATC 

CTTGCCCTTA TTTTGAACCA CTTGGATGGC TTCACGAGTA TGACCATAAA CACCCGTATC 



GGCAATCACC 
AAAGCGAATA 
GGCAATCATT 
ATAGTAGTCA 
CCGTTTCTCA 
ATACTCCAGT 
ACTCTCTGCA 
AGGATAACCG 
GACACCAACT 
TACAAGTATT 
CCATTCACAA 
CATCTACGGT 
ATTGACCTGG 
AAGAGGAAAG 
CTCATTAACA 
AATCTTCGCC 
AGCTTAACAT 
GGAAAACGCC 
TGTAAAAATA 
TTGTTAGAAA 
TAATATTTGA 
TCTAACTAAA 
ATGAGCAACT 



AAATAGGCGG 
GGTTGGTCAC 
TCAGCTCGAT 
AATACTGCAC 
GGGATAGCCG 
GAGCCATAAA 
GGAACTACCT 
TAAACGACCG 
TTTTTTGTCA 
AATTCTTTCC 
TCGCTTCTTT 
CGTATTCACT 
AACCAACCTG 
TCGTGAGGGA 
AGTACTCGTT 
CATTCTTGAT 
TTTTTTGTGA 
TTATGAAGTA 
TATACTATAG 
TCGATTTGAC 
TAAGTTATCC 
GGATCCTATT 
ATTTTGGTTA 



ATAAATCCAT 
TAAGACAGGT 
TGACCAAGAT 
GAATGGCCGC 
AGTCAATCTC 
CCGCCATGGA 
TACAGGTCAC 
CATGTTCCCC 
TTTTTTCCTT 
TATCTATTTT 
CATTATTGAA 
GCCAAGGCAC 
TCGTCATAGA 
GCATGCGCCA 
TCGGCCATTT 
CGTTTAAATC 
ATACAGGTTC 
TGCTACGGGA 
TAGATTGAAA 
TGTCCTGATC 
TAAAAGTATT 
CAATTACTAG 
CAATGTCTAC 



CTCAAGTTCT GTAAATCCTA CGTTCTTGAT 
CTTAGCATCC AAACCACTAG GATTCATATG 
TTCTAGTACA TCATGAGGCA GATCAGCCTG 
TATGCTGATA GCCGCTGACG AACCCATCCC 
ACAACGAATG CAGGCTTCTG TGATATTCAA 
CAAGGTATCC TCCTCATAAA GGCGCCAAGG 



CTCCACCTCC AAAAGAGGCA 



TATTAAAATT ATCTTACTAT 
TTACTAGACG AAAAAACGTC 
ATTATATTTT CACAAAAAAA 
CCCATTCGCC ATTATAGTTG 
CTGAGCGCTA TAAGCGTAGT 
ACGACGAAAG AAACTCCATA 
TTGACAACCT GTTTTAGTGA 
ATAGGTGCGG TGTTTGGAGA 
AGTATCATAT GCTTTGCGTA 
TAAATAATCG ACCACGAAAA 
AAGTTATGCA CTTAATTTGA 



CTAGAATAGT ACACCTCTAC 
GATTTATCCT GTTATTATCT 
ATTATGTTGT TGTGTTATAG 
AACTATCACA TACTCAAGGT 
TAAATTTAAG TCAAACAAAT 



GGGAAATGGC 
GTGCCTGACC 
TTATTTTTCA 
GCGATTGTTT 
ACAGAATAGC 
ACCATCTGCC 
CCATTAAGTA 
CGTACAAAGT 
AATAGCGTTC 
TCATAACTCT 
TTTCTTAAGT 
CAATTCAAGA 
TTCTAAAATA 
CATTTTACTA 
ATTGATTGAA 
CAGCTCACAG 
AATTTAGTCA 



6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7S60 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 
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r\f\f\ I 1 AAAAA 


ft. ft IK "ft r "» n. 

► AATAGAGGAA 


. CATAAATATG 


ATTACAAAAC 


AGAATGTAAT 


AGTGTTCTAC 


8400 


nJ\ till! n\, I 


AGATAAAACT 


GTAAATTCTG 


AAGGAAGGAT 


CACTTCTTCA 


ACAGAATTTG 


8460 


WiAAi I l\AiT 


AAGTAATTTA 


TCATTCCAAC 


ACGGAATAGC 


TGGACTACTG 


TTTCCTCTAA 


8520 


A 1 AAA riVTA 


CCCCCCAGAA 


CTGCATTCTA 


AAATACTCTC 


TATCATCAAG 


AAGGCAGTGA 


8580 


UMl iAl»AAC 


GACACACACA 


TATGAATATC 


AATACTCACT 


GCTATTTGGT 


GATGCAGGCT 


8640 




ACTCCTACAT 


TTATTTTCTA TCAGTAAAAA 


TCAATACTAT 


CTACAATTAG 


8700 


WAAACGTCAC 


.^^^^^ft#Vft ft % ft & m * 

CGCTAAAAAA 


TTAATAGAGA 


ATTATGATAO 


TCTAGAGGAA 


ATAGACTTTG 


8760 


i_A II TitKJ AAA 


ATCTGGTGTC 


CTATTATCAT 


TAATAAAATA 


CTATCAATTT 


ACCAATGACA 


8820 


ATACTCTTAA 


AATTTTCATC 


CACAATAGTA 




TTATCATTAT 


TTCCTACAAA 


8680 


GAGATACAGC 


CAAAGAAAGC 


ATTTTAGACT 




TCATGGATAT 


TGTGGAATTG 


8940 


CATATGCTTT 


ATTTGCCTAT 


TCTAAAGTCT 




TATGTTTTAT 


AATGATCTCC 


9000 


ATACATTCCA 


TACTGAATTA 


AAAAAATTAT 


1 AUAAAAAV>1 


TACTTCTAAT 


ACTGAAAATT 


9060 


TAGGAAATTT 


ACAACTTTCT 


TGGTGCAAAG 


un/\l I 1 v»v.Vxv* 


AATAATCTTA 


TATCTTTGTA 


9120 


TGTACGATTG 


TGACGGAAAC 


AAAGATATTA 


TTlf!Tl A ATH 


TCAAGAATTT 


GTTTTTAACC 


9180 


ATCATCTAAA 


AATGATGACA 


GGATATTGCC 


ALuOAn 1 AAV, 


TAGCTTACTA 


CAAACCACTG 


9240 


TCTACAATCA AAACAAATTA CTGATGAAAA 


AAATCCAACA 


GGTAATTTTA 


GCATGTTCTG 


9300 


AACGAGATGA 


TCACGCTTTA 


CTGATGTTTC 


AAGGAGATAG 


TGGTAAACCA 


GATTTGTTTG 


9360 


ACTTCGGAAT 


AGGAAGCATG 


GGGTATATTG 


GTGTCTATTA 


AATAATAAAT 


TCCCATTCGA 


9420 


TGTGCAGACA 


TAAGGAGAAA AGTATGAAAT 


TATTTTGGAC 


AAACAACATA 


TATAGACAGT 


9480 


TGCTGCTAAA 


CAGCTGTTTT 


TCATCATTCG 


GCGACAGTAT 


TTTCTACCTC 


GCCATTATCA 


9540 


ATTATGTGGC 


TCAGTACAAT 


TTCGCTCCGC 


TAGCGATTTT 


ACTGATTTCC 


ATTTCAGAGA 


9600 


TGGTTCCCCT 


ACTATCGCAA 


CTCTTTCTCG 


GGATTCTAGG 


AGATTTTCAA 


GAAAATAGAG 


9660 


TCAAACACGC 


ACTCTGGATT 


GCCAAAATCA 


AAATCCTGCT 


CTACGCTATT TTGACAGTAT 


9720 


TTCTCGTCTT 


GTCGCCCTTT 


TCATTAGTTT 


CAGTCATTAT 


GATTGTCATC 


ATCAACCTCA 


9780 


TCTCTGACAC 


CTTGAGCTAC 


CTGTCTGCCT 


ACATGATGAA 


CGCCCTCTAC 


ATCAGTGTAA . 


9840 


TTAAGGACGA 


CCTGCATGAT 


GCCATGGGGT 


TCAGGCAGTC 


TCTGATGAGG 


GTTGTCCGTA 


9900 


TTGTCGCCAA 


TCTGGCTGGC 


GCATTCCTTA 


TCAATGTTAT 


AAGTATTCAA 


ACTATTTCCC 


9960 


TTATCAACAC 


TCTGACTTTT 


GTCATTGCCT 


TTTTGGGCCT 


GTATGTTATT 


CGACATACCT 


10020 


TGTATGAGGT 


TGAAAAAAGA 


ATTGAAATGT 


CACATACAGC 


ACTGAGTTTT 


AAGAAATATT 


10080 
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TTCAACATCT TAAACAGTCG CTGGCTGTGC TCCTCAGGTT AAAACATACC GTCATACTAC 10140 

TGTTTCTGAC GACCAGTATG ATTGCCATCT TGGATGTGTC CCCTCGGCTG ATTGCCCTCC 10200 

GCTTCATCCA ACAGACACTA GCACAACTGA GCATTGGGCA ACTCCTCGCC CTGCTCTCCA 10260 

TCATCATGTC TTGTGCACCT ATCCTTGGCA ATATGACCAG CAGTAATCTA TTTAAAAATA 10320 

TCCGTTTCAC CCACCTCTTG GTTTTCTGTG AGATTTCCCT ATTGACTCTA ATAACTAGTA 10380 

TCCTTTGTCA ACCCTATATC GTAATTTTCA TGACCAGTTT CATCAGTTCT ACGATTATCG 10440 

GCATTCTCAG CCCTCGCCTA CAAGCAGCTG TCTTTGCCCA TATCCCCAGT GACAAGATGG 10500 

GGACGGTGGG CTCTGCTCTG AGCACAGTGG ACATTCTCGC CCCCTCCCTG CTCTCCCTAT 10560 

TAGCCCTATC CATAGCATCG GGCGTTTCGG TGCAGTTAGC ATTGATATTT TTGTATCTTA 10620 

TTTTAATTGC TCTTATCTTT TGTCAATGGT TAGTCAAGTT CAACACTCAT AACTAACGAA 10680 

AAAGCATGTG TAGATTTCAC ATGCTTTTAA TCTCCCCAAT CGTCAGGTCA AGTACAACAA 10740 

AGTCACTTCT TTGATTAAGC GAGTGTTCTA ATATAATTAT AAGCGCCCTG TCATTACCGA 10800 

ACCCATTCGC CATTATAGTT GACAGAATAG CCATCTACGG TCCTATTCAC TGCCAAAGCA 10860 

CCTGAGCTAT AAGCATAGTA CCAGTTGCCA TTGACCTGGA ACCAACCTGT CTTCATGTCT 10920 

CCATTACCTG CATTTAGGTA GTACCAAGTT GAACCATCTT GATACCAACC AGTTGCCATA 10980 

GCTCCTGATG AACGGAGATA GTACCATTTG TTCCCAAGGT TTTGCCAACC TGTTTTCATA 11040 

TCGCCATTTG GGTGGTCTAA ATAATACCAA GTGGTACCTT CCTGATACCA GCCAGTGGCC 11100 

ATTGCTCCTG AGGAACGGAG GTAGTACCAC TTATTACCTA GATATTGCCA ACCTGTTTGC 11160 

ATAATACCAG TTGTTGGATC TAGGTAGTAC CAAGTCGAAT CATCGTTTAT CCACCCCGCA 11220 

CGTCTTTCAC CACCAAGGTA GTTTTCTCCA TTAATTTCCG TCTTAGCTAG ATAATACCAG 11280 

TTAGACTGAT CATAAAGCCA ACCTGTCTCT AAAGAATGAT TTTCATTAAA GTAATAGTTC 11340 

GTATAATAAC GCTTCTCTTC TTTATCTTCT GAATCTTCAC GTTTTTCCCC GTACTTTCTT 11400 

CCAACACTGT CTTTAGTTTT AATCTCTAAT GTTTTCCAAC CAACAAACTC TTGTAGCACT 11460 

CCATTTTTAT CGAAGTAGTA CCACTCTGAC TTTGGAAAAC CTTCTAATCT GATACCATTT 11520 

GGGTAAGGAC CAATTGTACT ACCTTTAGAT GGAAACGGGA TATATTGCCA GCCGACAACC 11580 

ATCTCTCCAG ATAGAGAATC AAAATAATAG TACTTACCAT CAATCACTCG CCAGTAGGTT 11640 

TCrrPGACGT CCCCCTTTTT GTAGTAGGTT CTTCCGTTTT CTTGGACAAA CTGCCATCCT 11700 

TCAGAATCAT CTGCAAATAC TGTACTGGTC CCTAGCAAAC CAAAGAAAAA TACTGTCAGT 11760 

CCAACTTGCA TAGTTTTTTT CAAAATTTTC ATCTATATAC CCTCCAATAT TAAATCCACT 11820 

CACCAGATGA GGCGAAATTA TAAACTTTAC CATCGATAGT TTGGCTACCT GTAACCATTG 11880 



I 
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(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11340 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
(D> TOPOLOGY: linear 



11887 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 147: 
CCGGTATGTT CTGGAATACT ACCAATCTAA GCTGGCTGTG CCCTACAGTT 
GTACGAATAC CTTAAGGAAT ATGACCGATT TTTCAGCTGG GTTTTGCAGT 
AAACGCTGAT AAAATATCCG ATATTCCTTT ATCAGTTTTG GAAAATATGT 
CATGGAATCC TTTATCCTTT ATCTACGTGA ACGTCCCTTG CTGAATGCTA 
ACAAGGTGTT TCACAGACAA CTATCAATCG AACCTTATCA GCACTTTCTA 
GTATCTAACC GAGGAGGTTG AAAACGATCA GGGGGAACCT TATTTCTATC 
GAAAAAAGTT TCCACCAAGA AAAAGAAAGA AACCCTTGCT GCCAGAGCTG 
GCAAAAACTC TTTCTAGGTG ATGAAACAGA AGGTTTTCTA ACTTATATCG 
CCCACAACAG CTTTCAAATC GAGCTCTCTC ATCATTCAAC AAAAATAAAG 
AGCCATTATT GCCCTTCTCT TGGCATCTGG TGTTCGCTTA TCTGAAGCTC 
TCTAAGAGAT CTCAATCTAA AAATGATGGT TATTGATGTT ACTCGAAAAG 
TGACTCAGTC AATCTCGCTG CTTTTGCTAA ACCTTATTTA GAGAATTATC 
GAATCAACGC TATAAAACGG AAAAAACAGA TACAGCCCTT TTTTTAACTC 
TGTTCCTAAT CGTATCGATG CTTCTAGCGT TGAGAAAATG GTTGCTAAAT 



TTACAACCCT 

CTGGTATTTC 

CTAAGAAAGA 

ATACAACAAA 

GTCTTTACAA 

GTAATGTAAT 

AAAATATCAA 

ATCAAGAGCA 

AACGAGATTT 

TTAATCTAGA 

GTTGCAAACG 

TGGCCATTCG 

TCTACAGAGG 

ACTCAGAGGA 



TTTTAAAGTG CGTGTAACAC CCCATAAACT GCGCCATACA 
TGCGACTAAA TCACAAGTTT TAGTCAGTCA CCAACTAGGA 
TGACCTCTAT ACCCATATTG TTAGTGATGA ACAAAAGAAT 
TTACGTATTT TAAATTATGT AAATAAATAT CAAAAAAAGA 
ATTTATCCAA CTACCGCTTC AGCGATTTCT TCACGGCTAA 
ATATCAATGG TTTTTAGCGC CTTAAGAACA TCTTCGCGTT 
ACATCTTCTA CTGCAGCAAC GTCTTCAATA CCAAAGAAGT 
TGGATTTTTG ATTCAGTAAC GTTAGCAAAG ACTTCAACCT 



CTAGCAACTA GGCTCTATGA 
CATGCTAGCA CACAAGTCAC 
GCTCTGGATA GTTTATGATT 
AGTTGGCCAA CTTCTTTTTG 
TACCAGCGAA GTAGCGTGTG 
CGTATTTCAC CCCACGAAGG 
CACCATAAAT CTTGATGTCT 
TACCACTAGT GAATTTGATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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CCACGACGGA CGTTAAATTC ACGTGATTTA CCATAGTTCC AGTCCCAAGT TCCAAACTTA 1380 

GTATCCTTGA TGCGATTGAT TTCGGCCAAT TCTTCTTCTG AAAAGACGTA TTCAGTCATC 1440 

TCTGGGTACT CTTTTTTCAT GTATTCCAAC AGTAAATCAC GGAATTTTTC GACTGTGATT 1500 

TTTTTTGGTA ATTCATTGAT AATATTGGTT ACACGGGCAC GGACGGATTT CACACCTTTT 1560 

GATTCAAATT TATCTTTTGA AACCTTAAGG GCATTTGCGA GGACTGACAA ATCAACGTCA 1620 

AAGAGCAAGC AACCGTGGTG CATGATACGG CCGTTGATAT AGGCTTGGGC ATTGCCACAG 1680 

AACTTCTTAC CATCAATCTC AAGGTCATTA CGACCTGTGA ACTCAGCTTT AACCCCAAGT 1740 

TGAGCCAGGG TATTGATAAC CGGACTTGAG AAGCTCTTGA AGTCAAATGC CTTATTTTCA 1800 

TCTTCTTTGG AGATGATCGT CTAGTTGAGC TTATTTAAAT CGTGGTAAAC AGCTCCACCA 1860 

CCACTAATAC GGCGAACTAC CTCAATACCA TTTTCGCGAA CATAATCACG GTTGATTTCT 1920 

TCGATAGTGT TCTGGTGACG ACCAACAATG ATAGATGGCT TGTTAATCCA AAGTAGGAAG 1980 

ATTTGATCCT CATCCAAAAG GTGTTTAAAG GCGTATTCTT CCAAGGCAAT ATTAAAAGCA 2040 

GTGTCATTTG AATGATTGAT AATGTATTTC ATGATATCCC TTTACTTTAT ATGATAGAAA 2100 

CTGGAAATAA CCTTCCAGTC TAATCTATCT TCGTTTTATT TTTTCTTAGG TGAATGGATG 2160 

GCCATTCCTA GAACATCTGC AAACGCTTCG TACATCACTT CAGAGTAAGT TGGGTGCCCG 2220 

TGGATGGTCT TCAGCATTTC CTCAACAGTG ATTTCCATTT CGATGATGCT TGATGCTTCG 2280 

TTTATTAATT CTGCGGCTGC AGGACCAATA ATGTGTACAC CAAGGATTTC TCCGTATTTC 2340 

TTATCAGCGA TAACPTTTAC GAAACCTTGA GCTGCGTCAG ATGCAATAGC ACGACCGTTA 2400 

GCAGCAAAGT TAAACTTACC GATGGCAACA TCGTATTTCT CACGGGCTTG TTCTTCTGTC 2460 

AAACCTACTG CTGCTACTTC AGGGAGAGTG TAGATGGCTG CAGGAGTCAA ATTCAATTTG 2520 

GCAACTGCAT GATTTCCTTT AAGGGCATTT TCAGCGGAAA CTTCACCCAT GCGGAAAGCT 2580 

GCGTGAGCCA ACATCTTAGT ACCGTTGATG TCACCTGGTG CATAAATGCC TGGAACTGAA 2640 

GTTTCCATGT ATTCGTTCAC CTTGATACAA CCACGATCCA ATTCAAACTC AACCTCTCv.A 2700 

ATACCTTCAA GGTCTGGCAT ACGACCAATT GAAAGAAGAG CTTTGCTTGC GATGATATCG 2760 

TCTTTTCCTT CAACCTTGAT ACGAAGTTGA CCATTTTCCT CAATGATTTC TTGCAGTTTA 2820 

GTACCAGTCA AGATGGTCAT TCCTTTACGC TCAAGAATCA AGCGAAGGTT CTTAGAAACT 2880 

TCCACATCCA TAGCTGGAAC TATACGGTCC ATCATTTCGA TAACAGTCAC TTTTGAACCA 2940 

AATGTCATGA AGGCCTGACC GAGTTCGATA CCGACAACTC CACCACCGAT GATAACAAGG 3000 

CTTTCTGGCA CTTCGTTCAT TTCAAGAATG TCATCACTAG TCATGACAAG TGGAGATTCC 3060 

ATACCAGGGA CGTTGATCTT GTTGACTTTT GAACCACCAG CAAGAATGAT T TT C n W l T 3120 
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TCAAGCAATT CAGAACCATT 
TTATGAACAG TAACTCCGTA 
ACAACTTTAG ATTTAGTTTC 
ATCACGATAC CACGATTTCC 
TAGGTCTTGG TTGGAATACA 
ACAAGGGCAA CCTTACCGCC 
CCTCCACCAA TCACAACGAT 
CTTGCTACAG GTACAGGGCT 
CTTTCTTCAC CAAGGTAACC 
AGAATGGCAA TCAAGTACCC 



987 

TACCAAGACG TTCTTGTCTT TAGTGATTGT ACCAATTCCT 
GCTACGAAGA AGTCCTGCAA CACCACCAAC AAGAGTATTA 



ATGATTTCCA AAAGGATTTC 
ACGATTTGTC CTTCTGTCAT 
TCTTCCTTCC TTTATCTATA 
ATTGGCGTTT CAATCAACTC 



TAAAAGTTTT 
AGCATGACCG 
TCCACGGTTT 
GAATTGGGCA 
ATCAAAAGCA 
AGCTTCTGGC 
GATAACTTCC 
ATCTTCTTCG 
TCCTTCTTTT 



TCCATATCAA 

ATATTTTCAA 

AAGCAGGTTC 

GCTTTAATGG 

TCATCGCTCT 

GATGCTGCTC 

GTTACAGGGA 

GCTTCCAATT 

ACAAATTCTC 



CAGTGAAGTT AGGATTTTCA 
TAATTTCAGC GTTATGAAGG 
CACCAAGTTC AGATTTCTCA 
CTGCAACATA ACCAGCAGGA 
TACCATCATC GTTTGAGGTA 
CAGCTGTTGG GATGTTTTCC 
CAGTTTCACC ATCTCCTTTG 
CCATGCTGAC TTTATCAGTC 
CGACTTTTTT ATTCCATTGG 



ACGACACGGT 

TTGACGACAA 

TTAATAATCG 

GAATTTTGTA 

AAGGCTACAA 

AATCCATTAT 

TTGCCATCTT 

GCAAGCGAAA 



GGTCAATGGT 

CTGGCTTCTC 

GACCAAAGGA 

ACTCACTTGG 

CCAGTTCTGA 

CCATCCCAAC 

CTGTCAATGA 

GAAGGTCTGT 



ATCCACGCCG 

TCTTAAAAAT 

TTTCAAGTCC 

TAATCCTAAA 

GATTGTCGAA 

CTGAACACCA 

AGCCAATTTA 

AAGACTCATC 



GCTTTTGGCA 

GAATACTCTT 

TTCATAAACT 

CTCATGATTG 

CTGACACCAA 

AACATTCCCA 

CCATCCAAGG 

TTCTCAGCAT 



TAATTACTTC 

GCTCTTAAAT 

TAGCACCAGC 

GGCGAATCAC 

GGATAGCTGA 

AATTACTGAT 

TACGGCCAAT 

TGTAAACAAC 



TGCCATGGCA AGATTGACAT AGTTGTGAGT 
AGCGTTGATG TATGGGTGTT TCATAAGAGT 



TAAGGCCATG 

TAACATTGAG 

CATACCATCT 

AATTTCACCA 

GTTGGGTTGG 

TCTGAATGTT 

AACATCCTTA 

AGGTGTCATC 

GATAATAGTC 

CTTAACAACT 



AGAACCTTCT 
GTTGGCGCAG 
ATTGGAATAC 
TCAATCTGAG 
ACATCCTTCT 
TTATGTTCGA 
TAAGTTTCCA 



TACGAAGAGC 

TCAAGTAAGA 

GCTCGATTTT 

CAGGAGATTT 

TCATGATTTT 

GGGCAATTCG 

CGTCTTCTTT 



TACAGTAGTC TTCTTCCCAG TTGCTTCCAT GATTGGCTCA 
CAACATTTCA GTCATATCAA CTTCATAGTT GAGGGTGAAG 



TTCAACCATG 
ACCATATGGT 
GATGCTATCG 
ACCACGATGA 
TTTTGCAAGT 
GTGGACACGA 



CGTTGGGCAA TAACCTTACG 
GTTACGTTAT CAGGGACTTC 
TTTTCGATAT TTTCAGGAAG 
CCGGTTCCTT GGATTTCCTG 
GGCGAAATGC GAACCACGTT 
CCGTTTGCAC CTGAGCCAGA 



CATTGGTGTC 
TTCCACTTTT 
CAGGGCCAAA 
CCAAGCAATG 
TGTGTCTTTA 
AACGTCGTAC 



3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4360 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 
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AGGTTTATCC 
TCAGCCATGA 
TAGGAAATCG 
GCCCGTGCTC 
TTCTCGTCAG 
AGTTTTTCGG 
TTGTGCATAA 
GTCAAATGCT 
GTGGGCATCG 
CTTATCAAGC 
TAATTCTTCA 
ATCCGTTCCT 
TGGCACTTCC 
ATCACGGATA 
CTTAAGTCCT 
GCCAACTCCG 
CATGTAACGT 
CATGAAGGTC 
AGAGATGGCA 
CATTCCAACA 
CACATTTTCA 
CATTGTTTTT 
TGATTCAAGC 
CTTTACTTGC 
GTAATTGCGG 
ACGATATTTA 
CAAGACTGGA 
ATCGATGACA 
TTGATGGATA 
GTTAATGCAA 



CTAAATCATC 
CCTCTCCAAT 
ACGATGGCTT 
TAATCTAAGA 
CTTTATTTTA 
ATTGCATCTT 
GGCATCGGCA 
TCTGATTCTG 
TTGACCAGAA 
GGAACAAGGG 
GCAGCTTGAA 
TGGCGTTTGA 
CCTTTTTGGT 
GAAGACTTAA 
GGAATGTGAG 
TTACCAGCTG 
GTTTTAGCAG 
ATATCGACGA 
GCTTCAGAAA 
GAAGTACCGA 
TCGCGACGCA 
GTTTCCATTT 
GGTGGGAATG 
GCTTGGATTT 
AGGTTTTCGA 
CCAGGGTCAG 
CCATTGCCAC 
TTGTTACCGT 
TGTTCTATAT 
TAGAAAATGA 
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CGCTAACTTT CTAGCTGCAG 
TCTATTTATG ATACAAAGGG 
CGATGAAGCC AAGGAGATTT 
TATTAATGAC GAAGAGCTCT 
TTTACATAAC TTATCTTATG 
TGATACTTTC AACTGTTGGA 
CATCTTCTCC TGCACAACGG 



GAGTCGCTCT TAGCTTCTCA 



AAATAATAGC 
CAACCTTACC 
TACGTGGGTC 
CCACACCGCG 
TTTCACCAAC 
TAAATTCTGA 
GCAGGCCTTT 
TAAACCAAGA 
CACAACGAAC 
CTTGGTTGAC 
TTGGACGAAG 
TCGGACAGTC 
AGTCTCCTCC 
TTTCCTCAGA 
TATCTCTTTC 



TCAAATTTCA 
AGTCTTCTTC 
AACAATTTCA 
AAGCATTTTT 
CCCAAGTGGA 
CTTGTACTCA 
CATGTCCGCA 
CTCTAGAGAT 
AGTCATTGGA 
GATATTGTCC 
TCCTGTCATG 
ACGGACACGT 
GAAGACACCG 
CATAGCAAGG 
TCCTTAGTCT 



CGTCAAAAGC 
ATCTTTTTTC 
GCACCTAAAA 
TAACCCTATT 
ATCATTGCAT 
CGAATTGGTG 
CCGATATAGC 



GACTGAAAAA 

CGATCTTTTA 

GATACAAAGT 

CTTTGTTATA 

TTTCTAGGTT 

CATCTAGATA 

CACTTGTTTT 



GGCTTTCCTC 
CTTCCAATTC 
TTGGATCTTT 
ATGATGAGTG 
TGCGAACATG 
CTTCGATGAA 
TGGTCATTTT 



TGCAAATTTA 
TTCGGCACTT 
TTGTTTCCAC 
ACCGAGCCAG 
GTCCACAGCT 
CATTCCAGGA 
CTTGATATCC 



CTGGCAGGTT CCAGATAGAA 



ACTGAGTTTA 
ACTGAAATTC 
CCATAAGTAA 
ATTGTGTAGT 
AGTATAATAA 
GGTGTTCCAG 
TGTGAGTGCT 
ACCTGACCTT 
ATGGCAATAA 
GCTGCTCCTG 
TCTGGACCAA 
ACGTCTTCTC 
ATAATGGTGT 
GCGTAAATAT 
ACAGAAGCTT 
GCAATGTTAT 
AATTCCACTT 
CGATAAGTTA 
TTCTGAAATC 
ATTCCATAAG 
GCAGAGATAC 
GCCATGTTCA 



TGATGATATC 
CTTCTTCTGC 
CAACTGTTAC 
CTGGATCAAC 
CTGGGTTGTT 
GTGCCACAAC 
GGGCGGCAGA 
TACCACCAAA 
CAGAGAAGTC 
CTGCTGCTCC 
ATTCTTCAAG 
CCATCAAGAA 
CACGGAAGGA 
CTTCAAAGGC 
CTACTGCTTC 
TTTCAATAAG 
CTTCACGCGT 
CACTTTCAAT 
CTTCATAGAC 
CGGCGCTACG 
CGTAACCGTT 
CTGCTTCGTG 



4920 

4960 

S040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 
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GAAAACACCT TCATTGGTCG CACCATCTCC AAAGAAGCAG ACAACGATTT TACCGGTATT 6720 

TTGCATTTGC TGACTGAGGG CTGCACCGAC AGCGATCCCC ATACCACCAC CTACGATACC 6780 

ATTGGCACCA AGGTTCCCAG CATCAAGGTC AGCGATATGC ATAGATCCAC CTTTCCCTTT 6840 

ACAGGTTCCA GTGTATTTAC CAAGGATTTC AGCCATCATT CCGTTGAGGT CAATCCCTTT 6900 

AGCAATAGCT TGCCCGTGTC CACGGTGGTT TGAGGTAATC AGATCATCTG GATTGAGAGC 6960 

TAACATAGCC CCCACGTTAG CTGCCTCTTC ACCAACAGAA AAGTGCGTCA TTCCTGGCAC 7020 

TTTCCCTTTC TTTACTAATT GTGCAATTTT TAAGTCCATG CGACGGATTT CTTCCATCTT 7080 

ACGGAACATT TCTAGCAAAA GATTTTTATC TAAAGTTGAC ATCTTCTTGC CTTTCTAACT 7140 

TTCTTCTTAC CTTACTATTT TACCGCTTTT GGCAAATACT GTCAAAGTTT TTCTAAAAGA 7200 

AATTTCACAA AATAAAAAAC AAAACCCCGT GAAAACAAGG GATTTTCTTG TCAAGAATAT 7260 

TTTTTCACAA ACTTTTTAGC ATTTGGATTT TGCTAAAGAT TCAAATCTCT TCATAATCAC 7320 

AGTTAAACGC CAACGGTAGA GCGCCCCGCT CACAATCAAA CTAATAATCA AGCCGATCCA 7380 

GTAAGAATAA GCTCCAAAAT CTCTTAGGGA ATCAAATAGC GTAnCACAGG GATTGCTACG 7440 

CCCCAATAAC CAAGCAAACC AAGGTAAAAA GGAATAACTC TATCCTTATA CCCCCGCAAA 7500 

ATTCCCTGAA GCGGCGCCGC AAAGGTATCT GCTAACTGGA AGAAAAGACT ATAAGTTAAA 7560 

AAACGCACTG TCAAATCGAT AAATTTTGGG TCGTTACCAT AAAGACTGGC CACATTTCCC 7620 

CTAAAAATGT AAAGGAAGGT TAAGGTGAAG GCCGCAAAAA TGAGGGCAGT CCATCTTCCT 7680 

AGACCAATAT AGGTTTTCGC ATCATCAAAT CGCTTGGCTC CCACTTCATA GGAAACCACA 7740 

ATAGCCATAG CCGATGAGAT ACTCATAGGA AAGGCGTACA TAAGACTTGA AAAGTTCATA 7800 

GCTGACTGGT GACTAGCTAT AATCAAGGGC GAAAACTTAG CCATAATCAA GCCAACCACT 7660 

GAAAAGATAG CCACTTCCGC GAAGACAGTT CCCCCAATAG GCAGACCTAA ACGAACTCCT 7920 

TCCTTAATTT TATCCATATT AAGTGGAATT CGTTTCTCAA GGTGTAAGGC TTTGAGCTTC 7980 

TCCTGTTTAA ATAAAACCAG AACAGAAATC CCAAGCAAGA CCCAGTAGGC CAAGGATGTT 8040 

CCTAAACCAG CACCAGCCCC TCCCAGTTCT GGAACACCAA AGGCACCGTA AATCAAGAGA 8100 

TAGTTAAATC CGCTATTGAG AGGGAGTAAC AAAAGCATGA GGTACATGGA CAGTTTGGTC 8160 

AAGCCCAGCG AATCCAGCAA GGAACGAATG ACGCTAAAGA GCAACAAGGG GATAATCCCG 8220 

ATAGATAAAA ACCAAAGATA GCGAACCGCT ACTGCCGCTA CTGCTGCTTC TAACCCAATA 6280 

TGATTCAAGA TTATTGGTGC CAAGAAAAGT ACCATCCCCA GCAAGACCAC AGATAGGCCC 8340 

AAGGCCAAAT AAATAAATTG GTAAAAATCA GACGCAACTT CTTCCTTTTT GCCTCGACCA 8400 
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AGATGGTGAC CAATGATAGG 
GGATTCCAGA TACTGGTTGC 
GTCATTCCAG TATCAACAAA 
AAGAAAATTT TTAAAAATAA 
CTCTTTCTAT TCTGATTTAT 



CGGAGGTTTA 
TCTAACTAAA 
TTGAATTTTG 
CTCCCTACGT 
GTCCCTTACA 
TAGCTGCTTT 
TGATATCTGC 
GGTTGTATGT 
AGGTCAAGCT 
TATCCTCAAC 
TGACACCTGA 
GATTCATCTT 
GTGGCGCTAC 
CACTTTCTTC 
TGACATCACG 
GTTCTCTAGC 
GAAGGCTGAT 

TGCTCTCAAG 
TAAAGTCAGT 
TAATCTTCTC 
TTGGACTGGT 
CACCAGTATG 
CGACCAAGCC 
TGATCTCAGA 
CCAAACCAGA 



TTAGATTTTG 
CCTCCGTTAT 
GAACGATTAG 
TTTCGAAAAT 
GATGGTATAT 
GGAAGCAAAA 
TGGACCATTC 
CATTTTTTCA 
GAGAGACAGG 
ATTTTTTTCA 
TTTGAGACCT 
AGCATTGCTA 
TTCTGCTGTG 
CTTAGACAAA 
CGCCATCATA 
AATATTTTCA 
AAAGGTATTG 
CGGAATACCG 
CAAGTTTTTA 
TTTGGCAGCT 
ATTAGCCATC 
AATGTCTTCT 
AGGAGCAACC 
CGCAATTTTG 
CTTGGGATCC 



CACCAAGGCT 
CATAGATACA 
AGAGGCAGAA 
TACTAACTTC 
CTAAACCAAA 
AAGTAGTATG 
CATATTGAAC 
CTGCGGGACA 
AATTCATATT 
TTAACCTGCC 
TGGGAGTCCA 
TCAGCCAAGT 
AGTAATTCCA 
GATGTTTCTA 
TCTACATGAT 
TCGATAACGG 
CCTTGTGTTA 
ACTTCTGCAC 
TGCTGGATGT 
GCGTACTCAG 
TTAAAGCCAA 
AGTTTTTTGG 
TCATCAGAGA 
CCATTAAAGT 
GACTGGAGAA 
ATGACGACAG 
TTATGTGTTT 
ACAAGACCAG 
ATAATTTTCC 
ATTACACGAC 



990 
GACACAATCC 



CCAGCCAAGT 
TAATTGGCAA 
TCTCGTAAAC 
GAGTTTCAGA 
CCAACACGCA 
CGCATGGTCA 
GTAAATTCCA 
CTAACTCCTA 
CTTTTAAGGT 
CAAAGCGGTC 
AACCTGCTTC 
TCAAGCTCAA 
AGCCAGTCAT 
GAGGCGCGTG 
CACGACGCTC 
AAAGAAGTGC 
CTAACCCCTG 
GAACATGGGC 
CCACCCCAGT 
GAACACCCTT 
CTTCCTCCAT 
AACCAACCGC 
TTTTAGTAAT 
CTGCTTGCAA 
TAGTAAAACC 
GACCAGGTTC 
TAGCATCAAT 
CATCTTGAAC 
CATTTTTGAT 



CTGTTAGAAA 
CCATACTGTT 
ATTGGTAGAT 
ACTTTGTCTT 
CCATAGTTTT 
CATGTACGAC 
GCTTTTTCTT 
CTATTAGATT 
TCAAGCTTGA 
TTCACCGATG 
AGCCTTGGCA 
AAAGTTGTAA 
CTCACCAGCT 
ACCAGATGGC 
GTCAGTCGCA 
TGATTCCAAA 
TTCTGTCTTA 
AGCAAACTCC 
TTTAGTTGCA 
AGCACCGCAG 
CAAACCTGGA 
GCCTTCCTTG 
ACCAGCTTCT 
GGTCGCAACT 
AGTCTCCACG 
ACCTGCAGCG 
ACGGAAATGA 
CGTTTCTGCT 
TAAGACATCA 
TAGTAGCATC 



TCTAAAGAAA 
GTATTGACCT 
CAGGATTGGG 
ATACATACTT 
TCAAACTTAG 
AATAATAGCT 
TAGTTTCATA 
TCGCTTGTCT 
TAGACGATTT 
AATGGTGAAT 
TCAAAAATAG 
AGCTTGCCTG 
TCTACTAAAT 
GCTTTGGTAA 
ATAACTGTGA 
CGAAGCGGTG 
GAGAAATGCT 
ACTACTTTAA 
TAGGCAATCA 
ATATGGAAAT 
TCTTCCTCAT 
ACAATCTTAC 
AAGAGTGCCT 
GTCTTGACAT 
TCTGAAATGG 
GCTGCTACGG 
ACATGAATAT 
CCTTCTTCCG 
CAAACTTGAT 
TGCTTTCTCC 



8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
6940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
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TTTATTCATA 

CTGAAAAGAA 

TCGGCTTCCT 

TCAAGTCGCC 

TAAATTCAGG 

CAAGGATTTC 

CACCAATCCA 

GTTCTTTCCC 



GAAATCAACT 

GGGTTTATCC 

CAAAACCTCA 

CTCAAACTCC 

AAAAGTGATC 

ACGCGCCGCG 

TTTCCCTTCA 

ATTATCAATG 



TGGGTATCCA ACAATTTATC CCCATCATAA ACAAACTTGG 



TCTAAAAGCC ACTCAACAAA 
TCATAGGGAA CCCATTCTAG 



GGTCTGGTCA 
CGTCCCCTCA 
GTACCAGTCT 
GGCTTTCAGC 
CATTCCTGGG GCGTCTCTCC TCTCTCTAGC 
TGGACATCAT TGGGTTTCTT ATTACGATGG 
TAGCAAATCG TCGCTAACTG AGGCATATTT 



GTCACCTTAA AAACATAGGT 
ACACCTTTTA GAACTGGCTT 



CCTTCCCAAC 
TTGCAGTCAA 
AAATCTGGTG 
CCTGTTTCTT 
TTACCACCCA 
AGCATGAGCA 
TCTCCTTATC 



TAAGCCAATC GATTGGCTCT TGTCCTCTCT 
GCTTGGAACC CCAAAATCCT CTATAAACCG 
TCAAGTGATG AGGATTGGTA ACTAATCCCT 
CAAAAACGAC TGGTCTATCT AGATGATTGA 
CCCAGATTTG ACCAGCATGA CCATTGGCCT 
GAAGCAAGAC TCCTTGCTCA GCCCAAGCTG 



CTTTTAAGAA 
ACAAAGGACT 
TCTTCTTACG 
CCACCTGAAT 
GTCCAGCAGG 
TCAAATCATG 



CATCTGACAA TTCTTTCAAG ATATTTTGCA AGGATGGTGG 
CAGAAAAACT CAAGCCCTGC GCTTGACCTG GTCCGTGATA 
CCACCTTAAC TTCTTCAAGC AGTGTTGTCA AGAGAGCCTG 
GATAAATAAT CCCCTGAGAA TAGACCTGCT CCATAAACTG 
CCTCAGGTAA TTGCGCCTTA ATCAAAGCAT GCCAAGACGA 
(2) INFORMATION POR SEQ ID NO: 148: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



TGCATTGGCC TTGGAAAAGG 
TGGATGGGCT GATTCGATAA 
TGCATAACCT CCCCAGAGTA 
CACAGCATCA GTAAAAGGCT 
AACAGTCAAA CAAGCATTAA 
AGATTTCTTA ACTCCGATAT 
AGCTGGGATA GAGTCAGGTA 
GGGGTCTTGC CCTAGAATTA 
AAAAACCTTT TCCTTGGGTG 
ATTGATTTTC CCGAAATAAC 
GTGTTCCATA GCCGACTCGG 



10260 

10320 

10380 

10440 

10SOO 

10S60 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 
AAAAAATAGA CTTGTTAGAC TATAAATGTA GTAAGCCTAC ACAAGAAAAA TACATAGAGA 
TAAAGGTGAT TATTATGAAA TTCAAAAAAA TGCTTACTCT TGCAGCCATT GGCTTATCAG 
GATTTGGGCT TGTTCCCTGT GGCAATCAGT CAGCTGCTTC CAAACACTCA GCTTCAGGAA 
CGATTGAGGT GATTTCACGA GAAAATGGCT CTGGGACACG GCCTGCCTTC ACAGAAATCA 



60 
120 
180 
240 
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CAGGGATTCT CAAAAAAGAC GGTGATAAAA AAATTGACAA CACTGCCAAA ACAGCTGTGA 300 

TTCAAAATAG TACAGAAGGT GTTCTCTCAG CAGTTCAAGG GAATGCTAAT GCTATCGGCT 360 

ACATCTCCTT CGGATCTTTA ACGAAATCTG TCAAGGCTTT AGAGATTGAT GGTGTCAAGG 420 

CTAGTCGAGA CACAGTTTTA GATGGTGAAT ACCCTCTTCA ACGTCCCTTC AACATTGTTT 480 

GGTCTTCTAA TCTTTCCAAG CTAGGTCAAG ATTTTATCAG CTTTATCCAC TCCAAACAAG 540 

GTCAACAAGT CGTCACAGAT AATAAATTTA TTGAAGCTAA AACCGAAACC ACGGAATATA 600 

CAAGCCAACA CTTATCAGGC AAGTTGTCTG TTGTAGGTTC CACTTCAGTA TCTTCTTTAA 660 

TGGAAAAATT AGCAGAAGCT TATAAAAAAG AAAATCCAGA AGTTACGATT GATATTACCT 720 

CTAATGGGTC TTCAGCAGGT ATTACCGCTG TTAAGGAGAA AACCGCTGAT ATTGGTATGG 780 

TTTCTAGGGA ATTAACTCCT GAAGAAGGTA AGAGTCTCAC CCATGATGCT ATTGCTTTAG 840 

ACGGTATTGC TGTTGTGGTC AATAATGACA ATAAGGCAAG CCAAGTCAGT ATGGCTGAAC 900 

TTGCAGACGT TTTTAGTGGC AAATTAACCA CCTGGGACAA GATTAAATAA AATGTTTGCT 960 

CCATAAATCT CTAAAGAGAT GCAGACGTTT CATCGTACAA TAAGATAAAG AAGGCAAGTA 1020 

CGGACGTGTC GTATCTCCCT TACTTTCTTC ACTAGAAAGG ACAAGATGTG ACAAAACAAG 1080 

CCTTCAAAGA AGCAGTTTTT AGGGCAATTT TTTTCATGAG TGCAACAGTA GCTGTTGTAG 1140 

CTATTTTGCT AATCTGTTTC TTTATTTTTA GTAATGGCTT ACCTTTCATA GCTAACTACG 1200 

GCTTTGCCCG TTTTTTATTA GGCAGTGATT GGTCGCCAAC GAACATTCCG GCAAGCTATG 1260 

GTATTTTACC AATGATCGTT GGTTCCTTAT TAATTACCTT AGGAGCGATT GTGATTGGGG 1320 

TGCCAACAGG CATCTTGACA TCGGTGTTTA TGGTTTATTA TTGTCCAAAG CCCGTCTATG 1380 

GCTTCTTAAA ATCAGCTATC AACTTGATGG CAGCCATTCC ATCTATTGTT TATGGTTTTT 1440 

TCGGCCTACA ATTATTGGTG CCTTGGATTA GAAGCTTTTT AGGAAATGGC ATGAGTGTCC 1500 

TAACCGCTTC GTTACTATTA GGAATAATGA TTTTGCCAAC CATTATCAGT TTGTCAGAAT 1560 

CTGCTATCCG AACAGTTCCC AAAACGTATT ATTCTGGTAG CTTGGCTCTA GGAGCTAGTC 1620 

ATGAACGGAG TATTTTTAGT GTCATCTTGC CAGCTGCGAG ATCTGGTATT TTATCAGCAG 1680 

TTATTTTAGG AATCGGTCGC GCAGTAGGTG AAACCATGGC AGTTATTTTG GTGGCAGGCA 1740 

ACCAGCCGAT TATTCCAAGT GGACTCTTTT CAGGAACCAG AACCTTAACA ACCAATATTC 1800 

TTCTGGAAAT GGCTTACGCA TCAGGTCAGC ATAGGGAAGC CCTTATTGCA ACCTCAGCAG 1860 

TTCTCTTTTT CCTTATTCTC TTGATTAATG CCTACTTTGC CTACTTGAAA GGAAAATCAT 1920 

CTTATGAGTA AATACCTGCT AAAACTTCTC GTTTATTGTT TTTCAGCTTT AACCTTTGGC 1980 

TCTCTCTTTT TAATCATTGG TTTTATCCTC ATCAAAGGCT TACCTCATCT AAGTCTATCC 2040 
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CTCTTTTCTT 
GTTATTCTGC 
TATCTTGTGG 
GATACCTTAT 
GTCTTCTTAG 
TTGCCAGTCA 
CAAGCAAGTT 
GTTGCCATGC 
ACAGCTCCCC 
TCAGGCCGTT 
GAACCCTATG 
AGCTTATTAT 
ACACCTAGAC 
AGAAAGACAG 
AACCCTTAAC 
AGATGAGCAA 
GATGGTTTTT 
CCCAAGGACA 
AAAAGGGGCA 
ATCTGGCGGT 
TCTGTTAATG 
CCTCATTCAG 
AGCTTCACGT 
AGATACCGTT 
ACGGTTCGGA 
TTAGAACAAT 
CTGGCCTTAG 
ATCAACCAAG 
CCACAAGTCT 



GGACTTATAC 
TCTTTGGTGC 
AATATACAAA 
CTGGGATTCC 
GTTTTCAATA 
TTATTCGCTC 
ATGGACTTGG 
CAGGTATTTT 
TCATGTATAC 
CTCTAGCCCT 
CTACCGGCGT 
CTCGAAAACT 
TTATTTTACG 
ATTACTGCCT 
CGGATGAACG 
GATATTTATA 
CAACAGCCTA 
CATGGTATTC 
GCCATTTGGG 
CAGCAGCAAC 
GATGAGCCGA 
CAACTAAAAA 
ATTTCAGATA 
GACGTGTTTA 
TAAGGAAGGA 
CCTTTTTAGG 
CCTCCAAAGA 
GTCAAAGCGC 
CTGACCTTCG 



TTCTGAGAAC 
TCTTCTTTTA 
AAAAGATTCC 
TTCCATTGTT 
CTCTCTGTTA 
AACAGAAGAA 
GGCAGGTAAG 
AGCTGGAGTG 
ATTAGGTACC 
ACATATGTAT 
GATTTTGATT 
TGTGAAAGGA 
GGGATTTTCA 
TGATAGGCCC 
ATTTGGTTCC 
GTAGCAAATT 
ATCCCTTTGC 
GAGACAAAAA 
AAGAAGTCAA 
GCCTTTGCAT 
CTTCAGCCTT 
AGGATTATAC 
AAACTGCTTT 
CCAATCCAAA 
AAAACCTATG 
ACTAGGGCAA 
CAAGGAGATG 
TATCGAATTG 
ATTTGTGATT 



ATTTCCCTTA 
GCCTTGCCCA 
CTTTGTGTTA 
TTTGGTCTGT 
TCAGGAATCT 
GCCCTTTTAT 
TTACGGACTG 
ATACTAGCTA 
TCTACCAATA 
ATGCTGTCAA 
ATTACTGTTT 
GCTTCCTAGT 
AGCCTTAAAA 
ATCTGGTTGT 
TTCTTGCCAT 
CAACCTTAAT 
CATGTCTATC 
ACAATTAGAT 
AGATGATCTT 
TGCGCGAGCT 
AGACCCTATC 
GATTATCATT 
TTTCTTAACA 
AGATCAGCGC 
AGAAATCAAT 
CTTGTCCTTG 
GCAGAGCTAA 
ACCTGTGCCC 
AGCATCATGT 



TGCCAGCGAT 
TAGGGATTTT 
AAATCATGCG 
TTGGCATGCT 
TAACCTCAGT 
CTCTTAGTGA 
TTTTTAGAAT 
TTGGCCGTAT 
CGCCAAGTAG 
GTGAGGGGCT 
TAATGATAAA 
ATGGGAACAT 
AATATTTCGA 
GGCAAATCAA 
ATTGAAGGCC 
CACCTACGTA 
TATGATAACG 
GCCTTAGTGG 
AAAAAGAGTG 
TTAGCAGTAG 
TCCACTTTAA 
GTTACCCATA 
GGAGAAATTT 
ACAGAAGACT 
TTGACTTAGA 
AAACAGCTTC 
TTATCAATAA 
GTTTGTTGGC 
CTTCTTGTTC 



TATTTCCACC 
TGCTGGTTTT 
ATTGGCCTCA 
CTTCTTTGTA 
TATCATGGTG 
TAGCATGCCT 
TGTTCTACCA 
CGTTGGTGAA 
TCTCATGTCT 
ACATGTCAAT 
TACTCTATCA 
TTTCAGTCAG 
TTCAATTACC 
CTTTTCTAAA 
AAGTCCTCTT 
AGCCTGTAGG 
TGGCTTATGG 
AGAAATCTTT 
CCATGTCCTT 
AACCTGATAT 
AAATTGAAGA 
ACATGCAACA 
GCGAATTTGG 
ATATTTCAGG 
ATTGCATGAA 
AAAAGCCTTA 
GGATCATGCT 
CTTGCAGCAG 
AGACCTTGAA 



2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 
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CGTATGGGAG ACCATATGGC AGGCATTGCC AAAGCTGTTT TGCAACTAAA AGAAAATCAA 3840 

CTAGCCCCTG ACGAAGAACA GTTACACCAA ATGGGTAAAT TATCCCTCAG CATGCTAGCC 3900 

GATTTATTGG TTGCCTTTCC TTTGCACCAA GCCTCAAAAG CTATTAGTAT TGCTCAAAAA 3960 

GATGAACAGA TTGACCAATA TTATTATGCC TTATCAAAGG AAATCATTGG ACTTATGAAA 4020 

GACCAAGAAA CCTCAATTCC CAATGGAACT CAATACCTTT ATATCATAGG GCATCTGGAA 4060 

CGCTCGCTGA TTACATTGCT AACATTTGTG AACGCCTAGT CTACCTAGAA ACAGGAGAAC 4140 

TAGTGGATTT GAATTAATTC AACTAATCCT TAAAAGAGAA GAGTACGATT AAGTACTCTT 4200 

TTTTATGGTT GTAAAAAAGT TCATTTGACC AATTTAAGCA GTGTACATAG TGAGGAGTTG 4260 

TTTCAATTCT ATCGTGAACG AGGGAATGCT GAAAACTTTA TCAAAGAAAG GAAAGCAGGA 4320 

TTCTTTGGGG ATAAGACAGA TAGTTCGACC ATGATTAAGA ATGAAGTACG TATGATGATG 4380 

GGCTGTCTGG CTTATAATCT CTACCTCTTT TTAAAGCAGC TAGCTGGTGA TGAAGTAAAG 4440 

TCCTTGACTA TCAAGCGTTT TCGACGTCTC TTCCTTCATA TTGCCGGAAA ATATGTCTCT 4500 

ACTGCTAGAC GACATATTCT CAAATTCTCA AGTCTATACG CCTATTCAAA ACAGTTTCAA 4560 

GCCTTATTTG ATACAATCTG CCAGATAAAT CTGATACTCC CTGTTCCATA TAGAGCTAGA 4620 

GGGCAGGGGA AAACATGCCT AACAGAATAA GTCACCTTAT TTTAAAAATC GAGCATCAAA 4680 

CCAAGGGAGG AGTCTGCCCT TTTTTAGGAA AAAATCAAGA CAAATCTCCT CAATTATCTC 4740 

TCGAACATCA GAAATTAAGC AAAATCACCA GAAGGACAGT ATTTCAACTA GCTTTTCTGG 4800 

TAATTTTTGA ACTGTGTAGT TCGTTAGTGC CAGATATGAA TAATTTGGGA TGATAAATCT 4860 

TTCTTCCTCA GGTAGCCTAT CATAATACTC TTCAAAAATC TTATCAAAAA CACTCTCTTT 4920 

CTTTTGGGCG ATAGTTTCAT CTTCGTATGT AGGAGTCCTC ATCAAGAAAT ACTTCAATTC 4980 

TAGGTATTCC TTATCCAACT CTATATAACT TGGCATCAAC TTGTAATCTT CAACCCCCAA 5040 

ACGTTCAGCA ATATATTTTA ACTTTGTTAG TATTGGTCTG GATTCTCCAT TTTCAATTCT 5100 

AATTAATTGA CCGATACTTA ATTCAGACTC ATCACCACAA AATTCTGAAC GACTGATTI P 5160 

TTTAGCCAAA CGTAATCTTT TAATTTTTTC GCCAAACTCT CGCAACCTAC AAGAACTTCC 5220 

TGAGTTGTTT ACCTCTATTA TAAGCATATA CTGAATCAAA CTATCTATCA GATTTCTTCT 5280 

CACTTTAACT AAAGACTAAG AGTTTATCCC TTCGTCTCGG TTTTTGTGTA TTTTTCCACC 5340 

ATACCCCAGT AATGCAAGTG CAAAATCCCC TAGAATATGA TAGAATAAGA GAAAGAACTC 5400 

TATCAAGGAG GAAATCATGG AAAAACAAAC CGTCGCCGTC TTGGGGCCTG GTTCTTGGGG 5460 

AACCGCCCTT TCACAAGTCT TAAATGACAA TGGACACGAG GTACGTATTT GGGGAAATCT 5520 

TCCCGAGCAA ATCAATGAAA TTAATACACA CCATACTAAT AAGCACTACT TTAAAGATGT 5580 
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CCTTCTAGAC GAAAATATCA TTGCCTACAC CGACTTAGCA GAAACATTGA AACATGTGGA 5640 

TGCGATTTTG TTTGTTGTCC CAACAAAAGT GACACCACTT GTTGCCCACC AAGTTGCACA 5700 

AACCTTGGAC CATAAGGTTA TCATCATGCA CGCATCAAAG GGATTAGAAC CTGATAGCCA 5760 

TAAACGATTA TCAACCATTC TTGAAGAAGA AATTCCTGAA CATCTCCGTA GTGATATCGT 5820 

CGTTGTTTCA GGGCCTAGTC ATGCAGAAGA CACCATTGTG CGTGACCTAA CTTTAATAAC 5880 

TGCTGCTTCT AAAGATTTAC AAACAGCTCA ATACCTTCAG AAGCTATTTA GTAATCACTA 5940 

CTTCCGACTT TATACCAATA CGGATGTTAT CGGGGTTGAA ACTGCTGGTG CTCTTAAAAA 6000 

TATTATTGCP GTCGCTGCTG GAGCTTTACA TGGTCTTGGA TTTGGTGATA ATGCTAAGGC 6060 

AGCCATCATC GCTCGAGGTT TAGCAGAAAT CACCCGCCTA GGGGTAGCAC TCGGGGCCAG 6120 

TCCATTGACC TATAGCGGCT TATCTGGTGT GGGAGATTTG ATCGTAACGG GAACTTCCAT 6180 

CCACTCTCCT AACTGGAGAG CTGGAGATGC TCTCGGACGA GGAGAATCCC TACCTGATAT 6240 

AGAAGCTAAT ATGGGCATGG TAATCGAAGG AATTTCAACG ACTCGACCAC CCTATGAACT 6300 

AGCCCAAGAA CTTGGAGTCT ATATGCCCAT TACACAGGCT ATTTACCAAG TTATTTATCA 6360 

CGGAACCAAT ATCAAAGATG CCATTTATGA CATCATGAAC AATGAATTTA AAGCAGAAAA 6420 

TGAGTGGTCT TAACCCTCTA TAGAAAGGAT TTTTATGACA TCAAAAGTTA GAAAGGCAGT 6480 

CATCCCTGCT GCTCGACTAG GAACTCGATT TTTACCAGCA ACCAAGGCCC TTGCCAAAGA 6540 

AATCTTCCCA ATCGTAGACA AACCAACTAT CCAGTTTATC GTGGAAGAAG CTCTCAAATC 6600 

AGGTATTGAA GATATTCTAG TTGTCACTGG TAAATCAAAA CGTTCTATTG AGGACCACTT 6660 

TGATTCAAAC TTCGAATTGG AATATAACCT CAAAGAAAAA GGGAAAACAG ATCTTTTGAA 6720 

GCTAGTTGAT AAAACAACTG ACATGCGTCT GCATTTTATC CGCCAAACTC ATCCACGCGG 6780 

TCTCGGAGAT GCTGTTTTGC AAGCCAAGGC TTTCGTCGGA AATGAACCTT TTGTCGTTAT 6840 

GCTTGGTGAT CACTTGATGG ATATCACAGA CGAAAAGGCT GTTCCACTTA CCAAACAACT 6900 

CATGGATGAC TACGAGCGTA CCCACGCGTC TACTATCGCT GTCATGCCAG TCCCTCATGA 6960 

CGAAGTATCT GCTTACGGGG TTATTGCTCC GCAAGGCGAA GGAAAAGATG GTCTTTACAG 7020 

TGTPGAAACC TTTGTTGAAA AACCACCTCC AGAGCACGCT CCTAGCGACC TTGCTATTAT 7080 

CCGACGCTAC CTCCTCACGC CTGAAATTTT TGAGATTCTC GAAAAGCAAG CTCCAGGTGC 7140 

AGGAAATGAA ATTCAGCTGA CAGATGCAAT CGACACCCTC AATAAAACAC AACGTGTATT 7200 

TGCTCGTGAG TTCAAACGGG CTCGTTACGA TGTCGGAGAC AAGTTTGCCT TCATGAAAAC 7260 

ATCCATCGAC TACGCCCTCA AACACCCACA AGTCAAAGAT GATTTGAAGA ATTACCTCAT 7320 
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CCAACTTGGA AAAGAATTGA CTGAGAAGGA ATAACAAAAT CATTTATATA AAGATTAGCC 7380 
ACACATAAAT TAACTAAATT CTCTACTTGA ATCTACCTAT TTAATAAAAA CTAATGAAAA 7440 
CCCTATACTT GTATTTGTTT TTTCATTAAA ATAAGAGTAG AATAAATTAG TATAGTAAAA 7500 
CAAAAAAGCA CCGAATCGGT GCGCACTTTT TCAAGTTGTG TACGCACAAA GCCTTATTTT 7560 
AACTTTGCTA TGTTGTTTCT AATGGTTCCA AAATAATAAA TATTTTTTAAA TTTGACTTAA 7620 
CTGTTGGAGT AGTCATGGTT AAATTAAATC AACCGAGCCG AACATAAGTT GTTTAATTTT 7680 
GTGGAAGCTA TTAATAAAAA TATAATAAGG GAGAAAGATA GCTGTAATTT TAATTTTAAA 7740 
GTAATTGCGG ACACTATCAA AGAAAAAGAT TATGGAGAAC AAATTTGTAG AATTTATCGA 7800 

AAACAATAAA AAAGTAATCA TTTCATCAGT TGCAGTTGGT GTTGTATTGG TATTAGGGTT 7860 

TGGATGGTAT TCATATAACC AACAACAAGC AGAACAACAA GCAAAAATTG TACAATTAGA 7920 

AAAAGATAGC AAATCAGACA AAGAACAAGT TGATAAACTA TTTGAATCAT TTGATGCATC 7980 

TTCAGATGAA TCTATTTCTA AATTAAAAGA ACTATCTGAA ACTTCACTTA AAACCGATGC 8040 

AGGTAAAGAC TATCTTAATA ACAAAGTCAA AGAATCATCT AAAGCAATTG TAGATTTTCA 8100 

TTTGCAAAAA GGTTTGGCTT ATGATGTTAA AGATTCAGAT GACAAATTTA AAGATAAAGC 8160 

AACTCTTGAA ACAAATCTAA AAGAAATTAC AAAACAAATT GATTTTATCA AAAAAGTTGA 8220 

TGAAACTTTT AAACAAGAGA ATTTGGAAGA AACTCTTAAA TCTCTAAATC ATCTTGTTGA 8280 

TAAATATCAA AAACAAATCG AACTTTTGAA GAAAGAAGAA GAAAAAGCTG CTGAAAAAGC 8340 

TGCTGAAAAA GCAAAGGAAT CTTCTAGTCA AACTAATTCT TCTGGTAGTG CTTCTAATGA 8400 

GTCTTATAAT GGATCTTCCA ATTCAAATGT AGATTATAGT TCATCTGAAC AAACTAATGG 8460 

ATATTCAAAT AATTATGGCG GTCAAGATTA TTCTGGTTCA GGAGATAGTT CAACAAATGG 8520 

TGGATCATCA GAACAATATT CATCTAGCAA TTCAAACAGC GGAGCAAATA ATGTCTACAG 8580 

ATATAAAGGC ACTGGTGCTG ACGGCTATCA AAGATACTAC TACAAAGATC ATAATAATGG 8640 

AGATGTGTAT GATGACGATG GAAATTACCT TGGGAACTTT GGTGGCGGCA TTGCAGAACC 8700 

TAGTCAACGC TAATAACTAT TTTAGAGCTG TGTTGTTTCG AATGGTTCCA AAACACATTA 8760 

AAAGCTACTC ATTTTTTAAG TAGCTTTTTT CTTATTCAAG TTTACATATT ATACTCAATG 8820 

AAAATCAAAT TCAAACCACG TCAGCATCGC CTTACCGTAG GTATCGTTAC TGACTTCGTC 8880 

AGTTTCATCT ACAACCTCAA AACCATGTTT TGAGCTGACT TCGTCAGTTC TATCTACAAC 8940 

CTCAAAGCAG TGCTTTGAGC AACCTGCGGC TAGCTTCCTA GTTTGCTCTT TGATTTTCAT 9000 

TGAGTATTAG TCGTCACAAT CCCATTCCCT TGTAGAAAAG CAAAATGGCG AGTCCTACGA 9060 

ACAAGACTAC CGCTCCTAAT CTCTGGCTGG TGTTATACAT CCGTTTTTCT CCTCTAACTG 9120 
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GAAAGATAAC 
TTCCTGGAAT 
CTAGCTGTTG 
GCCCATAAAG 
AGAGATTGCC 
TCTCCTCTAC 
ACCCAATATG 
TAGCTGGCCC 
CTGCAGTGAC 
CAGGGTAACG 
TGGTTTTCAG 
GAAAAATGTT 
GTCGTAAAAG 
AAATCTCCCT 
ACCACAAAGT 
TTCAGCGCCT 
GCGATGACCT 
TCTATAGCCT 
AATTCCGATT 
CCGCAGCCAC 
CGTAGGGACT 
CAAAGTCCTC 
CAAAGAAGTC 
CTTGTTTGAG 
TTTTTACCCT 
GTAAAAAAGC 
TCACCACTGC 
CAAAGTAACT 
CTTTGCCTTT 



TGCTAGAAAT GCGCCACCAA CTGCACCACC GATATGGCCT GCTAGGCTGA 



CAGAACACTT CCAATAATGT 



GATATAAGGA TTGCGAGTTG 
AGAGGTAGAG GCGCCTGCTG 
CATCATTCCT GATAAAAGAT 
CTCCCTTCCA AGATAATAAA 
AACAAAAATG GCAGACAAGA 
ATACATGGCT CCAAATCGAA 
CAACATTAGT AAAAATACCA 



TCTATCAAAG 
GTATAAAGTC 
CCAGATAGCG 
CCAGACCAGG 
GTAGCTCCAG 
CCATGCGCCC 
GCTCAATCAG 



ATTTCCTTCA 
CTGAATTTGA 
GTCATAATAG 
AACATGAATC 
TAAGGCAAAG 
CTTGGGATAA 
TTCCTGCGTT 



TGGCTTCTTG ATAAAAGGGG 
GTTTTTGCTC TTGAGATATA 
TCATAGACAA GCCCTCTATT 
CCCAATAGCT AAGACTTCTT 
ATCGATACCT AGCCAAAACA 
GCCTGTCATA GCAGGTTCGA 
CATCAGTTCA CGCGCCAAGG 
TTCCACTTCG ACTTCCATAT 
CTTTTGCACC AAGAGACTCA 
TGTGTCTGTG ATGACATTGT 



TCCCTCGATT GGGTTGACAT 
AGCCGCCACC AAGGCGTCAT 
GAAACGGATC TTCACCTCGC 



TAACCACAAA 
CATACCGAAG 
CTAAGGATTT 
AGAGAAAGAA 
GTGAAAGCAT 
GACCCCAAAC 
ATAATGTATC 
AGGCCGTCAC 
TCAATTAATA 
CAAGGATATA 
CCTCCACCGT 
AAATCAATCT 
AAAGTTTTTA 
GTTTTGGGTA 
TGAAACTCAT 
TGTTGTAAAA 
GCCTTCATTT 
CTGCTCCCTT 
CCTTAGGACT 
TCACGCCATC 
TATCAATCAA 
CTGGATTGTT 
CAAAGGCAGC 
TGTCCTGTGT 
TGGTGGTTCC 
TGCGGCTAAC 
TGGCTTCATG 
AAGTTCCTGC 



AAGTGTCAGA TAGGATTGCC 



AACAATAATC 
AGGACTAAAT 
AAACTGCTTA 
ATTAACAATG 
CTGCTCGGGA 
TGCCCTGTCA 
TAAGAGGAAG 
CCTCCTGAAC 
TCGTACTCAA 
ATCCTATCCG 
GAGATGCATC 
CCAACTGTTG 
TTAAAACCTT 
GAGAAAAAGA 
GCCGCTCGGT 
CATGCAAGAC 
CTTTTTCAGG 
CATTTGAGGG 
AACCTTTGAA 



GCGGCAAATA 
ACAAAAACAA 
GAACCGAAAA 
AAATGTTCCC 
AAGAGGCGAA 
AAGTTTCCGC 
AAACTCGTCA 
AGGAATATCA 
ACTACGACCA 
ATATCCTTTC 
CACCACTTCC 
CGGATCATAG 
CTTGCCGTCC 
GAGGTAGGTT 
TAAACCTTGG 
TTGCTTGCGT 
AAACTAGACA 
TGATGAAGAG 
AGGAGATAAC 



CTCGATTCCG 
CTCAACAGGT 
TGCAACCCCT 
CAAGGCACGA 
AGCTTGAAAA 
AACTGACTCC 
AGGAAAAGCT 
AAAGAGTGTA 



TCTTTTTCGT 

AGGTATCCAC 

TCTGCAACTG 

ATAGTTCCAT 

ACGCCGAAGG 

ACTTGGGTCA 

GCGTGGCCAC 

TGAGTATTAG 



9180 
9240 
9300 
9360 
9420 
9480 
9S40 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
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TCGCAATCTG GCCGACTTTC AAATCTGGAC GAACATGGAG ACCATAGAAT TGATCTGGCA 
ACCAATCTCC AAAAGCACCG TCCTCATACA TGAGCATACC ACCAGCTTCA TTTTCTTCAG 



CAGGCTGAAA 
AGCCTAAGGC 
GAGAAGCAAA 
AACCAATGGT 
TACGAATTTG 
CCTGAGTCTT 



TAGAAAGAGC 
AATGGTCATA 
AGGTAGACCT 

TCGCTCCGGC TGACTTCCCT GCAGGTAGAC CAAAATCCCT 
AACAAAATCC 
GAACTCCTCC 



AGATTATTCT TGGGTTGCTC CTCAAGGGCG 
TGAAAATCAT GGACACAGCC ATGCATGCGA 
CGATAGGCAG GCCATCAATA 



TTGCCCGTAG TCAATTTCTC AATCACATCC 



AAGCCAATCT CTGGAATCTG GTGTAAATCT 



CGCTCAAGAC 
CCTTGGTGTT 
TCTGTCCGCC 
GTCCGCCAAG 
AGCAAATAAG 
CCTCTAGTCT 



GAATCAAATC TAACATCTAT CTGTCCTCCG ATATAGCAGA AAGAGGCTGG AAAAAGGGTT 
CCGCCTCTTT TTTACTTTTA CAATTACAAG GTACGAAGCG CATCCTCTAG CGCTGTTTTT 
TGTTGAGTTT GGGCATCAAT TTCTTTGATA ATACGAGCTG GAACACCTGC TACTACCACG 
TTTTCTGGGA CATCTTGGGT AACAATACCT CCTGCTGCGA CAACTGAACC ACTACCGATT 
TGGACTCCTT CGATAACCAC TGCATTAGCA CCGATAAGAA CATTGTCTCC GACACGGACT 
GGTTCAGCAC TAGCTGGCTC AATCACACCT GCCAAAACTG CACCTGCACC AACGTGGCTA 
TTTTTTCCAA CGATGGCACG GCCACCAAGG ATGGCACCCA TGTCAATCAT GGTTCCAGCA 
CCGATTTCAG CACCGATATT GATAACAGAT CCCATCATGA TAACAGCATT GTCACCAATT 
TCCACCTGGT CACGGATAAT CGCACCTGGC TCGATACGAG CGTTGATAGC ACGCTTATCT 
AGCAAAGGAA CTGCAGAATT ACGAGCATCT TGCTCGACAA CATAATCTTG ATTTTCTACC 
AAACCTTCAA GAAGCGGAGC CACATCCTTC CAGTCTCCGA ATAGGACATT TCCTAGTTTG 
ACAACAGAGC TAGGCACAGC AGTTGCGAGT TGCCCCTCAA AGGTTACTTT GACACTGGTT 
TTCTTTTCAG CATTGGCGAT AAATTGGATA ATTTCTTGAG CGTTCATTTT TGTAGCAGTC 
ATAGGTG 

(2) INFORMATION FOR SEQ ID NO: 149: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12566 base pairs 
<B> TYPE: nucleic acid 
(C) STRANDE0NE5S : double 
(D> TOPOLOGY: linear 



10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11860 

11940 

12000 

12060 

12120 

12127 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 149: 
CCATCCTTCT GTTGATGTGA CAGGAATGAT GATAAATCAA CCAGTAGCTA GTCCCGAAGA 
GGTGACAGAG GCTTTGAGTC ACTTGGCGGT AGAGCACAAT AGTCTCATTG CTCGTCGAAT 



60 
120 
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CGTTGAGCCA 

GCTTCCAGAA 

GTCTTACTTC 

GCTTGGTTAT 

GACGGCCACC 

GACCCTGATT 

GAGCTTGTTT 

AGTGCTGGTA 



AATGAAGCTG 

GGTCTGACCA 

ATTGTATCAG 

CAAGGCTTTG 

CCTATGGTGC 

TATCGGATCA 

GGAGTTGCTC 

TCCAGTCTTT 



999 

GAGAAACACG CTTTACCTAT GCCACTTATG 
TTTCCTCCAA GGAGAGTGCA GAAACGAGTG 



GAAGTTTGGA TGGAGTGAGC TTACAGACCA 



GGCAACGGTG 
TTCTACCTTA 
ATTGAAAGGG 
AGCTGTATTG 
GGAAATGGAG 
TGGTTGGTCT 
GACATTTACT 
CAATTTCTCA 
GTCAGGGAAT 
TTCAGAGTTT 
TGAGAGCTTG 
CTTTTCATCT 
AGCTTTTACA 
CCAGTACCTA 
TGTTGCAGGG 
AGACAGCATC 
AAGCCAGCTA 
TATTGGGACG 
CTATTTTGAG 
TGAGCTTCAT 
ATCTAGTATT 



CAACTGGTCA 
CTAAGTGTCG 
AAACTCCCTC 
GTGGTCGGAT 
AGAGCTAGCA 
AGTGCATTTG 
GAAGAACGGT 
GATGGAGCAG 
GTTATCTATG 
ATGGACAAGA 
CGAGAGCAGT 
GAAAGTGTAG 
GAAACAGGAC 
AAAGATCCGA 
ATGTTGTGGG 
ACAGCTCTAA 
TTTTTTGCCA 
ATTTTGACCC 
CAGTTCAGAC 
GGCAAGTATT 
TTGACAAGAG 



TTTCGAATGG 

TACTGAGTTT 

AATCCCTTCG 

TCAGACCAGT 

TGGGATTGGG 

TCATTGCTCT 

TCTATCTACT 

TCAAACGTAT 



AGAAGATCCA TTTTCGATAG 



AGCTATTTTT 

TCAGGCAGGG 

GTTAGAAGAT 

GATTCTCTGG 

TCTACTTTAT 

TGGTTTACAG 

GATGACATTG 



CTGCTGACCT 
ATTCGCTTAA 
GTGAGACAGC 
TATCAAGCTG 
GGATTGACCT 
GAAAATAGTC 
ATGATCGTGG 



CGAGTGCGAC AGCTCTCCTA CCCCACTACC 
ATAAATGGAG CCAGTCCTCA GACCGTTACC 
CCGATGAAGA ACGAACGCGT AAGGATAATC 
TAGCCAATAC AGACTCTTTT TATATTATGA 



AAGTGGACCT 
TCTCACCGCG 
TGCAAAACTT 
CTGTCTACTA 
AAGTGACGAG 
AGGAACGTTT 
TTATTGTAGT 
GAACTACGGC 
AAGAGAAAGG 
AGGTACTAAA 
TGTCTACGGC 
GGGAACTTAT 
TACTGGCGCA 
ATGGTTTGAT 



AGATGGCAAT 
CTATCTGATA 
GTCTGAGGGA 
CCAAGGATTG 
TCAGAAACAC 
CCTCTATAAT 
TCTAACGCCG 
TAATAGTGCC 
TCTGTATCAC 
TGACAAACGG 
TATCTTGTTA 
GATTAAACGT 
AGGAGGAGTT 
TAGCGCTCTA 



CGTCTCAGTG 
GAAGAAAAGA 
GAGTTTGGGC 
TTTACAGATT 
TACCTCCCAC 
GATGGGTACA 
CAAGCGACTG 
TTGAAACTAG 
AAGGTTTCTT 
GTGGAGTTTT 
TTTGATTCCA 
CTTGCTGGTA 
CTCTTGCTTG 
GTTGTAGCTT 



GTGAGGGAAA 
ATTTATTAGC 
CCTTGAAAGA 
TCTTACTATT 
TTATGAGTCT 
TAGCTGGTGA 
TTATCTGCTC 
CCTTGTTTAT 
TGGCAGGGAT 
TGGTGGATCT 
GGCAACTCTT 
GTGAAATGCA 
GTCTATCCTT 
GTGAGTGGCA 
GCAATGTTGA 
ACTACACACC 
TTACCGTTTC 
TGATCTTGCC 
ACCTGCAAAA 
AGGTAAGGCT 
AGACAACACG 
GAACAAGACC 
ATCGATATGG 
ACTTGGTAAA 
ACTCTCTCCT 
TGAATCTTCT 
TGACAATCTA 
GCCTAGTCCT 
TGTTTACGCT 



ISO 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1600 
1860 
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1000 

TAACGCCCTC TTGATTTTAG TAAGGCAGGA CAAAAAAGAA GAAGCTCGTA GCATGGCAGT 1920 
ATTGAAAGGA AAATAAGATG ATTGATATTC AAGGATTGGA AAAGAAATTT AATGACCGCG 1980 
CGATTTTCTC TGGTTTGAAT CTCAAGCTGG AGAAGGGCAA GGTTTATGCC TTAATCGGAA 2040 

AGAGTGGAAG CGGAAAGACG ACGCTGCTGA ATATCTTGGG AAAGCTAGAA AAGATAGATG 2100 

GTGGAAGGGT TCTCTATCAG GGGAAAGATT TAAAAACCAT TCCCACTCGT GAGTATTTTC 2160 

GAGACCAGAT GGGCTATCTC TTTCAAAATT TCGGCCTCTT AGAAAACCAA TCAATCAAAG 2220 

AAAATTTGGA TTTGGGTTTT GTTGGTCAGA AAATCTCAAA AGTAGAACGT TTGGAAAGGC 2280 

AAGTGGGGGC TTTAGAAAAA GTTAATCTAG GGTATTTGGA TTTAGAACAA AAAATCTATA 2340 

CTTTATCTGG GGGAGAGGCC CAACGAGTTG CCCTTGCTAA GACTATTTTG AAAAATCCAC 2400 

CCTTGATTTT GGCAGATGAA CCAACAGCAG CTCTTGATCC TGAAAATTCA GAGGAGGTTA 2 4 SO 

TGAATCTCTT GGTGGATTTG AAAGATGAAA ATCGAATTAT CATCATTGCG ACCCATAATC 2520 

CCCTAGTCTG GAATAAGGCT GATGAAATCA TTGATATGAG GAAACTTGCT CATGTGTGAA 2580 

AAAATCCGTA TTCGCAGGGT ATCTGATTAT CCTAGTGCCA GAGGTGGTTT AGAAGATATC 2 640 

CTCATCATGG AAAATATGAC CAATCATCTC CTTTTGGTTC AAATCCGAGT GCATGGCTAT 2700 

TTGCTTGATT TTGCTAGTAT TGAAGGGCAA AGGCAAAAGC ATTATCGTTT GAAAAATTTA 2760 

CCTCAGACGG TTGAACTGAC AGTGGATGAT GTGGAGGAGG ATGTGGATTT GACCCTACCT 2820 

GAAAATCGAA GTTATCAAGA AGCTGATTTT TTTGAACGCA TGTTTCGAGA GAACTGCTAA 2880 

GGCCACTTTT AAAGATTTCC AAGACTATCT TTCTTCATGA GGAAAGATAG TTTTTTGGTA 2940 

TGATTTTCAT TCCCAAAATA CAAGGGGAAT GTGTTACAAT AGTAGTAACA GATAATAGAA 3000 

AAGAGAATAG ATGAGAATTG CAGATTATAG CGTGACCAAG GCAGTGCTGG AGCGTCACGG 3060 

TTTTACCTTT AAAAAGTCCT TTGGGCAAAA TTTTTTGACG GATACCAATA TCCTTCAAAA 3120 

AATTGTGGAT ACGGCTGAAA TTGATGATCA GGTCAATGTC ATCGAAATCG GGCCAGGTAT 3180 

TGGTGCCTTG ACAGAATTTT TGGCTGAGCG TGCAGCCCAA GTCATGGCTT TTGAGATTGA 3240 

CCACCGTTTG GTGCCAATTT TGGCAGATAC CCTGCGTGAT TTTGATAATG TGACCGTAGT 3300 

TAACGAAGAT ATTCTCAAGG TTGATTTGGC GCAACATATC CAGAATTTTA AAAATCCTGA 3360 

CCTGCCAATC AAGGTAGTGG CTAATTTGCC TTACTACATC ACGACGCCTA TTCTCATGCA 3420 

CTTGATTGAG AGTGGCATTC CTTTTTGTGA GTTTGTGGTC ATGATGCAGA AAGAAGTAGC 3480 

GGACCGCATT TCAGCCCAGC CTAACACCAA GGCTTACGGT AGCTTGTCTA TCGCCGTGCA 3540 

GTATTACATG ACAGCCAAGG TTGCCTTTAT CGTGCCTCGT ACGGTCTTTG TGCCAGCGCC 3600 

AAATGTGGAT TCAGCCATCT TGAAAATGGT GCGTCGTCCA GAGCCAGCCG TAGCAGTAGA 3660 
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1001 



AGATGAGAAC 
GTCGAATAAC 
GGCTTTGGAC 
ATTTGCCGGT 
TTAAAGCCTT 
CGCGTGGGAA 
CTGCCGAGGA 
TTCGTCCGCC 
ATTTTAACAG 
CCATTGTCTA 
AGCAGACCTA 
TGTTAACAGG 
TCAATAAAAT 
GCGGTCGCCA 
ATACACCAGG 
CTTTCCCAGA 
ATGAGCCGTC 
TTGACAATTA 
TCAGCAAAAA 
ATTCTGGCAG 
GCAGAATATG 
GCAGGTGTGG 
GTGTCAAACC 
ATCCATGTAG 
GTTAAGCCTT 
CATCTAGTTG 
CTGCCAGAAA 
AACTTTGAAA 
GCCGGTGCGA 



TTTTTCTTTA 
TTGACAGGTT 
CAGGCAGCCT 
CTAGCAGACG 
GGCAGGTTTC 
TTTCCGTAAA 
AAATTCAGAA 
TATTGTCAAT 
CAATTTGCTG 
TATTTCCAAA 
TGCTGACATC 
CAAGGTTACG 
CGCACCAGAC 
TACCACTCGA 
ATTTTCATCC 
GATTGCTACT 
TTGTGCCGTC 
CCTGCAATTC 
AATTCCAAAA 
CAGATTATGC 
CCCATATCGA 
TCGAGAGCCT 
CTGAGCATCA 
AAGCAACGCC 
CAGTCGTTAT 
ACCAAGTTTT 
CCATGGATAA 
TCGAAGTGGA 
CTGTTTTTGT 



AGGTTTCCAA 
ACTTTGGTAA 
TGTCACCAAG 
CACTTAAAGG 
TACTATGTGG 
AAAGGCCATA 
GGCTATATCC 
ATCGATCAAG 
GATCGTTTCT 
ATGGATTTGT 
GGCTATGACT 
GTCTTTATGG 
CTCAATCTTG 
GCTGTTAGTT 
TTGGACTATG 
CTTAGCCGAG 
AAACCAGCTG 
CTTAGTGAAA 
TAAGGAGAAA 
CAACTTTGAA 
TATCATGGAC 
TCGTCCTCAT 
TCTGGAAGAT 
TCATATTCAT 
CAATCCTGGC 
AGTCATGACG 
GGTCCGTGAG 
TGGTGGCATT 
AGCAGGTTCC 



GGCTAGTTTT 
GACTGAAGAG 
TGTGCGTGGG 
GCAAGGACTC 
AGAGTGATGG 
CCCCTTATGT 
TCAAAATTCA 
CTGTAGTAAT 
TGGTTCTTTT 
TGGAAGATAG 
TTGTGACCAG 
GGCAGACAGG 
AAACGGGAGA 
TTTACAATCT 
AAGTATCAAG 
ATTGTAACTT 
TTGAAGAGGG 
TTGAAAATCG 
CCTATGTCTC 
CGTGAAATCA 
AGTCATTTTG 
AGTAAGATGG 
TTTGCGCGTG 
CGCGCCCTCC 
ACATCAGTTG 
GTTAATCCAG 
TTGGTTGCTC 
GATGACCAAA 
TATGTCTTTA 



ACCCATCGCC 
GTCAAGGACA 
GAAGCTCTCA 
TAAGATGCAG 
CCAGGTTTAT 
TGGGCACTGG 
CGAACGGAAA 
CATGTCCGTC 
GGAGCACAAG 
GGGAGAACTG 
TAAAGAGGAA 
TGTTGGGAAG 
AATTTCAGAC 
CAACGGGGGT 
GGCTGAAGAC 
CCGTACTTGT 
TGTTATTGCA 
TAGAGAAACC 
AATACAAGAT 
AACGTCTAGA 
TACCGCAAAT 
TTTTCGATTG 
CAGGTGCAGA 
AAAAAATTCG 
AAGCCATCAA 
GTTTTGGTGG 
TTCGTGAGGA 
CTATTGCTCA 
AGGGAGAAGT 



GCAAGACCTT 
AGCTGACCAA 
GCTTGGCAGA 
GGACAAATCA 
CAAACACGCG 
GTAGATTTCT 
AACAGTCTGG 
AAGGAACCTG 
GGCATCCATC 
GATTTTTACC 
CTCCTGTCTT 
TCAACTCTTC 
AGTCTAGGTC 
AAAATCGCAG 
CTCAATCAGG 
ACCCATACCC 
ACCTTCCGTT 
TATAAAAAAG 
TGCTCCGTCA 
AGCAACTGGG 
CAGTTTTGGT 
CCACTTGATG 
CATCATCAGT 
TTCACTCGGA 
GCACGTCCTT 
GCAAGCCTTT 
AAAAGGTTTG 
AGCCAAAGAA 
CAATGAGCGA 
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GTACAAACTC 


TCAGAAAACA 


ACTGGACTAG 


GGTTGCACTT 


TTTGCAGGCG 


GAAACCGCGG 


5460 


TCATTATCGG 


ACAGATTTTG 


ATGCTTTTGT 


TGGGGTGGAT 


CGAGGCTCGC 


TCTGGGTCTT 


5520 


GGAAGAAGAC 


TTACCTCTTG 


CTCTAGCAGT 


CGGAGATTTT 


GATTCTGTGA 


CGGAAGAAGA 


5580 


GCGACAGGTG 


ATTCAAAAAG 


GTGCCCAGTA 


TTTTGTCCAA 


GCACGACCAG 


AAAAGGATGA 


5640 


TACAGATCTG 


GAATTGGCTC 


TCTTAACCAT 


CTTTGAACAA 


AATCCTCAGG 


CTCAGGTCAC 


5700 


TATTTTCGGT 


GCCTTGGGTG 


GCCGTATTGA CCATATGTTG 


GCCAATGTCT 


TTCTGCCTAG 


5760 


CAATCCTAAG 


TTGGCACCCT 


ATATGCATCA 


AATAGAAATT 


GAGGATGGGC 


AAAACTTGAT 


5820 


TACTTATTGT 


CCAGAAGGAA 


TCAGTCAGCT 


AGAACCTCGT 


TCAGACTACG 


ACTATCTAGC 


5880 


CTTTATGCCA 


GTTCGGGATA 


GCCAGCTGAC 


TATTCTTGGA 


GCCAAGTATG 


AGTTGACAGA 


5940 


GGAAAATTTT 


TTCTTTAAAA 


AAGTGTACGC 


TTCTAACGAA 


TATATAGATA 


GGGAAGTGTC 


6000 


GGTAACTTGC 


CCAGATGGTT 


ATGTGGTCGT 


ACTGCATAGC 


AAGGACAGGA 


GGTAGGATGG 


6060 


AAAGTTTACT 


TATTCTATTA 


TTAATTGCCA ATCTAGCTGG 


TCTCTTTCTG 


ATTTGGCAAA 


6120 


GGCAGGATAG 


GCAGGAGAAA 


CACTTAAGTA 


AGAGCTTGGA 


GGATCAGGCA 


GATCATTTGT 


6180 


CAGACCAGTT 


GGATTACCGC 


TTTGACCAAG 


CCAGACAAGC 


CAGCCAGTTA 


GACCAAAAAG 


6240 


ATTTGGAAGT 


GGTTGTCAGC 


GACCGTTTGC 


AAGAAGTGCG 


GATTGAATTG 


CACCAAGGTC 


6300 


TGACCCAAGT 


CCGTCAAGAA 


ATGACAGATA 


ATCTCCTCCA 


AACTAGAGAC 


AAGACAGACC 


6360 


AACGTCTCCA 


AGCCTTGCAG 


GAATCAAATG 


AGCAACGTTT 


GGAACAAATG 


CGCCAGACGG 


6420 


TCGAGGAAAA 


ACTAGAAAAG 


ACCTTGCAGA 


CACGCTTACA 


GGCTTCCTTT 


GAGACAGTTT 


6480 


CTAAACAACT 


GGAGTCTGTC 


AATCGTGGCC 


TTGGAGAAAT 


GCAGACAGTT 


GCCCGTGATG 


6540 


TCGGAGCTCT 


TAACAAGGTT 


CTCTCTGGAA 


CCAAGACGCG 


ACGGATTCTG 


GGAGAATTGC 


6600 


AACTGGGGCA 


AATTATTGAA 


GACATCATGA 


CACCTGCCCA 


GTACGAACGA 


GAATACGCAA 


6660 


CGGTTGAAAA 


CTCTAGTGAA 


CGAGTGGAGT 


ATGCCATCAA 


GTTACCCGGA 


CAAGGCGACC 


6720 


AAGAATACCT 


*>» a n * v» i \*w^n 


ATTGACTCTA 


AGTTTCCACT 


uQiCAuA i I AT 


TACCGCTTOG 


£. -i art 

O/80 


AAGAAGCCTA 


TGAGACAGGT 


GACAAGGATG 


AGATTGAACG 


CTGTCGTAAG 


TCACTCCTAG 


6840 


CAAGCGTCAA 


GCGCTTTGCT 


AGGGATATTA 


GGAACAAGTA 


CATAGCACCA 


CCTCGGACGA 


6900 


CCAATTTTGG 


AGTTTTGTTT 


GTTCCGACAG 


AAGGTCTCTA 


CTCAGAAATC 


GTCCGCAATC 


6960 


CGGTCTTCTT 


TGATGATTTG 


AGACGGGAAG 


AACAGATTAT 


TGTTGCAGGA 


CCAAGTACCC 


7020 


TATCAGCCCT 


TCTTAACTCC 


CTATCAGTTG 


GTTTCAAGAC 


CCTTAATATC 


CAAAAGAGTG 


7080 


CCGACCATAT 


CAGCAAGACT 


CTTGCCAGTG 


TCAAGACCGA 


GTTTGGCAAG 


TTTGGTGGTA 


7140 


TTCTGGTCAA 


GGCACAAAAA 


CATCTCCAAC 


ATGCCTCTGG 


CAATATTGAT 


GAATTATTAA 


7200 
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ACCGTCGTAC 
CTGCGCTTGA 
AAAAAAGATG 
CGAGCTGGGA 
AAGCTCTGGG 
ATGAAAGGAC 
CTGCCTCAAG 
GTCAAGGAAA 
CAACGGATTG 
GCCAAGACCA 
CGTTTGGCAG 
GGGATTATGT 
TACACAGTGC 
ACAGTTATGG 
ATCCTCAGTC 
GCAGAGATTA 
CTTGCTTTGG 
TTCTATAAAC 
TGTTCGTTTT 
AAAATCTCTT 
AGGGTTTGTC 
TTCTAGTTTG 
GAAGAAGTGA 
CTACTCTCAA 
TOGTCATTAT 
GGGCTGGCGG 
TTGAAGACTT 
ATCTGTCTGA 
AAAGCTTTAT 



CATAGCTATC 
TCTACTCCAT 
AGTTATTTGA 
AAAACTACCT 
ATCCCCAACC 
GCCGAGAAGT 
CTGGTGAACC 
TTCGTGACTA 
TCCGAAATCT 
ACCACCATGC 
ACGCTATTAG 
TGCATGACTT 
GAGGTAATCT 
AACTCGGCAT 
ACCACGGCTT 
TCCATATGAT 
TGGATAAAGG 
CAGATTTAGA 
TTTATGTGAA 
CAAACTAGGG 
AGTTCTATTG 
CTTTTTGATT 
TCGGATGGTT 
TACTTTTGCT 
CAAACGCGCC 
AGGTGTCATC 
GCGTACCAAG 
TTTGCTTAGC 
GGACCAAAAA 



GAGCGGACGC TCCGTCACAT TGAGTTGTCA 
TTTCAAGAAA ATGAGCAAGA ATATGAAGAT 
AGGCTTTTAC CTAATCAAAT CAGCTGACCT 
AGCCTTTACC TTCCAAGATG ATAGTGGCGA 
TCATAACATT GAGGCCTTTA CCGCAGGTAA 
TTATAACAAT ACCCCTCAAG TCAATCAAAT 
CAATGACCCA GCTGATTTCA AGGTCAAGTC 
CATGTCGCAA ATGATTTTCA AAATTGAAAA 
CTACACCAAG TATGATAAGG AATTCTACTC 
CTTTGAAACG GGCTTGGCCT ATCATACGGC 
CGAAGTTTAT CCTCAGCTCA ATAAGACCCT 
AGCTAAGGTC ATCGAGTTGA CGGGGCCAGA 
TCTTGGACAT ATCGCTCTCA TTGATAGCGA 
CGATGATACC AAGGAAGAAG TCCTTTTGCT 
GCTTGAGTAT GGAAGCCCAG TCCGTCCACG 
TGACAATCTG GATGCAAGCA TGATGATGAT 
AGAGATGACC AATAAAATCT TCGCTATGGA 
TTAATAATTT AAGAAAAATG AGCATTTTTT 
TATGGTATAA TAAGTAAAAG ACAAAAATGA 
TAGTATCGCC TTGTCGTATG TATATATGCA 
ACAATCTCAA AACAGTGTTT TGAACCACCA 
TTTTGAATAA AAATGGAATA GGAAATAGAA 
GTCATTTCCA ACTATTTGAT TAATAATCCT 
GAAAAGTATG AGTCTGCTAA ATCATCCATC 
TTTGAGGAAA TTGAAATCCG TCATATCCAG 
TTCACACCGT CTATTTCGAG TCAGGATGCT 
TTGTCAGAAA GTGACCGTAT CTTGCCAGGT 
ACACCAGCCA TCTTGAAAAA TATTGGTCGT 
ATTGACGCGG TTATGACCGT AGCAACTAAG 



GAAGGTGAGC 
TACTCACATG 
CAGGCAAACT 
GATTGATGCG 
GGTTGTCCAC 
TACTCTCCGC 
ACCAGTTGAT 
TCCTGTCTGG 
CTATCCAGCT 
GACCATGGTG 
CCTCTATGCG 
CCAGACAGAG 
AATTACCAAG 
TCGTCATGTC 
CATTATGGAA 
CTCAACAGCT 
TAATCGTTCC 
AGGATAAGAA 
ATACTCTTCG 
GGTATATTAC 
GCGACCAGCT 
ATGAAATTAA 
TATAAACTAA 
TCAGAAGATA 
ACAGTGACTG 
AAGGAAATGG 
GGTTATATCT 
ATTATTGCCA 
GGTGTGCCAC 



7260 

7320 
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TTCCAAATGC 


AGTTGCCAAT 


GTCCTCAATG 


TCTCTTTTGT 


CATTGTGCGC 


CGTGACCTGA 


9000 


AAATTACCGA 


AGGTTCAACT 


GTTAGCGTCA 


ACTATGTTTC 


AGGTTCAAGT 


GGTGACCGTA 


9060 


TCGAGAAAAT 


GTTCCTTTCA 


AAACGTAGTC 


TTAAGGCAGG 


CAGCCGTGTC 


TTGATTGTGG 


9120 


ATGACTTCTT 


GAAAGGTGGC 


GGAACGGTCA ATGGTATGAT 


TAGTCTCTTG 


CGCGAGTTCG 


9180 


ACTCAGAACT 


GGCAGGTGTA 


GCGGTCTTTG 


CGGACAATGC 


CCAAGAAGAA 


CGTGAAAAGC 


9240 


AGTTTGACTA 


CAAGTCACTC 


TTGAAGGTAA 


CCAATATTGA 


TGTCAAGAAC 


CAAGCCATCG 


9300 


ATGTTGAGGT 


TGGCAATATC 


TTTGACGAAG 


ATAAATAAGA 


GATAGAACTA 


AAGGTTGGAA 


9360 


CGATTGTCCC 


AGCCTTTCTT 


TGCAAACAGA 


ATAGAAGGAA 


GCTTATGAAA 


ACACCATTTA 


9420 


TCAATAGAGA 


AGAGTTAGAA 


GCGATTGTTG 


CCGAGTTCCC 


GACTCCCTTT 


CACTTGTATG 


9480 


ATGAGAAGGG 


GATTCGTGAG 


AAGGCAAGAG 


CCGTCAACCA 


AGCTTTTTCG 


TGGAACAAGG 


9540 


GCTTTAAGGA 


ATATTTTGCA 


GTTAAGGCTA 


CTCCAACTCC 


AGCTATTTTG 


AAAATTCTCC 


9600 


AAGAAGAAGG 


nt,TGGTGTG 


GACTGCTCTA 


CTTATGTAGA 


GCTTTTGATG 


AGCCATAAAC 


9660 


TGGACTTTCT 


GGGTTCTGAG 


ATTATGTTCT 


CTTCCAACAA 


CACGCCAGAC 


AAGGAATACG 


9720 


CCTATGCACG 


TGAATTGGGT 


GCGACCATTA 


ACTTGGATGC 


CTTTGAAGAT 


ATTGAACATC 


9780 


TGGAGAGAGT 


AGCAGGCATT 


CCAGAAATCA 


TCTCTTGTCG 


TTATAATCCT 


GGAGGCGTTT 


9840 


TTGAACTGGG 


GACAGACATT 


ATGGACAATC 


CTGGGGAGGC 


TAAGTTTGGC 


ATGACCAAGG 


9900 


ACCAGCTCTT 


TGAAGCCTTT 


GCTATCTTGA 


AGGAAAAAGG 


AGCCAAGACT 


TTTGGGATTC 


9960 


ACTCCTTCCT 


AGCGTCCAAT 


ACCGTGACCC 


ATCTCTATTA 


TCCAGAGTTG 


GCTCGTCAGC 


10020 


TCTTTGAACT 


GGCTGTTGAA 


ATCAAGGAAA 


AGTTGGGCAT 


TTCGCTAGAC 


TTTATCAATC 


10080 


TTTCTGGCGG 


TATTGGTGTT 


AATTATCATC 


CAGACCAGGA 


GCCGAACGAT 


ATCGCCTTGA 


10140 


TTGGTGAGGG 


AGTTCGTAAG 


GTGTATGAAG 


AGGTTCTTAC 


GTCAGCAGGT 


CTTGGTCAGG 


10200 


TCAAGATTTT 


CACCGAATTG 


GGTCGTTTTA 


TGCTGGCACC 


TCACGGTGCT 


CTAGTCACAA 


10260 




TAAGAAGGAA 


ACCTACCGTA 


CCTATCTAGG 


TGTGGATGCC 


TCAGCAGTCA 


10320 


ACCTCATGCG 


TCCAGCTATG 


TACGGAGCTT 


ACCATCATAT 


TAGCAACGTG 


ACCCATCCAG 


10380 


ATGGACCAGC 


TGAAGTGGTA GATGTGGTCG 


GTTCACTCTG 


TGAAAACAAT 


GATAAATTTG 


10440 


CAGTTAATCG 


CGAACTGCCT 


CATACAGAAA 


TCCGTGATTT GCTGGTCATT 


CATGATACAG 


10500 


GTGCCCACGG 


ATTTTCAATG 


GGCTACCAGT 


ATAATGCCAA ATTACGTTCT 


GCGGAAATCC 


10560 


TCTATACCGA 


AGAAGGTAAA 


GCCCGTCAAA TCCGCCGTGC 


AGAGCGCCCT 


GAGGACTATT 


10620 


TTGCAACCTT 


ATATGGCTTC 


GATTTTGAAG 


AATAATCTGA 


TAATAGATTG 


AAAATGAAAT 


10680 


TGAAAAACAG 


ATTGCTTTCT 


AAAAAATAGG 


CAAAAATCTT 


GTTTTTCCTT 


CAAGTCGTGA 


10740 
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TATAATAAAA 

TTATGGACAA 

TGGTATTCGC 

CCCAGACACT 

GACAGAGGCC 

AGGAGAAGCT 

AGGAATCCAC 

TGAGAAGTTT 

CAACGAAGAA 

CCAGTATCTA 

GACAGACAGA 



CTATAAAACG 
GTGACAGGAA 
TTTATTGTCT 
TGGGCTATGG 
TTGCGCTACC 
CTCTTGCAGA 
TGTACCTTGG 
GACAAACTCA 
CAGCACAAGA 
TCAGATATTG 
GATGATGACT 



TGATAAGTTT GAAATTCTAC 
AATTCCATAT TCCCTCGAAG 
ACAACTCATG GATACCGAAA 



AAGCCTGATG 
TTTTTCTTCT 
TGTGATAGCA 
ACTTGCTTTT 
AACGATATAG 
AAGTGTGACA 
AATCATTTTC 



GAAACATCGG 
TATCTCGAAC 
GTTGGTTGTT 
GGTGGGTTCT 
TTGACGATAA 
CTAGCTGGTG 
TTGATTTGTT 



CGATTGAAGA 

AGGAATGCCT 

AGCAGAGAAG 

GATACCGTGC 

AGTAGTCGCT 

CTTTTCAGAG 

GCGTACGATA 

GAAGTACTTG 



TAGTAAAGGA 

AGGTAGCTTG 

AAAGGCATTT 

TCTTTTTGAG 

TCTTTGAGAC 

TGAAGCGTTG 

ATGGTTACGA 

ATGGCTTCAG 



TTTTCAAGGA 
TGGTGCATTC 
TTTTGCAGGG 
AGTCCAATAA 
GTGGTTTCTG 
TTGATTTCCT 
ACACCTGTGC 
TGGCTGTCAC 
TTGTCACTAG 
GAAAACCTGT 
TGATTGAACT 
CTTATCACAC 
GAGTCAAACC 
GTTATCAAGA 
GCTTTTGACT 
GTTGTTTTCC 
CAGGGGTAAC 
TGCCTAGTTC 
ACTGTTGGAG 
AGAAGAGGGA 
CTCTTTGCAT 
CACCAGCACA 
CTTGAGCAAC 
GAAGGAGGAT 
CAGCAAAGAG 
GCGTTTCGTG 
CCTTCCATGA 
TAATGATAGC 
CCATAGGCGC 



AGGTAACGAT 
GACAGAAAGC 
CTGTCACATG 
GTCACGTGAA 
GGGAAATAAG 
GATTGCTCTC 
TCTTCCTTTC 
TGACTTGGTT 
CCAAACCAAT 
CTGGATTCGC 
TGGTAAGTTC 
CATGGGTGAG 
ACCAACAGCA 
TTATATGAAA 
TGCAAAAAGA 
AGCGTTGCGA 
GTCTTTTCGT 
TTCACGGACT 
AATCATCATG 
GAAGACGACG 
TTCATCTTCT 
GGCAACCAAA 
CCCTTCAGTA 
AGGGAAACAT 
AGCTTGTTGG 
TGGCTCAAGG 
TTGGTAGATA 
GACACCAAAG 
TCCGATCGTA 



ATGTCTGAAG 
TTTGGGTCAG 
CGTTGCCAGT 
CGGACGGTAG 
GGTGGGATTA 
TTCACCAAGG 
CCTAATAAAC 
CTTTTGGATA 
AAAAATATCT 
CACGTGCTAG 
GTCAAGACCC 
TTCAAGTGGC 
GATCGCGTCA 
CGTGTACATG 
CTTAGCAAAT 
TTTTTGTGTT 
CCACTTGGTT 
TTTTTGCGAA 
AAACCACCGA 
ATCATGAGTG 
ACTCCGTGAA 
ATCATACTTG 
TGTTGGGCAG 
CCTACACCGC 
GCTTCGAGTT 
ACGTGCTTGA 
CCAACTGGTA 
CCTAGACCTT 
TTCCAAATAA 



AAACAATTGA 
TAGATGGGCC 
ATTGCCACAA 
ATGATGTCTT 
CAGTCAGTGG 
CTAAGGAACA 
CACGTTACCT 
TCAAGGAAAT 
TGGCTTGTGC 
TTCCAGGATT 
TCAAAAATGT 
GTGAACTTGG 
AGAACGCTAA 
GATAGAAAAG 
CAGCTAAGCC 
TTTTCTTGCT 
TAGAGAAAGC 
GTTTTGGACG 
CAACCCAGTA 
GGCTCATGTA 
GTGAAAGGAG 
GAGAACCTAG 
CAAAGTAGAT 
CAAACATGCT 
TTTCTTCTTG 
GGGCGTTCAT 
AGATAATCAA 
TATCAGTAGC 
ATCCTGTTGG 



10800 
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10920 
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11040 
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12240 

12300 

12360 

12420 

12480 
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CTGACCTGTG GTTTTATCGA CATTGACACA GCCAGTCAAG ACAAGCAACA TAGCCACTCC 12540 

CATAGCCGAG AGTGCAAAAT CGGGGT - 12566 

(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

TGACACTCTG TAGGATTGTC GTTAATTGAT TGCTCGTACT CTCTACAATA ACCACCAAAC 60 

TAAAAACGAC ATAGAAAGAT AGCATCAGCT GTAGCCATAG CGCCTTTGAC ACCTTCTGGA 120 

TGATTATGAG TTACCTCTGC AGAAAGACTC GTAAGTCCTC TAGATGATGG CCATATACCA 180 

GTTTTCGCAT AAAAACCACA GTCCATGATC CAAGCACATG GAGAAATACG CATAGCTGAT 240 

CCATTCCCAA AG CT ATT AT A AGGCTCACGG TTATCGCTGT TTAGCCATGC ATTAAACCGA 300 

GCACCGTAAT CAGCATTCGG ATACATTCTG CCATATTTCT TCATCCCGTC AATGAAGTCA 360 

TCTTTTTCTC CACCATTCAT AATTGCTTCT GCAACACCAC AGCTCATAAC CGTGTCATCT 420 

GTAAAAAAGC AGTCCTTCCG AAATAAAGGA AAGTCCTTTG TTTTGATATT GTTCCATTCG 480 

TAAACAGAAC CGACAATATC TCCAATAATT GCTCCAAGCA TCAGATTCCT CCTTGTTCAT 540 

TTTGATGCTT TTTATATTGG TTATCTACCA TATTTATTTT AGAAAATAAC ATCCTGTTGG 600 

ATTTTAAAAA TTTCATTTTT TTCAAAATAG GGTTTTACCA TTTCTTTCCA CCTAGCTCTA 660 

TGAAAATTGA TTGATTTTAA AGGAGATAGG CCATAATTTC CCAATGCATA ACCATCATTT 720 

ACTTCAACAA CAAGTGTTCT GCCATCGCGA GTAACACCGA TATCTAGTCC ATAAGCTATT 780 

GGCGCATCTT TCCAACATGA TATCGCTTCA TCAATTACAC TTGCATCAAA TTGTGCATGA 840 

TAATCACCTG TATAGGGTCG AACATCTAAT ACGCGACCAT CTAACACAAA ACAACGCCAT 900 

TCAGCTATGA ATTCTACAAC CTCACTAATC CATATAGGAT AGTCGAAAGG TAGACCAATA 960 

CCTATTAAAT CATGGGTTCC ATTAACAACT CTTCCAGTAA AGACTTTTGA ACCAGCTTTA 1020 

GGCTTAATAA ATTTTCCCCA ATTATCAGGT ATATTCACAA TCTCTCCTAA AATACCAGCA 1080 

TAAATCTTTC GACCATAAAA CTCTTTAAGC TCAATAGGAT AGTCATGAAC CGGAACCTTT 1140 

AAGCCCATCA TTTTTAGTAA TGCTCTAGTC TCCATTATAT AATCTACAAC TATATCTTCA 1200 

CTTGTTAACT CTTTTATTTC AGAAAAAGAT TGATATAAAA TAACTTCTTC TCCTTGTAAG 1260 

TAGGCACCTA CTTGAGCATT GTATTTATTA ATTGAAACCT CACTTGGTAA TTTACTTTGT 1320 
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CTAATATAAA CAACCATTTC ATCACTCCTA 
ATAATAGCAA TTTTGCTCTT ATTTTTTTGA 
ATCAACGCTA TTTAGGAGTA ATTCAATAAC 



ATCTCTAGCA TCTTCTACTA AATTTTCAAG 
CATTATTCTA TTTTTATCGG TTGCCATTTT 
TAACACTAGT GTCAACTGGC TTTTATAATA 
CACAGTAACT TTAAATCTTT GGTATTAAAA 
TGTCTCAAAT CAGTTATCAA ATCTAGTATA 
TCTAAACAAG AAAAAGACTT ACACTCAAGG 
TGACTCTTTT ACTAGCAAAG GTATATACTC 
TCCAACTTCT TCTTTAACAT ATCCTTCTAC 
GTGACACAAG AAATGCCAAA CTTCGATCCC 
TTCACTTCCG AAAAAGCTTC TGTCGATTTC 
TTTACGATAG TTCGTTTCTC TTGTTTCGAC 



ATATGCATCA 

AATCTCTTTC 

AACAATATTT 

CTCCACTTCA 

TTTCCCATCT 

TTCCAATATA 

GTCACCTGAC 

GTCTTATACC 

ATCTACTTTT 

TGCGTCAGTG 

TAAACTTACA 



ATTTTTGAAT ATCCTTCGAT 
CAAATAATGT TTACATTTTC 
CCGTTAAAAA TAATTTCCAT 
AAACCATCTT CTGTTTCCAG 
TCTATGGAAT CAAATGCTAC 
ACCATCGATA ATCTCTCCAT 
CAATTCAGGC TCTCTGTATC 
CAGAACACAC CTTATCGACC 
ACTTTGCTGA TGCTTCAACT 
GGACTCAAAA GGTTCGGGGA 
CATTCAACTT GTTCATCATT 



AATTGGTAGC TTAAAAGTAA TGGATTTTAG 
AACTTGAATT TCAGAAATCA AAGCTGAAAT 
TTTATAGAGC TTATCAAAAT AGATCAGAAC 
AGCTTCAATA GTCTGTTTCT TTGCTTTCGC 



TATCACTAGT 
GTAAATAGCC 
CTGGGACTTT 
TTTCTCTAGA 
CATCACCTCA 
CATTAGTTTA 
AATTTTCACA 
AATTATGAGC 
GTTTTCTTCC 
ACAAGGAACT 
ATCTTCAATC 
TTTTTTTCTG 
ATATCCGCGG 
ATAGGCTTTA 
CACTCTATCA 
CTCAGGATCG 
ATAATCCGGT 
AGTGTATCCC 
TAAATCTTTA 
TTTCATTATC 
ATCTCATCAT 
TTCGGTCTCA 
CGTACAAGCA 
ACCTTTTGAG 
GTCCAAACCT 
CCATTGTCCG 
TAACTGCCTA 
CTTATATATG 
ATCAATTAGT 



GTTACACCAA 
CCCATAATAT 
GTTAGTCGCA 
TTTTTATCAT 
AGTTAATTCT 
AAAGTGGAGA 
ATATTTATAG 
GGCTACTCTA 
CCCCCTTCGT 
TTGGTTGACT 
TCTACAAACA 
TAAAGAATCG 
CTTTCTAAGA 
ACTTCATGGT 



TTTGTAAAAA 
CATCGAAATA 
TTCCCCTTCT 
CCAAGCTAAT 
ATCACAGGTG 
GGATTTTTAA 
AAATAAAATC 
ATACTTTCCC 
TATAACGTTT 
ATTGAATCTC 
TTGGGTCTAA 
CTTCACCGTC 
AGTCTTTTGC 
TGTTAACGAC 



TTTTTGAGCC ATAAATTTGA 



AACATAAATT 
ATGTTTTTAG 
GGGATTTGAG 
GAATAATCAT 
AGGCTAATGT 
ATTTCCTACT 
CCTCGTCGCA 
GTGATACCGC 
GATTAACTAC 
ATGTTGAGAT 
TTAGATTGTT 



TAGATAAAGG 

GATTAAAATA 

CTACAAAGGC 

TTTGGTACAA 

AAATAAGCAC 

TACTTTACGA 

TTGGCTGAAC 

CTCAGCGTGA 

GTTTCTCTAA 

TTTCTTCTAT 

TTTCTTCATA 



CGCTCTACAT CATTCATGAC 



TTATCTCCTG TAAGCTTTTC 
GATGATTCTA ATTCATCTAG 



13B0 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 



WO 98/18931 



PCT/US97/19588 



1008 

TTTGTCATAC ATACGATATA GTCTATCATC TAAATCCTGT TTCCTTCTCT TATAATGCTT 3120 

ATCTTCAACA TCTAAATTAT CTATTTCCTC AATTAGCTTA AACTTTGTAG AATGACTCTT 3180 

TCTCAATTCC TTTTGGTAAT TATCTATTTC TTTTTCTATT TCAGAGGTAT CCACCTTCAT 3240 

GTTGATTTTT TCTTGCATCA TAGAAGCAAA TTTCGGATTA CTTACTATCT TGACAATCAC 3300 

CTCTGCAACA GCATCATCTA ACAATTCTTC TCTAATTTGC TTHCTGAATG TACACTTATT 3360 

ACCTCTTATC ATCTGCCTAT GGTTACAACC ATAGTAATAA AAATCTTTAT ACTTTGTGCC 3420 

ATCTTTCTTT TTCTTGATAC ACTTGTTCCC AAACATTCCC ACTCCACATA TCGGGCATTT 3480 

TACAATTCCA GAAACCAAGT GTCTGCGTGT ATCTTTTCCT TTATTCACAT GCTCATATTT 3540 

CTmCTTGA GATTTTAGCT TAACCTGAGC ACCTTGCCAA ACTTCATCGG AAACTATAGC 3600 

TTCATGTATC CCTTCAGATA TTAGATATTC ATCTTGTTCA ACCTGCTTAT ATTCATTTCT 3660 

TGTACCATGA ACTTTTTCTA AAGTTCTTCT TCCAAATGCT ATTTTCCCAT TATATACAGG 3720 

ATTCTTTAAT ATCTTTCTTA TAAGACCTGC ATCAAACAAA GGATTCTTAC CATTCTGTCT 3780 

TGGGATTTTT CTAATTCCAT GATTCTCTAA GTATTTAGAT ATCCCATTGG CTCCTATCGT 3840 

AGTATTTACA TACTGCTCGA AAATCGTTCT TATTGCAACT GCCTCTTCCT CATTTATAAA 3900 

CAGCTTGCCG TCTTCAAGTT TATATCCATA CGGAGCAAAG CCACCATTCC ATTTTCCTTC 3960 

CCCTGCTTTT TGAATGCGAC CTTCCATTGT TTGAATACTG ATGTTTTCTC TTTCTATTTC 4020 

AGCCACAGCT GATAAAACAG AAATCATTAG TTTCCCAGCA TCTTTAGATG AATCAATGCC 4080 

ATCTTCAACG CAGATAAGAT TAACTCCATA ATCCTGCATT ATATGAAGTG TAGAAAGAAC 4140 

ATCAGCGGCA TTTCTTGCAA ATCTTGATAA CTTAAACACA AGAACAAAAG ATACTCCATC 4200 

TTTTCCAGAT TTTATATCTT CCATCATTCG ATTGAACTGT ATTCTACCTT CAATAGACTT 4260 

GTCAGACTTC CCGGCATCTT CATACTCTCC AACAATTTCA TAATCGTTGT AAATAGCAAA 4 320 

AGCTTTCATT CGTGATTTTT GTGCCTCTAA CGAATACCCC TCTATCTGTA TTGACGTAGA 4380 

TACTCGTGTA TAGAGGTATA CTTTTATTTT TTCTTTTGAC ATAGTATTAA CCTCAATATA 4440 

ATTTTTCTAT ATCATATATA ATTTTTTTAA TTTAAGTTTG GACTATCATT TCAAGTATAT 4500 

TATAACACTT TTATTAGTCC GTCTCAATTT GTGTTTTTGC CATGTCAAAA CTATTTTTCA 4560 

TCTCTTGATT TTTTGCTGGC GTTGGATCGG GTAGATTATC TAAATCTAAA GCACCAGCAT 4620 

ATTTTGCAAT CAGATTTGCT ATTAAATCAG CCAATCCATT CCAGTCATTG TCCAATATAT 4 680 

ACCTCCTCTA AAGTTTTATA TCTAATAATT ATTTGTTTAA TTAAGTTTTT TGACATTGAC 4740 

AAGTGCTTTG GATTAGCAAC ATAGGAATCT CACTTCCGCC TCTATTCCGG ATGAGCCGGC 4800 

TTCAACCTTA GAAGTATCAT TACCCTCATT TTCTTCATAG CGGATAGGGT ATCCCTCCCT 4860 
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ATATTCAAAC TCTTACTTAT CCCTCACTTT CTTTTTGCTT AGCAGAACTT TTTTTGCCGA 4920 

ATTATTCAGC CGAAAGATCT TGACGGATAG GTTATTACGC TCCAAAAATA ATTAACGTCT 4980 

TGTCTTGGTC TATTCAATTG TTAAGGTTCA AAATTTATCG AGAGTTATTA ATCTTTTTAA 5040 

AATTTGACCA TCAGAAAATA TTTATCTTGA TGTAACAAAA TTCTATAAAT TACCCTCTTA 5100 

TACTTAACAG TGAAAAGAAG TCTTTCTTGG TAACCAATTT TGAAATAGAA TTTGCTTATA 5160 

TAAAAAGGTC CAATTCCCAC TGCATAAATA GCAGTGAAAA TTAGACCCTC TTGGTAACTC 5220 

TCATCTAAAA GTCTTCTA 5238 
(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

GACGATTTAC GAAGAATCGA ACAAGAACCT GCTCCTATCA ATTCCCAACC TCTATCTCTA 60 

AAATCTTGCA GTTCATGCTT ATACTTTTTT AAGAAATCTA GAATCATAGA TACGGTAGAT 120 

GACATCGTCT GGTTGACATT GGTCAAAATA GAACAAACCA AAACGACTCG TTCTATACCT 180 

CCAACCTTTC AAATGCATCT CATGTAAATG TTCTTCTTCC TTGTCCAAAT CAACAATCGT 240 

GAAAATCCGA AATTCTACTC TGCTATTCAT TGTCTTACCC CAAAATTAGA AAACATGCCT 300 

GGCGTTATTT ATTAGATAAT TCTTTCCACT TTTGACTCAA TCTCCAAAAA ATATAAGAAA 360 

TCTGAATCGC AAAAACTATC AATAAAACCC AATCTATTAT GAAAATCAAA AACACTTTCC 420 

AACTGAAAGA ACTACCTCCA GTGACAAACT TTGAGAAAAA CGGTAGTAGA GCTAAAAAGA 480 

GAAATAAAAT AGGAAGCATC CGCATTGTTA AAATCCGTTT GGCATAAAAA AATCTTTATT 540 

TAAACGAAAA TATTATGGCA AAATTTACGC CAGTTTTTGA ACGGCTGATG TAGATATTTT 600 

ATACTTTCAA AATGTTTAAA TGTGATTATT TATTTTTGAA AAATAGATCA CCAGCCCGAC 660 

TGAAAGTGCT TATAGAATGA TAATAAGTCG CCTGCCGAAA ACAGCGAAAA ATAGCGGTGT 720 

TATGCGGAGA TAATCTCACG CGATGCGAAA GTATATTGCA TACTTATTTT CAACAATTTA 780 

GCAGAGTATT TTTATAAGTG TGATATAATA GAAGTATAAT TTGTTCTGAT ACTTTATTTT 840 

ATGGAGAAGT AGATTTTTAG AATGCGGAGG CTTCAATATG GTTGAGTTTA TAAAGTCTAA 900 

GAAAGAAATG AGTGAGGAGG ATATTAAAGC AAATTTCATC ACTCCTGCTA TTGTATCCAA 960 
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AGGATGGAAA 


AATGGTGAGC 


ATATCGCTTA 


CGAAGAATAC 


TTCACTGATG 


GTCGAATTGA 


1020 


AGTTAGAGGA 


GATAAGGCTC 


GTCGTAAAGA 


AGGAAAAAAA 


TCAGACTATT 


CACTGTATTA 


1080 


CCAATTTGGA 


ACTCGAATTG 


CAATTGTTGA 


GGCAAAGGAT 


AATAAACACA 


GCGTTCGAGC 


1140 


AGGATTACAA 


CAAGCTATTG 


AATATGGAGA 


GATTTTACAT 


GTTCCATTTG 


TTTATTCTTC 


1200 


GAATGGTCAT 


GGCTTTATTG 


AACACGACCG 


TATCACGAGA 


GAAGAACCTG 


AGCTGGAGTT 


1260 


AGACGAATTC 


CCTACTCGTG 


AAGAATTATT 


TTCTCGTATG 


ACGAAGGAAA 


AAGGATTGAC 


1320 


GTACGAAATT 


ACAGAAGCTA 


TCTCAACTCC 


ATACTATACA 


GACGCCTTCT 


CAATGAAAAC 


1380 


GCCACGCTAT 


TATCAGCAAA 


TAGCTATCAA 


CCGTACTATT 


GAAACAGTTG 


CCAGAGGACA 


1440 


AAAACGAGTA 


ATGTTTGTGA 


TGGCAACAGG 


AACGGGGAAA 


ACGTTCATGG 


CTTTTCAAAT 


1500 


TATTCATCGC 


CTTCGAAAAG 


CTGGTTTGGC 


TAAACGAGTT 


TTATTCTTAG 


CAGATAGAAA 


1560 


CATCTTACTA 


GACCAAACGA 


TGGCTGAAGA 


CTTTAGGCCA 


TTCGAAAAGG 


TAATGACGAA 


1620 


AATTACACCA 


AAACTTTTGA 


CTGCTCCTGA 


AAAATTAAAT 


TCTTTTGAAA 


TTTATCTAGG 


1680 


GCTTTATCAG 


CAACTAACTG 


GTGAAGATGG 


AACTGAAACA 


CATTATCAAA 


AATTTGACAA 


1740 


AGACTTCTTT 


GATTTAATCG 


TAATTGATGA 


AGCGCACCGT 


GGTTCAGCTA 


AGGAAAACAG 


1800 


TAACTGGCGT 


AAGGTAATTG 


ATTATTTCAG 


TTCTGCGACA 


CAGATTGGGA 


TGACCGCTAC 


1860 


TCTTAAAGAA ACCAAGAATG 


CTTCCAATAC 


GGAATACTTT 


GGTGAGCCAA 


TCTATACTTA 


1920 


TAGTTTAAAA 


CAGGGAATCG 


AGGATGGTTT 


TTTGGCTCCA 


TATCGTGTTA 


TGAGGGTTAA 


1980 


TTTAGATGTG 


GATGTGGATG 


GTTATCGTCC 


AGAAACTGGA 


AAAGTTGATG 


CTAACGGACA 


2040 


ATTAATAGAA GATAGGTACT ACGGCAGGAA AGATTTTGAT AAAACCATTG 


TCATTGATGA 


2100 


TAGAACGCAA 


AGAGTTGCCA 


AGTTTGTTTC 


TGATTATATG 


AAGCAAAACA 


ATGCACGATT 


2160 


TGATAAAACA 


ATTGTTTTTT 


GTGTTGATAT 


TGACCATGCC 


GAGCGAATGC 


GTGCTGCACT 


2220 


TGTAAAAGAG 


AATCTAGACT 


TAGTCCAAGA 


AGACTATCGT 


TATGTCATGC 


AAGTAACTGG 


2280 


TGACAACGCT 


GAAGGAAAAG 


CTCAACTGGA 


TAACTTTATG 


GATGTCAATT 


CTAATTTTCC 


t U ft 


CGCTATTGTA 


ACAACGTCTA 


AATTATTAAC 


GACAGGAGTT 


AATGCTAAAA 


CATGTCGTTT 


2400 


GATTGTTTTA 


GACTCTAATA 


TCCAATCCAT 


GACTGAATTT 


AAACAAATTA 


TTGGTCGTGG 


2460 


CACACGTCTT 


TATCCTCAAA 


AGGGGAAAGA 


ATTTTTTACG 


ATTATTGATT 


TTCGAAATGT 


2520 


TACCAATTTG 


TTTGCTGACC 


CTGATTTTGA 


TGGTGATCCA 


GTGAAGGTGC 


TAGAAACAGG 


2580 


TGCGAAAACA 


GTCAGTGGTT 


CTACGCCCGG 


TTTCGTAGAT 


GAGGAAGGTG 


ACCCAGTAGA 


2640 


AAAATATATC 


GTTACAGACA 


AGCAGGTTAC 


CATTCTTAAT 


TCTACTGTTC 


AAGTATTGGA 


2700 


TGAAAACGGG 


AAACTGATTA CCGAAAGCCT GACCGACTAC ACTCGAAAGA ATATCTTAGG 


2760 
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TAGCTACGCC ACTTTGAACG 
TATCTTAGAC GAACTTTATA 
AATATCAGAA CAAGAAATCG 
AGAATTAACC AAAACGGAAC 
TAGTGAGGAA GCGCGTGCTG 



TGGAGAACTC 

CTTCAAAATC 

ATTGGAGCAA 

TGTAAAAAGA 

TATTGAGCAA 

GGAATTAGAA 

GGCTCATGCT 



GAAAGCATTG 
ATCAATACTT 
GAGCTATTTA 
ATTCAAGATA 
ATGTCTTGGT 
GAAGACGAGT 
CAAAATGGGG 



TAACAAGTTA 

AACGATTGTT 

ACGCCAAGTC 

GTTTAATGAT 

ATTTTATACG 

AGAATCAATG 

TTTAAGTAGT 

TATTGAAAAG 

TGATGACCCT 

TGATGAAAAA 



TTCAAAGAGT 
AAATCAGCTT 
ATCAATGTTA 
ATTTACGAAA 
CCACGTGCAG 
GCAGACCTTG 
CAACGTAAAA 
AAAGCATTTC 
AAAATTGTTC 
TTTGACATTA 



1011 

ATTTTATCAC AGTTTGGCAT ACGGCAGATA AGAAGAAGCT 
AAAAAGGAGT TTATCTAGAT GCTATTCGAG AGTCGGAGGG 
ATCATTTTGA TTTACTCCTA AAACTTGCCT ATGGTCAAAA 
GTATCAATAA ACTCAAACAA AGCGGATATT TATATAAATA 
TTTTGGAAAT TTTACTGAAC AAATACATGG ATAAAGGTAT 
AAACATTAAA ACTTCCAGAA TTTCAGATAT ATGGTGGAAC 
ATTTTGGAGA TAAAAAACGA TATTTACAAG CAATTAAAGA 
CAGTAGCTTA ATGAAAGGAA AGTATGTCAA TTACATCATT 
TCACTCGAAA CGATGCTGGT GTTAATGGTG ATGCTCAACG 
TATTATTCTT AAAAATTTAT GATAGCCGTG AAATGGTTTG 
ATGAGTCAAT TATCCCAGAG GAATTAAAAT GGCGAAATTG 
AACGGGTATT GACAGGCGAT GAATTACTTG ATTTTGTCAA 
TGAAAGAGCT TGAAATAACT TCAAATATGC CTATTCGAAA 



TTGAAGATGC 

TTGATGAAGT 

AAATTCTTAA 

CGACTGATTT 

CTTGCGGAAC 

CTAGTGAAGA 

CTCATCTTTT 

ATGGAAATAC 



GAACAACTAT 

TGATTTCAAT 

AGATATTCAA 

TATTGCCGAA 

AGGAGGCTTC 

TACCAAAAAA 

AGCAGTTACA 

TTTGGAGAAA 



ATGAAAAATG 

AGCCCTGAAG 

AATGCTGGGA 

GTTCTTGACC 

TTGACTTCGA 

TATAATACAG 

AATCTGTTTC 

AATGTTCGTG 



GCGTCTTGTT 

ATCGTCATTC 

ACTCAGGAGA 

CAAAACTTGG 

CTCTGAACCG 

CTGTTTTTGG 

TTCACGAAAT 

AATATACGGA 



AATAAAAAAT 
TGTCATTATG 
TCTATTTGGT 
GCATACGATT 
CATTCTTTTC 
GCCAGATGGT 
TGTTCGTGAC 



AACTTTCCAG 

TATCGTTTGA 

GAAGGTGTAA 

ATTAGGTTGC 

TTTGATAAAA 

TATAAAAATT 

TGGTGGGAAA 



TTATGATGAA TCCACCTTTT GGAGGGTCAG AATTAGAAAC 
CAGAATTACG GAGTTCTGAA ACAGCTGATT TATTTATGGC 
AAGAAAATGG TCGTGTTGGA GTTATTTTAC CTGATGGTTT 
AAACTCGCTT GAAACAAAAA CTGGTAGATG AGTTCAACTT 
CTCATAGTGT CTTTGCACCG TATACAGGAA TCCATACGAA 
CAAAGAAAAC AGAAGAAACT TGGTTTTATC GTTTAGATAT 
TCTCGAAAAC TAAGCCGATG AAGTCAGAAC ACTTCAATCC 
ATCGTGAAGA GATTCTGGAA GGTAAGTTCT ACAAATCTAA 



2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 
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ATCATTTACA CCTAGTGAAT TGGCTGACTT GAATTATAAT TTAGACCAGT CTGACTTTCC 4560 
AAAAGAGGAA GAGGAAATCT TAAATCCCTT TGAGTTGATT CAGAATTATC AAGCGGAAAG 4620 
AGCAACTTTA AATCATAAGA TTGATAATGT ATTAGCTGAT ATTTTGCAGT TCTTGGAGGA 4 680 

CAAATAATGA CACCAGAACA ACTTAAAGCA AGTATTCTCC AAAGAGCGAT GGAAGGGAAA 4740 
TTAGTGCCGC AAAATCCCAA TGACGAACCT GCAAGTGAAT TATTAAAGAG AATTAAAGCT 4800 
GAAAAAGAAA AACTTATCAG TGAAGGAAAA ATCAAACGAG ATAAAAAGGA AAGTGAGATA 4860 
TTTCGTGGTG ATGATGGGAA ACATTATGGG AAGTTTGCTG ATGGAAGCAC TCAAGAAATT 4920 

GATGTTCCTT ATGATATTCC TGATACTTGG GAGTGGGTGA GGTTTTCTAC ATTGGTTGAA 4980 

ATTGTCAGAG GTGGCTCTCC ACGACCAATC AAaGATTATC TTACTTCTGA AGTAGATGGA 5040 

ATAAATTGGA TAAAAATAGG TGATACTGAA AAGGGTGAAA AGTATATAAA TAATGTTAAA 5100 

GAAAAAATCA AAAAATCAGG GCTTAACAAA ACTAGATTTG TAAAAAAAGG TACATTTTTG 5160 

TTAACTAATT CTATGAGTTT TGGTAGACCT TATATTTTGA ATGTTGATGG TGCAATACAC 5220 

GATGGATGGT TGGCTATTTC GAACTATGAA AACTCATTAA ATAAAGATTA CCTATTCTAT 5280 

ATTCTTTCAT CAAATGTAGT TTATTCTCAA TTTCTATCTC TAATTAGTGG AGCTGTTGTG 5340 

AAAAACTTGA ATAGTGATAA AGTTGCTTCT ATTCTTATCC CTCTCCCCCC ACTATCCGAA 5400 

CAACAACGAA TAGTAGAAGC AATCGAATCA GCTTTAGAAA AAGTAGATGA ATATGCTGAA 5460 

AGTTATAATA GACTAGAACA GCTAGATAAA GAATTTCCAG ATAAACTAAA AAAATCTATT 5520 

CTTCAATATG CTATGCAAGG AAAATTAGTT GAACAAGACC CAAATGATGA ATCAGTCGAA 5580 

GTTTTACTTG AAAAAATACG AGCAGAAAAA CAAAAACTCT TTGAAGAAGG CAAGATTAAA 5640 

AAGAAAGATT TGGACATTTC TATTGTTTCC CAAGGAGATG ATAACTCTTA TTATGGGAAT 5700 

ATACCTATGA ATTGGGTTGT TATAAAAATA AAAGATATTT TTTCAATAAA TACAGGTCTT 5760 

TCTTACAAGA AGGGCGATTT AAGCATTAAT AATAAAGGTG TTAGAATTAT ACGTGGTGGT 5820 

AATATTAAGC CTTTAGAATT TTCTCTGTTG GATAATGATT ACTACATTGA TACACAATTC 5880 

ATCTCCTCTG AGCAAGTTTA TTTAAAACAT AATCAGCTAA TAACACCTGT ATCAACCTCT 5940 

TTAGAACATA TTGGAAAGTT TGCAAGAATC GATAAAGACT ATGATGGTGT TGTGGCTGGT 6000 

GGATTTATTT TCCAATTAAC ACCATTCGAA AGTTCAGAGA TTATTTCAAA ATTTCTATTA 6060 

TTTAACTTCT CCTCTCCGTT ATTTTATAAA CAATTGAAAG CAATAACTAA ACTATCAGGT 6120 

CAAGCTTTAT ATAATATTCC TAAAACTACA CTGAGCGAGC TATTAATTCC GTTAGCTCCT 6180 

TTTGAGGAAC AGGAACTTAT TACTCAAAAA GTTGAGAAAC TTTTTGAAAA AGTAAATCAA 6240 

CTTTGAAAAT GATTCTTTTC ATCTCTTCAT GATTAGAAAT AGGGATTAAT AATTCGGAGA 6300 
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TACTGGTACT 
TAAGTGGCAA 
ATATCGATAT 
TAGATAATTT 
TTAATGCTAA 
CTGATATAGA 
TTTTTCCTAT 
TTTCATATAT 
AAAAATAGCA 
ACAATATTTT 
TCCTGTCCCA 
GCAAATAAAT 
GCATAATTTT 
AAAAATAAAT 
AATTTTTTAG 
CCGATTTCTA 
CCTAAAAACC 
AGAAAAAGGT 
ACAAATTTTA 
AGTCGCAAAT 
AAATCTTCCA 
CCCACAGTAT 
AAACTCCTAT 
GTGCGTCAAT 
GGGGTGCGAG 
TACTACCCGA 
AGATGAAATG 
CAACAGCCCA 
CTAGACCTGT 



ATTTAATGTT 
AAATATCATT 
AATGGCTTCA 
AAAACTCATT 
TTTAGAAATA 
TACCCAAGGT 
TCTGAAGTTA 
AGGATAAGAC 
GGTAGTCAGT 
GTTGTTCAGT 
TCAAATCTGC 
GCCTAAAAGA 
TTAACTGTTG 
AACCACTTCG 
GAATGTACAG 
CATGCTCTAC 
AAACGACAAA 
AATCACCATG 
ATTTCAAATC 
TGACAGTTTT 
TTTTGAGTTC 
TTTCAGCTAA 
CTTATGTTTG 
ATGGGCGAAA 
GGGGATTAAA 
CGCTTTTTCT 
GCAATTAATA 
GCCATAAGGA 
TGAATTCCAC 



TTCCCTTTGA 
AAGTAATCTC 
TTATGAGTGG 
AATAAAGTTC 
GATTCPCTCG 
ATTTCAGTTC 
ACTAGGCTAG 
GTTGTTTCGT 
TTGACCACCT 
AGCTGTTTTC 
AAGCAAGGCA 
ATGAGGGTAA 
AGCAACTCCT 
GCGATTTTCT 
TCTACGAATT 
TTTTAGTTTA 
ATGCCATTTT 
GCTAATGACA 
ATCATGACCA 
AGGTTTAAAA 
GTAATTCTCC 
ACCAGCTTTC 
ATAGAAATTC 



TAGCATCTTT 
TGATAATATT 
CAGGAATATC 
CTTTAGGTGA 
CATTAGTTAC 
CCCAAAAAGT 
CAAATTTAAT 
CTTTGTTCCC 
GTTATTTTTT 
CTTAGATAAA 
ATATCATTAT 
ATTACGTTAG 
CTTGCTGTAA 
GATTCTAACC 
TTACCACCTT 
ATAAGTTCAC 
AAAATACCAT 
TCTTCTAAAA 
ATAAAAGCCA 
TTGTCTAATA 
AAGAAAAATC 
TTCAAATGTA 
CACCGCACGT 



AATTGTTCGA 
TATTCTTTTA 
TCAATAACTG 
ACCCCCGATA 
TTTTTATTTT 
ATTAAATCAC 



TTTTATCAAC 
TAGTTTTCGT 
ACTGAACAAT 
AGACTCTTAT 
CATGGTAAAT 
CATCTCTTAG 



TGAATCACCT 
TTCTTTATTA 
CAATATGGCA 
AATGTCTATT 
ATAACCAGAT 
ACCTTCACTG 
ATATCTCCAT 
ATAATAAGAG 
ACCAATTAAC 
TTCGAGTAGT 
ACTTCGCTAA 
GATTCATTTT 
TTGGTTCGTT 
AACTAAGACA 
TTGAGTAAAT 
TTACACGAGC 
CTTTTTTCAA 
ACGGTTTTTG 
GATATTTATT 
AATATCCTTT 
GAACACCATA 
ATTCAAAATC 
AAAACTATTA 
GATTCTGGAT 
TAATAATTCT 
AGGAGAGGAA 
AACTGTAACA 
AGCTAATCGT 
TAATCTTTCT 



AAACTAGAGA 
CCATAGGGGA 
ACTTTTCCAA 
TTCTTTGATT 
ATAGGCATAT 
CCTGGAGGAG 
GCTTCTGGGA 
CCATAATCAC 
AATTTTATCT 
TTCTATACTT 
AAAATTCTTA 
GTATTTATCA 
AAATTTATTC 
ACTATTTCTT 
GTCAAAATAA 
CCCAGTTGCA 
ACTACGTTTA 
CTGTACTTTG 
TACTCCTTGT 
GTATTCAAAT 
AAGGTACGAA 
TTTCAACGTA 
TACTAAATTA 
TGTTCAGGAA 
TTTTGTTTTG 
AGAAAATTAT 
TGGCTATCTG 
CCTAACGTAC 
TTCTGGTAAC 



6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 
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TATGAACTGT TTCGGGATCA ATAAATCTTG CTAAGTCAAT AGAAAAGCCA GACCATTGAT 8100 

TACATTTCTG AGCAATCACA GGCTATATAG GAATATTTGA ATATTTTGGA GACTTCCCTC 8160 

TTTGAATGTA GGAGGTTATA TCGTTTAACC TCACCCATTC CCAACTTTCT GGTATTTCAC 8220 

AAGGTACTTC CTCATAATAA GAGTTATCAT CTCCTTGGGA AACAATAGAA ATGTCCAAAT 8280 

CTTTCTTTTT AATCTTGCCT TCTTCAAAGA Grm ' MTf TTCTGCTCGT ATTTTTTCAA 8340 

GTAAAACTTC GACTGATTCA TCATTTGGGT CTTGTTCAAC TAATTTTCCT TGCATAGCAT 8400 

ATTGAAGAAT AGATTTTTTT AGTTTATCTG GAAATTCTTT ATCTAGCTGT TCTAGTCTAT 8460 

TATAACTTTC AGCATATTCA TCTACTTTTT CTAAAGCTGA TTCGATTGCT TCTACTATTC 8520 

GTTGTTGTTC GGATAGTGGG GGGAGAGCAA TTAATAATAG ATTAAAATTA TAATCATTGA 8580 

TTGCAGGATA ACTTGTTCCA GTAGATTTAT TATTAACACG ATTGATAAAA TTATCTGATA 8640 

ATAAATAATA TTTCAAATAT GTTTCGTTAA GTAAAGTATC CAAAACAATA AATGCTGTAC 8700 

TAGCTATCAA ATACTCTTTA AGTTCTCTAA CTACAGCAAT ATTTTTTAGA TATGGTCTAA 8760 

CTGTTGAAAA TAAGACACTA TTCTGCGAAA CTAATTTTCT AGCACGGGAA GGCGCTTGTT 8820 

CAGGTGAAAG ATATTGTAGA TTTTTGTAGT TGATTATGTT CTTTTTTCTA TCAATACTAG 8880 

ACGTATCTAT ATACCTAAAG GATTTCTCTG GCTTATTTTG CCCAAAATTC CAATAAATTG 8940 

ATTTTATCCT CACCCACTCC CAAGTATCAG GAATATCATA AGGAACATCA ATTTCTTGAG 9000 

TGCTTCCATC AGCAAACTTC CCATAATGTT TCTTATGTGC TTCAAGTATA TAAAAAGGCG 9060 

TAAAAATACG CCTATAGATA ATGGGGTTGA AATAGGTTTA TTGTTGATGA GATTGTAGAT 9120 

AATTCAATTT TTTACTTCCA ATCGAATATT CAAATCCTCC ACCTTTTCTG CCTGTAATTG 9180 

TTCATCATAA AATTCAATAT CTTCAGGATT TTCCCCTTGG CAACCTCGGC AGAAATATTC 9240 

TTCCGCTCGA TCAGGATTCA AAAATCGACA AGCACAAACA AAACAGTCGC CATCATCATT 9300 

TATTGAGATA ATATAGTAGA TTGAAATAAG ATGTAAACAA ATCGATTAGG AAAGTTAAAT 9360 

TAGTTTCTAG AAATTTTTAG CAGATGTAGT GTACTATTCT AGTCTCAATT TACTATGGCT 9420 

TCAAATATAT CTTTCGAAAA AATATTTACA GATGTGTAAT TTTGAAGCTT GCAAAAGTTA 9480 

GTAAACTTGT AGATTTCGAT TTGAAGTAAC TfOTTTTLVT GCCCGATATT GTTTTTGAAA 9540 

TTGAATTTTT CCATAGTGAC TCCTTAATTT TCTTCTACAC GTCTGATGAT AAATCTAATT 9600 

CGCAAAAGAG TCAAGAGGAT TTTTCGAAAA ATAAATAGCG ACCGAAATCC CTATTTTAAG 9660 

GGTTATAGGT ATTTGATGGC TTAGACTGCT GTGTGACTGT TTACCCACAG GCAATCTTTC 9720 

TTCTATATTA GTATTAGTAA AGGTCTAAAT AATTATCAAT TTCCCATTGT GAAACGAAGG 9780 

TTGCATAACT TGCCCATTCG ATTCGTTTGG CTTCAAGGAA GCTAGTATAG ATGTGATCTC 9840 
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CGAGAGCAGC 


TTTAACCACT 


TCATCTTCTG 


TCAAAGCTTT 


CAAAGCCTTG 


TGAAGAGTTG 


9900 


ATGGAAGCTC 


TGTAATACCA 


GCTTCCTTGC 


GCTCTTCTGC 


TGTCATGATG 


TAGATATTTT 


9960 


CTTCGATAGG 


AGCTGGTGCT 


TCGATTTTAT 


TTTCAATACC 


ATACAAACCA 


ACTTCCAAAA 


10020 


G AACAG C CAT 


AGCAACGTAA 


GGGTTCGCCA 


TTGGATCCAC 


TGAACGCAAC 


TCAAGACGAG 


10060 


TTCCCATACC 


ACGTGAAGCA 


GGTACGCGCA 


CAAGTGGCGA 


ACGGTTACGA 


CCAGCCCAAG 


10140 


CAATGTAAAC 


AGGCGCTTCA 


TAACCTGGAA 


CCAAACGTTT 


GTATGAGTTA 


ACTGTTGGGT 


10200 


TCATGATGGC 


AGTATAGTTG 


TAAGCATGCT 


TGATCAAACC 


GCCTAGGAAA 


TGGTAAGCTG 


10260 


TTTCTGACAA 


CTGCATTCCT 


TTTGGATCAT 


TTGGATCAAA 


GAAGGCGTTA 


TTTCCTTCTG 


10320 


CATCAAACAA GGACATATTA 


CAGTGCATAC 


CTGATCCAGC 


AATACCAAAT 


TTTGGCTTCG 


10380 


CCATAAATGT TGCGTAAAGT 


CCGTGTTTGC 


GAGCAATGGT 


TTTAACAACA 


AGCTTAAAGA 


10440 


TTTGAATCTT 


ATCACAAGCA 


CGGAGAACTT 


CATCGTACTT 


AAAGTCAATC 


TCATGCTGTC 


10500 


CAACCGCAAC 


CTCGTGGTGA 


CTCGCTTCTA 


CTTCAAATCC 


CATTTTGGTC 


AAGACATTCA 


10560 


CAATCTCACG 


ACGTGTGTTG 


TCCGCAAGGT 


CAGTAGGTGC 


CAAGTCAAAG 


TAGCCACCCT 


10620 


TGTCATTCAC 


TTCAAGTGTT 


GGGTCCCCAT 


TTTCATCCAA 


CTTAAATAGG 


AAGAATTCTG 


10680 


GCTCTGGACC 


AAGGTTGAAG 


GATTTGAATC 


CAACTTCTTC 


CATGTGACGA 


AGAGCTCGTT 


10740 


TCAAATTACC 


ACGAGGGTCA 


CCCGCAAATG 


GTTCACCTTC 


TGTTGTATAG 


ACATCACAGA 


10800 


TCAGACCTGC 


AACACTTCCA 


TTTTCATCTC 


CCCAAGGGAA 


GACTGTCCAT 


GTATCCAAGT 


10860 


CCGGGTACAA 


GTACATATCC 


GACTCATTGA 


TACGTACAAA 


ACCTTCAATA 


GAAGATCCAT 


10920 


CAAACATAAC 


CTPGTTCGAC 


AAGACCTTAT 


CTAACTGTTC 


ATCTGTAGCA 


GGAATTTCGA 


10980 


CGTTTTTCAT 


GGTTCCCAAA 


ATATCTGAGA 


ACATAAGACG 


AATAAAGGTA 


ACATTTTTTT 


11040 


CCTTGACTTC 


ACGACGAATA 


TCTGCAGCTG 


TGATTGGCAT 


AAGTTTTCTC 


CTTAATCTAT 


11100 


GACTACTTGC GGTTGCCTAA CCGCGACCAA AAGGTGACTG TACTGAAGCA AAACGCCCCT 


11160 


GTTGGAGGAG 


TTCATTGTGA 


ACTGCACGAC 


GTACTTCAGT 


CTGACTAACC 


GCTTTCTTGG 


11220 


ATTTCGCTTC 


ACGTTCAGCA 


TATTTTTTCT 


TAATGGCAGC 


GAT ATT AT AA 


CCTTCAGAGA 


11280 


TATAATCTTT 


GATTTCAAGC 


AGACGATCCA 


TGTCATTCAA 


GGAATACATG 


CGACGATTTC 


11340 


CTTCGTTTCG 


ATCGGGCTTG 


ATCAACTCTT 


GATCTTCATA ATAACGAATC 


TGACGCGCCG 


11400 


ATAGATCGGT 


CAACTTCATA 


ACACTGCCGA 


TAGGAAAAAC 


AGCCATATTT 


CGGCGAAATT 


11460 


CTTTTTCCTT 


CATTTACAAT 


TTCCTTCTTT 


CTGTCTATTA 


TAGTCTAAAA 


AAAGACAAAC 


11520 


GTCAATTGAT 


AATGTTATAA 


AATGTAACAT 


TATTTTTCTT 


TTTTCTCTAA 


AAAGAGACGA 


11580 
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ATACGATCAA TATCGTAATT TACGATAATT GCGACAAAAA CTCCCATAAA CGTTTCTAAT 11640 

ACACGCACAA ACACGTACAA AATTGTCTCA CCACTTGGAA TTGATAGGGT AATGATTAAC 11700 

ATAGCTGCTA CACCACCAAT AACCCCTGCT TTGTTATTCA TGGCTACATT TGTCATAATG 11760 

GTTAACATGG TGCAGATTGG AACAACTACC AAGGTCACCC AAAAGGCTTC GTGGAAAAAG 11820 

GTATTTAATA AGAAGAAGAC CAAGGCATAG AGTCCACCGA TACTATTTCC TAGAATACGC 11880 

GAAGTCCCAA AATGAACACT CTCATCAAAA CTCTCCCTCA GGCTAAAAAC GGCTGTCAAA 11940 

GCACCAATTT GAAGACCTTT CCAGCCAAAA AAGCCAAAAA TCAAGAGAAC TAGAAAAACA 12000 

GCAATACCTG TTTTAAAGGT TCGCATACCA AGTTTGAACT GGGATTTATC GAATTTATAT 12060 

TTTTTAAAAT AACTCATAAT CTCAACTTTC TATTTCCATT TTATCATAAA TCGCTGATTT 12120 

TTATGAGTAA TAGTTGAGAG GAAGCGTTTT TATTTTAAGC AAAAGAAAAG AGGAACTTTC 12180 

ATCCCTCTCT TCTTTGATTT ATTTATAAAA TCTTATTTTT CTGTCAAGGC TGCAAGTCCT 12240 

GGAAGAACCT TACCTTCAAG AACTTCCATT GATGCTCCAC CACCCGTACT AATCCATGAG 12300 

AACTTGTCTG CACGGCCAAG GTTAATCGCT GCGGCAGCTG AGTCACCACC ACCGATGATT 12360 

GATTTAACTC CTGGTTGTTT CACGATAGCG TCCATCACAC CGATTGTACC AGCTTGGAAA 12420 

TCTGGGTTTT CAAATACACC CATAGGTCCG TTCCATACGA CTGTTTTGGC ACCAGTCAAA 12480 

GCTTCGTCAA ATTTGGOGAT AGATTTTGGA CCGATGTCAA GACCAAGGAA GCCTTCAGAA 12540 

ACTGCTTCAC CTTCAGTGTC ACGCACTTCA GTGTAACCAG CAAATGCGTT AGCTTCTTTT 12600 

GAGTCAACTG GCAAGATCAA TTTACCATTT GCTTTTTCAA GAAGACCTTT CGCAACATCC 12660 

AATTTGTCTT CTTCTACAAG TGAGTTACCG ATTTCGATAC CTTGTGCTTT GTAGAATGTG 12720 

TAAGTCATCC CACCACCGAT AAGGACGTTA TCAGCTTTTT CAAGCAAGTT TTCGATAACA 12780 

CCGATCTTGT CTGAAACTTT TGAACCACCA AGGATAGCCA CGAATGGACG TTCTGGAGTT 12840 

TCAACTGCTT CTTGGATGTA GGCAATTTCG TTTTCAAGAA GGAAACCAGC AACTGCTTTT 12900 

TCAACGTTTG CTGAGATACC AACGTTAGAT GCGTGTGCAC GGTGAGCTGT ACCGAATGCA 12960 

TCGTTTACGA AGATACCATC TCCAAGTGAT GCCCAGTATT TACCAAGTTC AGGATCGTTT 13020 

TTAGATTCTT TCTTGCCGTC AACATCTTCG TAACGAGTGT TTTCAACCAA GAGAACTTGT 13080 

CCATCTTCAA GAGCGTTGAT TGCCGCTTCT AATTCAGCAC CACGAGTGAC ACCTGGGAAA 13140 

ACAACATCTT GACCAAGTTT TGCTGCCAAG TCAGCTGCTA CAGGAGCAAG TGATTTACCA 13200 

GCTTTATCAG C TT C TT C T T T CACACGTCCA AGGTGAGAGA AAAGAATTGC ACGTCCACCT 13260 

TGTTCGATGA TOTACTTAAT AGTTGGAAGA GCTGCTGTGA TACGGTTATC GTTAGTGATT 13320 

ACGCCATCTT TCAATGGTAC GTTGAAGTCA ACACGAACGA GGACTTTTTT ACCTTTCAAG 13380 
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TCAACCTCTT TAACAGTAAG TTTTGCCATG TTACAAAAAC TCCGG 13425 
(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90S base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNE5S: double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

GATTTATCCT ACCGGnGAAT TTCCGGAGGG GTTCTAGCAG CAATCTTAGG AATCTATGAA 60 

CGAATGATTG GCTTTCTGGC CCATCCCTTT AAAGACTTTA AAGAAAATGT TTTGTACTTT 120 

ATTCCAGTTG CCATCGGTAT GCTTCTGGGA ATCGGCTTAT TTTCCTACCC GATTGAATAC 180 

CTGCTTGAAA ATTATCAGGT TTTTGTATTA TGGAGCTTTG CGGGAGCTAT TATCGGTACA 24 0 

GTTCCTAGCC TCCTCAAAGA ATCAACTCGA GAATCTGACC GAGACAAGAT TGATTTAGCT 300 

TGGTTATGGA CAACCTTTAT CATTTCTGGA TTAGGACTCT ATGCCTTAAA TTTTGTCGTT 360 

GGAACCTTAA GCGCCAGCTT TCTTAACTTC GTCCTAGCAG GCGCACTATT GGCCCTTGGC 420 

GTCTTGGTTC CTGGCCTCAG CCCATCAAAT TTACTTTTGA TTTTGGGACT CTATGCTCCT 4 BO 

ATGTTGACTG GTTTTAAAAC TTTTGATTTC TTGGGAACCT TCTTTCCGAT TGGAATTGGT 540 

GCAGGTGCAA CTCTCATCGT TTTTTCAAAA TTGATAGATT ATGCCTTAAA CAACTACCAC 600 

TCACGCGTCT ATCATTTCAT CATCGGTATC GTCCTATCAA GTACCCTTTT GATCTTAATT 660 

CCAAATGCAG GAAACGCTGA AAGTATCCAA TACACAGGAC TTTCACTTGT CCCTTATGTC 720 

ATCATCGCCT TCTTCTTTGC GCTGGGAATC TGGCTTGGTA TTTGGATGAG TCAATTGGAG 780 

GATAAATATA AATAATGGCA AAAAAAGTTA AAATCAAAAA AACATTGGTG GAACAAATCC 840 

TATCTAAAGC AGCTATCCCT CATCAGGGGA TTCAAATCAA TGCCCTAGAA GGAGAGCTTC 900 
CTCAA 

(2) INFORMATION FOR SEQ ID NO: 153: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4278 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 



905 
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CTTGAATTAA ATAAAAAACG TCATCCGACT AAGCATTTTA CTGATAAGCT TGTTGATCCC 60 

AAAGATGTGC GTACCGCTAT CGAAATTGCA ACCTTAGCGC CAAGCGCCCA CAACAGCCAG 120 

CCTTGGAAAT TTGTGGTGGT ACGTGAGAAA AATGCTGAAC TGGCAAAGTT AGCTTATGGT 180 

TCCAATTTTG AACAGGTATC ATCAGCGCCT GTAACCATTG CCTTGTTTAC AGATACGGAC 240 

TTAGCCAAAC GTGCTCGTAA GATTGCCCGT GTTGGTGGTG CTAATAACTT TTCTGAAGAG 300 

CAACTTCAAT ATTTTATGAA AAATCTGCCA GCTGAGTTTG CCCGTTACAG TGAGCAACAA 360 

GTCAGCGACT ACCTAGCTCT CAATGCAGGT TTGGTTGCCA TGAACTTGGT TCTTGCATTG 420 

ACAGACCAAG GAATTGGTTC TAACATTATT CTTGGTTTTG ACAAATCAAA AGTTAATGAA 480 

GTTTTGGAAA TCGAAGACCG TTTCCGCCCA GAACTCTTGA TCACAGTGGG TTATACAGAC 540 

GAAAAATTGG AACCAAGCTA CCGCTTGCCA GTAGATGAAA TCATCGAGAA AAGATAGAAA 600 

GAAGAAAAAA TGACAGCAAT TGATTTTACA GCAGAAGTAG AAAAACGCAA AGAAGACCTC 660 

TTGGCTGACT TGTTTAGCCT TTTGGAAATC AATTCAGAAC GTGATGACAG CAAGGCTGAT 720 

GCCCAGCATC CATTTGGGCC TGGTCCAGTA AAAGCCTTGG AGAAATTCCT TGAAATCGCA 780 

GACCGCGATG GCTACCCAAC TAAGAATGTT GATAACTATG CAGGACATTT TGAGTTTGGT 840 

GATGGAGAAG AAGTTCTCGG AATCTTTGCC CATATGGATG TGGTGCCTGC TGGTAGCGGT 900 

TGGGACACAG ACCCTTACAC ACCAACTATC AAAGATGGTC GCCTTTATGC GCGCGGGGCT 960 

TCGGACGATA AGGGTCCTAC AACAGCTTGT TACTATGGTT TGAAAATCAT CAAAGAATTG 1020 

GGTCTTCCAA CTTCTAAGAA AGTTCGCTTC ATCGTTGGAA CAGACGAAGA ATCAGGCTGG 1080 

GCAGACATGG ACTACTACTT TGAGCACGTA GGACTTGCCA AACCAGATTT CGGTTTCTCA 1140 

CCAGATGCTG AATTTCCAAT CATCAATGGT GAAAAAGGAA ATATCACGGA ATACCTCCAC 1200 

TTTGCAGGAG AAAATACAGG TGTTGCCCGT CTTCACAGCT TTACAGGTGG TTTACGTGAA 1260 

AATATGGTAC CAGAATCAGC AACAGCAGTC GTTTCAGGTG ACTTGGCTGA CTTGCAAGCT 1320 

AAACTAGATG CCTTTGTTGC AGAACACAAA CTTAGAGGAG AACTCCAAGA AGAAGCTGGC 1380 

AAATACAAGG TGACGATCAT TGGTAAATCA GCCCACGGTG CTATGCCTGC TTCAGGTGTC ..440 

AATGGCGCAA CTTACCTTGC CCTCTTCCTC AGCCAGTTTG GCTTTGCTGG TCCAGCCAAA 1500 

GACTACCTTG ACATCGCAGG TAAAATTCTC TTGAACGATC ATGAGGGTGA AAATCTTAAG 1560 

ATTGCTCATG TGGATGAAAA GATGGGTGCT CTTTCTATGA ATGCCGGCGT CTTCCACTTC 1620 

GATGAAACAA GTGCTGATAA TACCATTGCC CTCAACATCC GCTATCCAAA AGGAACAAGT 1680 

CCAGAACAAA TCAAGTCAAT CCTTGAAAAC TTGCCAGTTG TTTCTGTTAG CCTGTCTGAA 1740 

CACGGTCACA CGCCTCACTA TGTGCCAATG GAAGATCCAC TTGTGCAAAC CTTGTTGAAT 1800 
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ATCTATGAAA AACAAACTGG CTTTAAAGGT CATGAACAAG TCATCGGTGG TGGAACCTTT I860 
CGTCGCTTGC TAGAACGCGG AGTTGCCTAC GGTGCTATGT TCCCAGACTC GATTGATACC 1920 

ATGCACCAAG CCAATGAATT TATCGCCTTG GATGATCTTT TCCGAGCAGC AGCAATTTAT 1980 

GCCGAAGCTA TTTACGAATT GATCAAATAA AACGATAGAA GTCTGAGATC TTATCCTTGG 2040 

ACTTCTTTTT GGAGGGAAAG TAGATGTCTC AAATCGAAAG AATCAAACAG GCTATCATGG 2100 

CGGATTCGCA GAATGCCAGC TATACAGAGC GTGGCATTGA GCCTCTCTTT GCAGCGCCAA 2160 

AAACTGCTCG CATCAATATC ATCGGTCAGG CTCCGGGACT TAAAACTCAA GAAGCAGGCC 2220 

TTTACTGGAA AGATAAAAGT GCTGACCGCT TGCGGGACTG GCTAGCTGTG GATGAAGATA 2280 

CCTTTTACAA TTCAGGTTAT TTTGCTGTTT TGCCTATGGA TTTCTACTTT CCAGGACATG 2340 

GCAAGTCGGG TGATCTTCCG CCTCGTACAG GTTTTGCAGA AAAATGGCAT CCGCAGGTCT 2400 

TACAGGAATT GCCTGATATT CAGTTAACCC TCTTGATTGG GCAATATCCC CAAGCCTACT 2460 

ATTTACAGGA GAAAATCAGT GGGAAGGTAA CGGAGAGGGT GAAACACTAT AAAGACTATC 2520 

TGCCAGCCTA TTTTCCGCTA GTTCACCCAT CACCACGAAA TCAAATCTGG ATGGCCAAAA 2580 

ATCCTTGGTT TGAGGCAGAA GTAGTGCCAC ATTTGAAAAA AAGAATTAAA ACCATTTTAT 2640 

AGTCAATGAA AATCAAACAG CAAACTAGGA AGCTAGTCGT AGGCTGCTCA AACTACAGCT 2700 

TTCAAGTTGC AGATAAAACT GACGAAGTCG GTAACATACG CACGGTAAGG CGACGCTGAC 2760 

GTGGTTTGAA GAGATTTTCG AAGACTATTA GAAGAAAAAG AATGAAAGAA ATAGCCTTTG 2820 

ACGCATTTTA CCAGCTTTAC CAAAACGACC AGCTTTCTTT AGTGGATGTG AGAGAAGTGG 2880 

ATGAGTTTGC AGCTCTTCAT TTAGAAGGTG CCCACAACCT ACCGCTTAGT CAATTGGCTG 2940 

ATAGTTATGA TTAATTGGAC AAAGATCGCT TGCATTATAT TATTTGCAAA TCTGCAATGA 3000 

GATCGGCGCG TGCTTGCCAA TTCCTATTAG AACAAGGTTA TAATGTTATC AATGTCCAGG 3060 

GTGGCATGTT AGCCTTTGAA GAACTTTAAA ATTTTGCATT TCTCCTACTT GGTGTGGACT 3120 

GGGTAGGAGA GTTTTATTTT TAGATAATTC TTATTTTTAA GAAAATTGAA AACATTTAAT 3180 

ATTTGCCTCG TGATGCTTTT TTCAGACTCC TAATCGTGGT ATACTAGGTC AGTATTTTAT 3240 

AAATATGAAG GAGATTTTTA TGGCTAAAAA AGGTACCCTA ACAGGTTTGC TCCTGTTTGG 3300 

AATATTTTTT GGTGCGGGGA ACTTGATTTT TCCGCCTTCT CTAGGTGCTC TATCTGGAGA 3360 

ACATTTTCTT CCTGCCATCG CAGGTTTTGT CTTTTCAGGC GTTGCTATCG CCGTCTTGAC 3420 

CCTTATTATT GGAACGCTAA ATCCTAAAGG ATATATCTAC GAGATTTCAA CGAAGATAGC 3480 

GCCTTGGTTT GCGACTCTTT ACCTCTCAGT TCTTTACTTG TCAATCGGTC CATTCTTTGC 3540 
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1020 








TACCCCACGT 


ACTGCTACAA 


CAGCTTACGA 


AGTACGGATT 


AGCCCCCTTT 


TGTCGGATGC 


3600 


AAATAAAGGA 


CTTGGCTTGA 


TTGTATTTAC 


GGTTCTGTAT 


TTTGCGGCAG 


CCTATTTGAT 


3660 


TTCCCTTAAT CCATCAAAAA 


TCTTAGACCG 


CATTGGACCT 


ATTTTAACGC 


CAGTCTTTCC 


3720 


AATTTTGATT GTTATCTTGG 


TCGTTCTGGG 


AGCTATCAAA 


TATGGTGGAA 


CAAGTCCTCA 


3780 


ACCTGCTTCA 


CTGCTTATCA 


AGCTTCTGCC 


TTTGGTACAG 


GTTTCCTAGA 


AGGTTACAAT 


3840 


ACCTTGGACG 


CCCTTGCCTC 


AGTGGCCTTT 


AGCGTAATCG 


CAGTTCAAAC 


CTTGAAACAA 


3900 


CTTGGATTTT 


CAAGTAAGAA 


AGAATACATT 


TCAACTATTT 


CGGTTGTTGG 


TATCGTTGTT 


3960 


GCCCTTGCCT 


TCAGCGCTCT 


TTACATCGGT 


TTAGGTTTTC 


TTGGAAATCA 


TTTCCCAGTA 


4020 


CCAGCTGAAG 


CGATGAAGGG 


TGGAACACCA 


GGTGTTTACA 


TCTTGTCACA 


AGCCACTCAA 


4080 


GAAATCTTTG 


GCTCAACAGC 


TCAACTCTTC 


CTTGCAGCTA 


TGGTTACCGT 


AACCTGCTTC 


4140 


ACAACGACTG 


TTGGTTTGAT 


TGTGTCAACA 


GCTGAGTTCT 


TTAATGAGCG 


CTTCCCACAA 


4200 


ATCAGCTACA AGCTTTATGC 


GACAGCCTTT 


ACCTTGATTG 


GATTTGCTAT 


TGCCAATTTG 


4260 


GGTCTTGATG 


CGATTATC 










4278 



(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1953 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEENESS: double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 



ACCCGATCAA 


ATGACAAAAG 


CTAACTTTGG 


TGTCGTAGGT ATGGCCGTAA 


TGGGTCGTAA 


60 


CCTTGCCCTT 


AATATTGAAT 


CTCGTGGTTA CACAGTTGCT ATCTACAACC GTAGTAAAGA 


120 


AAAAACGGAA 


GATGTGATTG 


CTTGCCATCC 


TGAAAAGAAC TTTGTACCAA 


GCTATGACGT 


180 


TGAAAGTTTT 


GTAAACTCAA 


TCGAAAAACC 


TCGTCGTATC ATGCTGATGG 


TTCAAGCTGG 


240 


ACCTGGTACA GATGCTACTA 


TCCAAGCCCT 


TCTTCCACAC CTTGACAAGG 


GTGATATCTT 


300 


GATTGACGGA 


GGAAATACTT 


TCTACAAAGA 


TACCATCCGT CGTAATGAAG 


AATTGGCAAA 


360 


CTCTGGTATC 


AACTTTATCG 


GTACTGGGGT 


TTCTGGTGGT GAAAAAGGTG 


CCCTTGAAGG 


420 


TCCTTCTATC 


ATGCCTGGTG 


GACAAAAAGA AGCCTACGAA TTGGTTGCGG 


ATGTTCTTGA 


480 


AGAAATCTCA 


GCTAAAGCAC 


CAGAAGATGG 


CAAACCATGT GTGACTTACA TCGCTCCTGA 


540 


TGGAGCTGGT 


CACTATGTGA 


AAATGGTTCA 


CAATGGTATT GAGTACGGTG 


ATATGCAATT 


600 


GATCGCAGAA 


AGCTATGACT 


TGATGCAACA 


CTTGCTAGGC CTTTCTGCAG 


AAGATATGGC 


660 
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TGAAATCTTT 


ACTGAGTGGA 


, ACAAGGGTCA 


ATTAfiACAfiC 


1 nL 1 l\iA 1 I \j 


AAATCACAGC 


720 


TGATATCTTG 


AGCCGTAAAG 


ACGATGAAGG 


PCAAnATfVMl 


tuw i vviTAG 


ACTACATCCT 


780 


TGATGCTGCA 


GGTAACAAGG 


GAACPGCTAA 




UAATCATCTC 


TTGACCTTGG 


840 


TGTACCATTG 


TCACTGATTA 






TACATTTCAA 


CTTACAAAGA 


900 


AGAACGTGTA 


CATGCTAGCA 




AAAAV.WUA. X 


L. 1 IvJ AAC T 


TTGAAGGAGA 


960 


CAAGGCTGAA 


TTGATTGAAA 




AuLvLi 1TAC 


TrCTCAAAAA 


TCATTTCATA 


1020 


CGCACAAGGA 


TTTGCTPAAT 




/^TV^^I* 11 A Af* A A 

1\. X AAAl* AA 


AACAACTGGA 


ACTTGCCATT 


1080 


TGCAGATATC 


GCATCTA*Tf*T 




L iTiT ATCATC 


CGTTCTCGTT 


TCTTGCAAAA 


1140 


GATTACAGAT 




ui-UATGCAGA 


TCTTGCCAAC 


CTTCTTTTGG 


ACGAGTACTT 


1200 


CTTGGATGTT 


ftU 1 Vjv. X Anu * 




AGTACGTGAT 


ATCGTAGCTC 


TTGCGGTTCA 


1260 


AGCAGGTGTG 






AGCTATTACT 


TACTTTCATA 


GCTACCGTTC 


1320 




l-t-Avjl. 1 AA\. X 


TGATCCAAGC 


ACAACGTGAC 


TACTTTGGTG 


CTCACACTTA 


1380 


CCAACGTAAA 


uALAAAUAAVj 


gaacl ri icca 


CTACTCTTGG 


TATGACGAAA 


AATAAGTAGG 


1440 


TCAGCCATTV: 


OuAAAWlalxAT 


TTTvvTTACTT 


GAGAAAGAAC 


GAAATCTAGC 


TCATTTTTTA 


1500 


At* X 1 XwOAAl. 


TCCAGAAAGA 


GCAGTATCGG 


GTTGATCTGG 


TAGAGGAGGG 


GCAAAAAGCC 


1560 


CTCTCCATGG 


CTCTTCAGAC 


AGACTATCAT 


TTGATGTTAT 


TGAACGTTAA 


TCTGGGAGAT 


1620 


ATGATGGCTC 


AGGATTTTGC 


AGAAAAATTG 


AGCCGAACTA 


AACCTGCCTC 


AGTCATCATG 


1680 


ATTTTAGATC 


ATTGGGAAGA 


CTTGCAAGAA 


GAGCTGGAAG 


TTGTTCAGCG 


TTTTGCAGTT 


1740 


TCATACATCT 


ATAAGCCAGT 


CCTTATCGAA 


AATCTGGTAG 


CGCGTATTTC 


GGCGATCTTC 


1800 


CGAGGTCGGG 


ACTTCATTGA 


TCAACACTGC 


AGTCTGATGA 


AAGTTCCAAG 


GACCTACCGC 


1860 


AATCTTAGGA 


TAGATGTTGA 


ACATCACACG 


GTTTATCGTG 


GTGAAGAGAT 


GATTGCTCTG 


1920 


ACACGCCGTG 


AGTATGACCT 


TTTGGCGACA 


CGG 






1953 


(2) INFORMATION FOR SEQ ID NO: 155: 









(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6474 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
(D» TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 155: 
CCGGCAGTAC ACGAGCTTGG GGAACAGCCA CTGGAACGAT GAGGTGTGAG CTCAAAATAT 



60 
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CCTCCAGTTA TGTTTTTCCT AATAGTATAC CGGAAGAGTG AAAGGATTTT ATAATGGAGC 120 

GGTTACAAAG AACCTACTTT CTATTAAACA GTATACTATG AAAATGTGAA AATTTAACAT 180 

TTTTTTGTAC AAATTTTATA AATTATTGCC TTTTTAATAT CAATAGTTAA TCTCTTATCC 240 

AGATCCCCCT TGTGTAAACT TTATCTTTAT AAGCTTCAAG GCCCCTATCC CATCTATTTG 300 

CAACAATTAG ATCACTTTGT TTTGTAAATA GTTCAAAATT CTTTTCAATA ATTACGTTAT 360 

CTATACTAAC GTTTAAATTT GGTTCATATA CTAAAATTTT TATACCGACA ATCAATAGTT 420 

CATTAATTAT ACTTAAAATA GCTGACTCTT TGTAATTATC TGAATTATAT TTCATCCCCA 480 

ATTTATATAT TCCTACTATC TTTGGCTTTC GTTCCAATAT TTGTTTAACT ATGAACTGTT 540 

TTCTATTTGT GTTTGAAATA TCAATCGCTT CTATCACTGG GGCATTTATT TCTATAAATT 600 

CTTTTTTTAA TTGTTTAGTA TCTTTGGGAA GACAATATCC TCCAAATCCA AAAGAAGGAT 660 

TATTATAAAA ATTTCCAATT CTTGGATCTA AACAAACACC TTTTATTACA ACTTCAGCAT 720 

TTAAGCTTCT CCTCTCAGCA AAAGAATCTA GTTCATTAAA AAAGCAACAC GGAGAGCTAA 780 

GAATGTGTTA GAAAAAAGCT TAATTGCTTC TGCTTCAGTA GGAGAAACTA ACATAACATT 840 

TTTAATATTG GCAGTACTAT GAGTACTAAT CGAAAGGAAC AACTCTGCAA TTTTTCTTCC 900 

TTCAACTGTC TCATCTCCAA CAACTATGCG ACTTGGATAT AAATTATCAT ATATAGAACA 960 

ACCTTCTCTC AAAAATTCAG GGACAAAAAT GATATTTTTT GTATCAAACA GCCTTTTTAA 1020 

TTTGTTTGAA AAGCCGATCG GAACTGTTGA CTTTAAAATA ATCTTTCCAT TAGGTTTTAC 1080 

CCTCAGAATC TTCGATACCG TTTGTTCGAT TTCATATGTA TTAAAACTAC CAATTTTCTC 1140 

ATCATAATCT GTCGGAAGCG CAATAATATA ATAATCAATA TTATTTTTAA TTTCAGAAAA 1200 

TGTATCAAAA AAAGTAATAT TTAAGTTATT CTCGCAAAAA AACTTCATAA GCTCTTCATT 1260 

TTTAGATGGA AGAATGCCCT TTTTTAAATT ATTTATTTTT ACAGAATCTA TATCATATGC 1320 

AACAACTTTA TATTTAGATG CAAATAGTAA CGCGTAGGCC AGCCCAACAT GCCCCAAACC 1380 

AATTACTGCT ATATTCATAA AACTACTTCC TTATTTCTTA ATCCAAAATC TAATAGAA'j A 1440 

AGCTGCCCCA TTCCTTAAAT ACAACTCTTT AATATTGTTT AAAAGTTTTT CAACTGATTT 1500 

CCAGATTATC AAAATCTGAG ATTTATAGCA CAATATTGAT GATATTCTAT CAATATAATT 1560 

TTTTTCATCA AGTTCCTCTT GATACATTTT TAATTCTTTA CTTTTTCCCA TATAACTAAC 1620 

CATACTACTA TCACTTACAT ATGGGAAGTC CTCATAATAT ATTACTTTAT AACGCATAAA 1680 

TTCAAGCGCC CTTCCAATAC TATTCACAAA AACATGAGCA ACATGGTCAC CAAGTGAAAG 1740 

CGGACAATAT ACGACACATT TGTCGTCTAA ATGCATTAAC AGCTCTTTTA TGATATCATT 1800 

CTTTAATGTG TCCTCATTTT TTAATTCACT ATAGATATGA CGGTATAGAA AATTGCCATT 1860 
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TCTATCTTTC CTATAGAGAC ATTCATAGTA CGATAAGTGT CTAAAATCAC ATTGTAGACG 1920 

TTCACAAGCT AACCTGTCTT CTTTCTTCCT TTCTTCAATC GGATATTTCC CAAGGTTACA 1980 

CAACTTATGA AATTGCTTAG CAGAGGCCTG TAGCTGTTGG CTCAAAGGGT AACCAGAAAA 2040 

TATAGTAATA ACAAGTACAA TTTCTCCTTC TGAAGTTAAT TTTGAAATAT AATCACCACA 2100 

GGAAAAAATT GCGTCATCTA AATGTGGAGA TAAAAAGATA TACTTAGTAT TCTTACTCAT 2160 

AACCATTCCC TCTACAATTT ATCTAAAAAC TCACTAAGTG TCTGATTAAA TTCCACATCA 2220 

TCAAAAAAAT TCACCTTATT CTTAATAATG AATATTTCGT TAAATAAACA TATATATAAA 2280 

TATTTCAATA TCCTTTCAAT ATCATCCTCT AAATTCTCCT CAATATTTTC TATCAGCCCA 2340 

TTTACAATCT TATTAAAAAA GATAAGCTCT TTATCTCTAA AATTAAATAT TTTCATACAA 2400 

CTGTTGTATC GAAAAATATA TAAAATAATT TTTACTAATG TTTGAATATT TAAACAACTA 2460 

AATAAATGAG TTGTACCCGG GACACTATTT ATGTTATCAA GAACACTATC TTGAAACCTC 2520 

AACTCACAGT TCTTTTTGTG AAATTCTTTT TTATCGTTTA GATCTGATAT TTTTTTAGAC 2580 

ATTTCAACAA TCTCAGACAT TTTATATGGA TATCTAGGAT GAATGCCAAA ACTATGCAAA 2640 

ATGAACTGCA CCCCAAAAGT TAGACAGAAT AAATCTAACT TTTGGGGTGC AGTTCATAAG 2700 

ATTGGGATAT TTTTTTTTAG CTAGAACTAG TAGAAATATA TACTCAAATA ACAGATACCT 2760 

TAAGGGTTTC TCATCTACAT AAAAAAATGA TACTTTTTTC TCTTCAGTAA TTACCTCATA 2820 

AGCTTCACAA TAGAATCTCA TGTTTCCCTC CCCTATATTC TTAAATAAAA TCCTTTGGAA 2880 

ATTGATATAT CTTAGTAAAA TATTGTTTAA GTTCCGGATG CGGAGCATGG CTAACAATAA 2940 

TGACAGTCAA ATCCTCTCTA TCTAATATCT TACGTTCAAT CGCTAACGAA GTTCTCCTAT 3000 

CGATAGCAGA AGTTCCCTCG TCAATTAATA CTATTTTCTT ATTTCTAATT AGCCCTCTAG 3060 

CTAAAGTAAT TTTTTGTTTC TGCCCTCCTG ACAGTAATCT CCCATCATCA CCAACATAAT 3120 

AATCTAAAAT GTTATTAGGA AAATCTTTTA CACTCAAACC AACTTGCTCT AAAGACTGTA 3180 

GTATTTCTTC ATCAGTATAA TTTTCTTCCA ATAAAATATT ATCTCTAATC GTACCTTCAA 3240 

ACAAATAAGC TTTTTGATCT ACATATAGAA CATTCGAAAC CATATTTAAA TAGGAGGTTT 3300 

TTTTTATATC ATCCCCGCAG AATCGCAATT CTCCACTATA ATCTCTCAAA AAGCCATTCA 3360 

ATAATTTTAA TAATGTAGAT TTCCCGCTTC CACTTTCACC TAAAATTAAA TACTTTTCAT 3420 

TACGTTGAAA ACAAAAATTT AAGTTTTTTA ATATTTCTTT ATCTCCATAC TTATAGCAAA 3480 

TATTTTTTGC TTCATATAAC GGAAAATCTC TATTCACCTC ATTTGGTTCG ATATCATTCA 3540 

TTTTATTTGA CTCAATTGGA TTAATTGAAT ACAATTTTAA AAAAATAGGC TTCGTACCAA 3600 
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TAATAGAGGA 


TAATTGACCT CCTAATTCAC CTAGCGCTGT AAAAATAACA 


CCTGTTAGTG 


3660 


CTCCTATTGC 


TTCAATAGTA CCAATTTTCA CTATTCCTTT TATTGCAAGA 


TAGCCTGTTA 


3720 


AAAAAACGAG 


AGATATCTGA AAAAAAATAT TGAGAAAGAA GCTAATAGCG 


CCTGCTAACG 


3780 


TTTCTACAGT 


TGTCTTTCTT TGTATAACCA TCTTTAATAA AATTCCTGCT 


TCTTTAATTT 


3840 


TCTTAGCCAA TACATATAAA AGATTCAAGG ACGCTAACAC ATCAAATCCA 


TTCAATATAG 


3900 


TCTCACTAGA 


TTTTAAAAAA GCTTCATTTT GGTTAGTTAA ATTTAGACTA 


ACTTCTCGCA 


3960 


TTTTCGATGC 


AAAGATTTTT GGTACAACTA GCATAATCAT TAATGAAAAC 


AAGGTGGCTA 


4020 


CACTCAATGA 


CCAATGATAG TGATTAAGAG TCACAACTGC AAATATAGTA 


CCAGAAATTC 


4080 


CTTTTATTAC 


TAAAAAAAGT TGTTTAAACG CCTGATCATT TAAAGTCTGA 


ACATCATTAT 


4140 


TTAGCCACGA 


AAGATATGTT CCTGATGATT TACTATGAAA TTCTTGATAG 


GTAGAGTTAG 


4200 


AGATGTCTGT 


GGCAACTCTA TTTCGAATCT CTAGATTAAA CTCTTGGATC 


ACTTCAACCT 


4260 


GATAATTTTT 


CACTACCCAG TCAAGGAATA TTATCCCACA CCAGACAATC 


ATTTGGTAGA 


4320 


TTGACAATTT 


CAAAAACCGC TCTAAATTCA TCGCAATTAA TTCATTCAAC 


ACCAGAGCAT 


4380 


TAATAGTTGC 


TGCATAAATT AGCAATAATT GACCAGCAAC AATAAATATC 


GTTAATAAAC 


4440 


TAAATTTTTT 


TATATTTGAT TTTATAATAG TATACACAAT AGTTTCTCAC 


TTTCTAAATT 


4500 


TTAATTGAAC ATAGTTT'l'CA TATATACAAT AGAAAAAACC AAAATGATAT 


AATAACATAT 


4560 


ATTTCAAAAA 


AGAAATTCGT TAAAAATTTT TTCTTCTCTT GCCTTCTTGA 


TTACTTTTAA 


4620 


AGCCTTGCAT 


TTGTCTCCTA TTAATAGTAA CCGCTTTATG TTTAAAGAAT 


AATATTTCTT 


4680 


TGTAACCAAT ATTCTCTCGT TGAAACTCAA TAAATTAAAA TATTTCCTAC 


AGTAATTATA 


4740 


ATATTCTTCA 


TCTGCATTAA TTGTTTTTTG TGTCACTCCA GTGATACCGT 


TTTCTTTACT 


4800 


GTGAGCGTAG 


TAATTCACCA AGAATTCTCG CACTATATCA ATTTGGTATC 


CTTGAACAAG 


4360 


TAGTTTTAAT 


AAAACAACAC CGTCCTGATG TGAATCTATT TTCTCAAAAC 


CATTAATTAA 


4920 


TTCTAGCACC 


TCTTTfTTAC ACAACCAAAA TGACGTACCT GCTATATTGT 






AACAAACAAG 


GGATTTCCAA CAAAATCGGT CTTCTCCTCT TCTCGTGTAC 


CATTTGGATA 


5040 


AATTATTATT 


CCATAACTAC AAACTAAAGC TAAATTCTTC ATTCTACTCT 


TTTTAAAACA 


5100 


AGCCATCAAC 


TTTAAAATTC GATCTGGCAT ATATTCATCA TCATCGTCTA 


AAAATGATAT 


5160 


ATACTTACCT 


CTAGAATTTT TGATACCTAT GTTTCTGGCA TTAGTTGCAC 


CTAAATCTTC 


5220 


ATTACTTAAA ATTAACTTAA TTCTATGATT GGTATAGCCA AATTGATGGA 


TAATTTTATT 


5280 


TCTTAAATTT ACATTACTAT AATTATCATC AATAATTATA ACTTCGATAT 


TTTTATAACT 


5340 


TTGATGTAAA 


CAACTTTTCA CAGCTCTAAT CAGAGATTCA TACCTATTAT 


GTGTTGGTAT 


5400 
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TATAATACTT ACTAATTCTT GATCTATATT CCTATCCATG ACTACTCTTC TCTAATAATT 5460 

CATCATATAC TCTCATGGTT TCTACAAACA TTTTTTGCAC AGAAAAATGT TTTCTTATTT 5520 

TTGATTTACT ATTCTCACCT ATATATTTCA AATACTCAGA ATCATTGAGT AAAAAATTAG 5580 

CACAAGCACA CACTCCCTCA ACATCTTCCT TCTCAAATAA AAATCCATCA ACCCTATGTT 5640 

CAATAATTTC ACTTAACCCG CCAACATTAC TAGCTAAAAC CCGAGTTCCT TGTGACATTG 5700 

ACTCTAAAAC ACACATAGGT ATTCCTTCTG TATCAGAAGG AATATACAAT AAATCCGATA 5760 

TTTGGTAAAC TATAGTAGCT GGATAGATTT CACCAAGTAA CCTGAAATTA TCTCTACATT 5820 

TCAAATGGCA AATTTTTTCT TTCAAAGCAG CCCACATACT ACCATTTCCA GCCATAATAA 5880 

AAATCACATC TTCTCTGACT AAAAATAATT TTTCTGCAAA TTCAAGGAAT CTATCCGGCC 5940 

TTTTTTCTGG ATCCAACCTT CCAACATAAC AAATGATTTT TTGTTATTTG GAATACAAAA 6000 

TTCTTTTTTA AAGTCTTGAA CACCTACTAC ATCTAAATCG CTATTTGATA CATTAATTCC 6060 

GTTATTTATT GCAACTATCT TCTTATTTTT TATTATACTC TCCAATCTTT TTTTTCATAG 6120 

TTTCAGATAC ACAAATAAAA GCATCTCCCA TAGAATATGT CCAAAAATCA AAATAAGTCA 6180 

AGAATTTCTT TTTTAAGTTA TATTCAACCC ATCCATGGCA TGTTATCACT GTCTTAACCT 6240 

TTCCAAATCC ATTCTTGTCA AGTTTTTTTA ACATATATAA AAAATAATTA GTTGAGTAGC 6300 

CATGACAGTG TATAAGTTGG ATTTTTAATA ATTTTAAAAT ATTTTTAACG TGTAACCCAC 6360 

TTTCAAAATT ATTTGAACAT TGAGTACAAT CAACATAGGC AATATCTAAA TTTTTATAAT 6420 

CATCAATAAC CTTTGAATCT CTAGATACAA TTATCAAAAT AGGGAATAGA GACA 6474 

(2) INFORMATION FOR SEQ 10 NO; 156: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4792 base pairs 
(BJ TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

TATTTAACGA TTTTTTTCAT GTCATTTCCT CCAAAATAGA ATACCTTATA ATCTTAACAG 60 

AAAAAGAGCA TTTACGCCAT TATATGATAT CTATCTCTGT GATAAGTTTT TTTTATGGGT 120 

AATTTAAAAG ACCAAACGCA AGATGGCAAT CAAGACCACT CCAAAGAGAA CTGTTCCGAC 180 

TAGATTGCGG TAGCGAAAGG CTACCCAAGC TGTTGGAAAG ACGGCTAAGA AGTCCAGTCA 240 

TTTGATTTGA GGAAGACTGC CAACCTTACC TGTCACTACG CTTGAAAGAA TCAGGGCAAA 300 



WO 98/18931 



PCT/US97/19588 



1026 



GATAATGGAA 


ACAGGCAAAA 


ACTTCAAAAA 


ACGCTCAACA 


ATCGCAGGCA 


GGCCCTTATA 


360 


CTTGACCAAG 


ATGAAGGGAA 


TCATACGGGG 


AATCCAAGTC 


ACCAAGCCAG 


AGAAAATAAC 


420 


TGCTAATAAA 


AGATACTTAC 


TGACCATCTA 


AAACCACCCC 


CATGCTACAA 


CCAAGTAGCG 


480 


TCGCAAACAG 


AACAGCTAGT 


GACTGAGACA 


TCACTGTCAA 


GAGCAAAAAG 


AAGGACACCG 


S40 


CAACAACTGC 


TAGGATAATG 


AGCAGATTGC 


GGACAGGAAT 


CCGTCTTTGC 


ATAATCTGAA 


600 


ATTGCGAAGC 


AAAATACCAA 


TAAACATCCC 


AACCAGGGCA 


AAATCCAAGC 


CAAAGATTTC 


660 


TGGATTTGGT 


AGCAGGCCAC 


CCAGAGCCGT 


TCCGACTACT 


GTCCCCACAA 


ACCAAGCCAC 


720 


ATAGCTGTTA 


AGATTGTTTC 


CGTGCATCCA 


CATAGGATTT 


ACCTTGTCTG 


TATGGGCCAA 


780 


TTCACCCATC 


AAAACGCCAT 


AGGTCTCATC 


TGTCAAGATA 


CTAGACATAC 


CGATATTGTA 


840 


CCAAAGACTG 


GTATGACGCA 


AATAAGTCGA 


TGCGTGTAAA 


CTCAACAAAA 


AGAGACGCAA 


900 


GTTGATTAGA 


AAAACCGTCA 


TAGCAATAGC 


TGCCACAGGA 


GCTTGAACCA 


CAATCAGTGC 


960 


CAACATGGCA 


AACTGGGCAC 


TCCCAGCATA 


AACAAAGAGA 


CTCATCAAGC 


CCATCTCAAC 


1020 


AGGTGTCACA 


TAGGGCGCAC 


CGATAATTCC 


ACAGGCCAGG 


CCGATACTGA 


CATAGCCAAG 


1080 


AGCCGTTGGC 


ATGGCTGCCT 


GCGCCCCCTC 


CTAAAATCCT 


TTTTCTTTCA 


TCTTTCTCCT 


1140 


CATATTGTCT 


TAATAATACT 


CAATGAAAAT 


CAAACAGCAA 


ACTAGGAAAC 


TAGCCGCAGG 


1200 


TTGCTCAAAA 


CACTGTTTTG 


AGGTTGCAGA 


TAGAACTGAT 


GAAGTCAGCT 


CAAAACACTG 


1260 


TTTTGAGGTT 


GTGGATAGAA CTGACGAAGT 


CAGCTCAAAA 


CACCGTTTTG 


AGGTTGTGGA 


1320 


TAGAACTGAC 


GAAGTCAGTA 


ACCATACCTA 


CGGCAAAGTG 


AAGCTGACGT 


GGTTTGAAGA 


1380 


GAGTTTCGAA 


GAGTACAAGT 


AGGCTGAAAA 


GAATCCAACC 


ACAGCATGGA 


CTATTATATA 


1440 


GCAGATTGAA 


ATAAGATGAG 


AACAAATCGA 


TTGGGAAAGT 


AAAATTAATT 


TCTATAAATG 


1500 


TTTTAGCAAT 


TGTTTCGTAC 


TATTTTAGAT 


TCAGTCTATT 


ATAACACATT 


CAGAAAAGAG 


1560 


AAAAAAGTCT 


GTTGATTTTG 


ACCATCATAA 


AAAGACTGGC 


AATCCAGTCT 


CAAACATATA 


1620 




TCTCCACTAA 


ATACTTTCAC 


GAATATTCAG 


AAGCATAACA 


AAGGCAACTA 


1680 


GAAGAAATAG 


CAATAAAACA 


AAGCTAACTG 


CCAGAGTTCC 


AAAGCTAGTA 


GCAATGGTTA 


1740 


CCAAAGCTAT 


TGTAAATAAG 


CTAGGTAAAA 


CAACCGTAAT 


GGCACCGATA 


GAGGATTGAA 


1800 


CTGCTCCCAT 


TGACTCCTCA 


GGTATTTGTT 


TAAAAACGAG 


TTCTTGCAAT 


CTAGGAGAGA 


1860 


GAACACCTGC 


GAAAAAGGCA TCCAAGGTAC 


TAAAGATGAG 


AATCCAGTCA 


AAACGAACTG 


1920 


TGGCAAATCC 


TACTAGAAGA 


AGCAACTGGA 


TGACAAGTGA 


GGCATAGAGA 


GCTGTTTTTA 


1980 


TGGAAATGGT 


ATGTTGCAGA 


TAGCCACTTA 


CAAGGCTTCC 


GACAATCAGG 


GCTGATAATT 


2040 


CTAGTGTGGC 


TAACAAGGCA 


AGAGATTGAC 


CAGTTTGTAA 


ATTCAAAAAG 


GGCTGGTTCC 


2100 
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TTAAAAATAG AGTGGAAATA GGAACCCTAA CATTTATCAC TGCTTGACTA GTAGAGATAA 2160 
TAAACAAAAC CAAGAGCACC TTATTCATAT TCCATATCAA TTTCGATGAT TGGAGCAAAT 2220 
GCTGGCAAAA GGATTTTACA GAGACTCCTT CTTGATAGCT AATCGTTTTT TCTACTTTCA 2280 
AGAGGTCAGT TTTTATGAAG AGGATACCTA AAAATGCGAT TAAAAAGGTA AGAGCGTTCA 2340 
GTAAGGAAAT AAACTGGATG GATAGAATGC CTAGTAAGAC TCCTCCTAGG ATATTACTGA 2400 
TTGTTTTCAC TAAACTAACA GTTGACTGTT TAAAGCCAAT AGCTTCTGCC AGATGGTCTT 2460 
GCCCAATAAT TCTAATGAAA ATCGGAGTGA GCATGGCGCC TGAAAAATAA CTCAATGTGT 2520 
CAGACAAGAG GTTAATCAGA CAAATAAATG CTACTAGCAA CAAGGAGAAA GACTGCCCTG 2580 
AAAGTGATAA AGACACTATA GAGTAAAGCA AAAATTTTCC AAAACTAATG ACTGTGTATT 2640 
TCAAGACACG ATGATGTTGA AAATCCGCCA AAACTCCCAG AAAGATTTGT AGAACTTGGG 2700 
GCAGGGTTTC TGAAATCGTG ATGAGTAAAA TCGCCAAAGG GGCAAAAGAT GCATCTGCCA 2760 
CATAATTCAG GAAGGCCAGA TAAAAAATCG TATCCCCAAG CCTTGAAATC CACTGGTTGA 2820 

TAGTTAATTG CCTAAAATCT CTATTTTGAA GAAATACTTT CATCACAACT CCTTCTTAAG 2880 

TTCAAATGGG AATCTTTCCC CAAGGATAGA CCGCGATACT ACTAACAACC AAAATTACAG 294 0 

TAACATCAAA AGCTGACCAA TGCCATTGTA GACTATATGC AGTCCAATAG CCCAATAAAT 3000 

TGACTTTGTC ATTCTAAATA AGACTGCAAA TATAAGACCT CCACCCATAT AGAAGACAAA 3060 

GTCTGTCAAG ACCCAACCGT GATTACTAAT GTGCGAGACC CCAAATAAAA CAGCGGAACC 3120 

AAGTACATCT AGCCCCCATT TCTTTCCTTT TTCCAGAGCA GTCATCACTA ATCCACGATA 3180 

AATCATGTCT TCAAAAATGG GACCTGCAAT CACAGGATAA AAAAAATACA TCAAAAATGC 3240 

TGTAGCCCCT GTAAAAGTCG GAGCAGCATG TTGATAAGAA ATTTCATTTC GAGTAGGTGG 3300 

GAAAAGAAAA AAGGTAACGA AATTCCAAAC AACAAAAGCA AGCAGAGCTA GGAAGGAATA 3360 

GAAAAGATAG GATCCTTTAA ACTTTCTACT ATTGATTTTC TGCCATTTCC CCGACCAAAT 3420 

CATAGCAATA AGAGCAAATA AAACCACAAG AAAATTCAAC ATCATATCCG ACAGATAATA 3480 

GGCAAAGTCA GATAGCCCAG TAACAAGGTC GCTGCGTAAA ACTAGAACAC TGAACTTCTG 3540 

GTCAGCAATA ACTAGTAGAA AAACTATAAT AAAGTAGCGG TGTGAGATTA TCTTTTTCAT 3600 

ATATCACCTT TCTAATATCC AAATACCAAT AAAGTAACAA TGAGTAAGAA ACTATTCCAT 3660 

GAAGCATGCA GAGCTATAGC CCAATAGATG GATCGGGTGT AGCGAAACAT CATACAAAAT 3720 

ATCAAGCCCA TTCCAAAATA CTTTATGAAA TCTGTCGTTA TCCAACCATA CTCCAAAACA 3780 

TGCATAGCGC CAAATATGGC AGCGGAAACA AGAACATCAA GAT ACT AT CT CTTAACTTTA 3840 
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GATAAACTTG TCATCAAAAG ACCACGACAA ACAACCTCTT CTGATACAGG TGCGATAATA 3900 

CTAGTATAAA GTATTCGCGT AACAAAATAG CTAATTCCTG TTAAATTGGT GGCTACTTCT 3960 

ACGACTGTAC TTCCATTCTG GGTACGAGCA AAGATATAGG TTGTTAGATT TCCCCACACG 4020 

AACAATAAGA AAAAAGAAAG AAGGAAAACA CCCAGGTAAG ACCAACGAAA CTGGAAACGA 4080 

CCACACTCTT TCCAATGTTC ACTTTTGACA AAAGCAATTG TAGCTATAGT TCCCAGAATA 4140 

AGTACCAATA AAACTTGGAA CACATAGTAC ATATTATCAG ACAAAGCAAC CATAAAATCT 4200 

AAGTCTGATG TGACATTAAA AATGAGGTAA TAAGTCAAAA TCAACAAGCC AGTTGCTAGG 4260 

TGAAATTTCA CTTCTTTCAT TTTCTTCATC CTATTATCTC CTATAAGAGC CTATCTTCTA 4320 

CGGCGGCCAA ACAATCCATC TGCTAAATCT ATAGTCCAAT CAAAAGCTCC ACGATTAGGA 43B0 

CTCATCCCTT GATTGCCCCA ACCAGGGTAA ATTCCTGGGA CGCCCCAACC AGATATACCA 4440 

CTTCTTCCAC CACCTCCCAT AGAATTTACG AGGTTGCCTC CTCTAACATC TTGCAACTCA 4500 

GCTTCTGTCA ATTCCATTGT TTCTGCAAAT TGTAAATTTA ACATCTTTTA CACTCCTTCA 4560 

ATTATCTTCA TTTGTAAACC ACTTCTGCGA CCTAGGATTT GCTTCAAGTG CTTTACAAGT 4 620 

ACAGTATAAC ACGAACATTG GCTTATTTTA GAAAATCGCA TATTTGATAT TTTTTCTTAT 4680 

AGAAATTTCA GATTTGCGAT TTTGGTGAAT TTGATTACTT CTCTGGTATA ATAAAGTTAC 4740 

TACTAATGAG GAGTGGAGAA ATATGAAGAA ACAAATTTTA ACATTATTGA AA 4792 

(2> INFORHAT ION FOR SBQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

f A) LENGTH: 2156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

CCGTTCTCGG CGACGGCCAT CTGATGAAGC TATTTATGAG GGAAACTGGC AAGCTGGAGA 60 

GTCAGAGTAT CTAGTCTTTC ACCGATTGCT GTGGCAGCAG ATGTGCAGGG AAAAGGAGTT 120 

GCTCAAACCT TCTTAGAGGG CTTGATTGAA GGTTTTGATT ATCTTGATTT TCGCTCAGAT 180 

ACGCATGCTG AAAACAAGGT TATGCAACAT ATTTTTGAAA AACTTGGTTT TAAACAAGTC 240 

GGTAAGATGC CAGTAGATGG CGAACCCTTG GCCTATCAAG AATTAAAGAA ATAATGCAAA 300 

AGAAGTATGT AAAAATCCTC TACTCCTCAC CAATTGGTAT TCTATCACTT GTAGCTGATG 360 

ACCATTATTT GTATGGAATT TGGGTTCAGG AGCAGAAGCA TTTTGAGAGG GGACTAGGAG 420 

ATGAAACGAT AGAAGAAGTT GTTAGTCATC CTATTTTAGA CCCAGTTATT GCTTGCTTAG 480 
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ATGATTACTT TAAAGGCAAG CCTCAGGATT TATCCAACTT GCTCTTGGCG CCAATCGGAA 540 

CGAATTTTGA AAAGAGAGTT TGGGACTATT TACAGGGCAT TCCTTATGGT CAGACAGTGA 600 

CCTATGGACA AATTGCTCAA GACCTGCAAG TGGCTTCTGC TCAAGCAATT GGTGGAGCAG 660 

TGGGACGCAA TCCTTGGTCT ATCCTAGTAC CTTGTCATCG TGTGTTGCGA GCAGGCAAGC 720 

GTCTGACAGG TTATGCTGCA GGAGTGGAAA AGAAAGCTTG GCTCTTGGAG CATGAAGGAG 780 

TAGATTTTAA AGATAGAAGC AATAGAAGGA GAAGCACATG TTAGAATTTA TCGAATACCC 840 

CAAATGTTCA ACTTGTAAAA AAGCAAAACA AGAATTAAAT CAATTAGGTG TGGACTATAA 900 

AGCCGTCCAT ATCGTGGAAG AAACACCTAG CCAAGAAGTC ATTTTGAATT GGCTAGAAAC 960 

CTCAGGATTT GAATTGAACC AATTTTTCAA CACCACTGGT ATCAAATACC GTGAATTAGG 1020 

GCTAAAAGAT AAGGTAGGAA GTTTGTCAAA CCAAGAACCC GCTGAGTTGC TAGCAAGTGA 1080 

CGGTATGTTG TTAAAACGGC CCATTTTAGT AGAAAATGGA ACTGTTAAGC AAATCGGTTA 1140 

TCGAAAATCT TATGAGGAAC TGGGACTGAA ATAGTTTTTA TCTATCTCTT TGATAGATAA 1200 

AATATATAAC TTCCCTGTTT CAAAGTATGA TAAACTAGTA GGTAGACAAA GTCTGTATCT 1260 

GACCGTAGCA AATAATTTCA TTGACGGCAG AAGCATCGTA GCATGAATCA TTATCAGAAG 1320 

AGGATGTTTT TATGAATGTT ACAACGATTT TAGCATCAGA TTGGTACCAA AACTTGATGC 1380 

AATTGATTCC GGATGGCAAG CTGTTTAGCC TACGTTCGGT CTTTGATGGA ATCCCTAGAA 1440 

TTGTCCAACA ACTTCCAACA ACAATTATGT TGACAATTGG TGGTGCCCTT TTTGGCTTGG 1500 

TTTTGGCGCT TCTTTTTGCC ATTGTGAAGA TCAATCGTGT CAAGATTTTA TATCCCTTGC 1560 

AGGCCTTCTT TGTTAGTTTC TTAAAAGGGA CACcGATTTT GGTGCAACTC ATGTTGACCT 1620 

ACTACGGAAT CCCTTTGGCT TTGAAAGCCC TCAATCAGCA ATGGGGAACT GGTCTCAATA 1680 

TCAATGCGAT TCCAGCTGCA GCTTTTGCGA TTGTCGCCTT TCCCTTTAAT GAGGCAGCTT 1740 

ATGCTAGTGA AACCATTCGT GCAGCCATTC TCTCAGTTAA TCCTGGTGAG ATTGAGGCGG 1800 

CACGCAGTCT GGGTATGACC CGAGCGCAAG TTTATCGACG AGTGATTATT CCTAATGCAG I860 

CGGTGGTAGC TACTCCAACC TTGATTAATT CCCTCATCGG TTTGACCAAG GCAACATCTC 1920 

TAGCTTTTAG TGCGGGTGTT GTGGAAGTCT TTGCCCAAGC TCAGATTCTA GGTGGAGCTG 1980 

ATTATCGCTA TTTTGAACGC TTCATCTCCG TTGCCCTTGT TTATTGGGTA GTCAATATCG 2040 

GAATTGAAAG CCTCGGTCGT TTCATCGAGA GAAAAATGGC TATTTCTGCA CCTGATACAG 2100 

TGCAACAGAT GTGAAAGGAG ACCTTCGTTA ATGATTAAGA TTTCGAATTT AACCAA 2156 
(2) INFORMATION POR SEQ ID NO: 15B: 
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U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3140 base pairs 
(BJ TYPE: nucleic acid 
(C) STRANDEDNESS : double 
{D> TOPOLOGY: linear 



(xi> 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


158: 






GTATCTCTAC 


ACATGTCTTC 


AATCGATTTT 


GTTGTCCTCC 


AATTTAATTC 


CTTATATGCT 


60 


TTGTCTGCAT 


TTGCATAACA 


AGTTGCAACG 


TCTCCTGAAC 


GTCTTGGAAC 


TATTTTATAA 


120 


GGAATAGGGA 


TCTTATTAAC 


ACTTTCAAAT 


GTATTTACAA 


GTTGTAATAC 


ACTAGTGCCT 


180 


TCTCCCGAGC 


CTAGGTTATA 


GATATAAACA 


TCTGTTTTTT 


CAGATACTTT 


TTCTAAAGCT 


240 


TTTATATCTC 


CTATTGCTAA 


ATCTACTACA 


TGGATATAAT 


CACGCACACC 


AGTACCATCA 


300 


AGCGTATCAT 


AATCATTTCC 


GAACACACTT 


AGCTCTGATA 


CCTTACCTAC 


CGCTACTTGT 


360 


GCAATATAAG 


GCATCAAGTT 


GTTAGGAATT 


CCTGAGGGAT 


CTTCCCCAAT 


CAAACCAGAC 


420 


TCATGAGCAC 


CAATTGGATT 


GAAATAACGA 


AGCAACGCAA 


TACTCCATTC 


TGAATCTGCC 


460 


ACATGAACAT 


CTTTTAAAAT 


TTGCTCAAGC 


ATCACTTTCG 


TATACCCATA 


AGGATTTGTC 


540 


GCACTTGTTT 


GCATCGTCTC 


AATTAGAGGT 


GACTGATTGT 


TAATTCCATA 


TACAGTCGCA 


600 


CTTGAAGAAA 


AGACAATCTT TTTAACATTA 


AATTCTGACA 


TCACTTCAAC 


AAGTGCCAAT 


660 


GTACTCATAA 


TATTATTTTT 


GTAGTACATC 


ACAGGCTTTT 


GCACGGATTC 


TCCGACAGCT 


720 


TTATAACCTG 


CAAAATGAAT 


TGCAGCATCA 


ATCGATTCTT 


GTTCAAATAC 


CTTTCTCAAT 


780 


GCTTGTTTAT 


CACAAACATC 


TAATTCGTAA 


AACACGGGAC 


GTATTCCTGT 


AATTGCTTCA 


840 


ATACGGTCTA 


GCACCAAGAT 


GCTAGAGTTC 


GAAAGGTTGT 


CGACAATGAT 


AACTTCCTTT 


900 


CCTAAATTTA 


GTAATTCTAC 


TACGGTATGG 


CTACCAATAT 


AACCAGCTCC 


GCCTGTTACC 


960 


AATATTGCCA 


TCTGGGTTTC 


CTCCTAATTA 


ATTCCAACCG 


ACTTAACAAA 


TCTCATAAAC 


1020 


GCTTCATGCC 


CAGACGGTGT 


ATTCTTATAA 


ACTCCTGCAT 


CTTCCAGAAC 


TCTCGCAAAC 


1080 


ACTTGTCCTG 


CTTCGTGTTG 


AACTACGCTA 


TTAACCTCTT 


CTTTATTAAT 


GCGAGGATAT 


1140 


TTTTCTTTCA 


ATTGGTCGGC 


CCATTCTAAA 


TGATAATCCG 


CAATTGCATT 


ATCCTCTCCT 


1200 


AAAAGATATT 


TTCCAACTTC 


TTCTAACTCT 


GGTTTCAAAC 


GAGGTGGTAA 


TATCGCAAGT 


1260 


CCCATCACTT CGATTAACCC 


GATATTTTCC 


TTTTTAATAT 


GTTGTACATC 


TTGATGAGGA 


1320 


TGGAAAACAC 


CATCTGGGTA 


TTGTTCAGTA 


GTATGATTAT 


CTCTTAGAAC 


AATATCTAAT 


1380 


TCGTATCTCC 


CGTCCACTTT 


ACGAGCAATA 


GGAGTCACCG 


TATGGTGTGG 


GACATCTTCA 


1440 


GTCATAGCAA TGATGTCTAC 


TTCTAAATCT 


GAATATTCTC 


TCCACTTATT 


TAGAATTTTA 


1500 



WO 98/18931 PCTAJS97/19S88 



1031 



ri'Vhnr^Th a at* 


CTAACAAGCG 


ATTTTTATTT 


TCACTTTGTA 


ACCTAATTAC 


TGACATTGGC 


1560 


V— rl 1 1 l InLnA 


TACCAGCATT 


AACATCCTCA 


AAGTCTTTAA 


AACAAAATTC 


ACTCTCAAAT 


1620 


1 1 IAjI. 1 1 U 1 


CCATTGGGAA AATATGTTTC CCTCCCTGGT AGTGGTTATG ACTAAGAATG 


1680 




AGATAGGAAG 


ATCAGAATTT 


GAACCAGCAA 


AATATCCTGG 


CAAAATATCA 


1740 


nt*nn * V. 1 L LA 


ATAATTGTTC 


AAATGTTTTA 


GAGGTAATAG 


CCATTGGTAC 


ATGTTGACTA 


1800 


TTuAAAAATA 


TCGCATGCTC 


ATTAAAGTAT 


GAGTAGGGAG 


AATACTGGAA 


TCCCCATACT 


I860 


TIajTuACCAA 


GTTTCAACCG 


AATAATTCTA 


TGATTCGAAC 


GTGCTGGATA 




1920 


CCCTGATATC 


CTTCATTTTC 


CATACATAGT 


AAACATTTGG 


GATAATTAGT 


4 V>V> * * 4 1 l 


1980 


AATTTTTCAG 


CAGCAATTGT 


TTTTGGATCT 


TTTTCGGGTT 


TTGACAAATT 




2040 


TCTAGCTCTC 


CGTATTTAGT 


TGATGCTCGA 


AACTCAATAT 


TCTTAGCAAT 




2100 


i 1 AATATAAT 


CACTATCTTT 


ACTTAACTTA 


TAAAACTCTT 


CAACTGCTTC 


TTCACGTRAT 


2160 


ATATCATATG 


AACTCCAAAA 


AATATCATTT 


AATCGACTAG 


GTAAAGGAAC 


TATflAAATTP 


2220 


ATTAACTCTG 


CTCCTAAACA 


TTCCTTTTCC 


TCGATTAAAT 


PTTT A ATTTT 




2280 


AAGGCGATTT 


CCACTAAGTA 


ATCTTTTATT 


TGTTTCAGGT 


CATTTTtATf* 


flfiAAATIVCA 


2340 


TCAATTCCCT 


CCTCACCTAT 


TAACGCTAGT 


ACTCPATTTT 




TTTflTPA ATT 
lit uIwnAl I 


2400 


TCATTATACA 


TTCCGTATTC 


AATTACTCTA 


TCAACAAAAT 


TATf?AATAAT 




2460 


TArrrnvrr 


TCTAATTTAT 


GTTCCCATAT 


TTTCT AT AC A 


VTA'T^ff* A ITT 


i\ 1 AAA X 1 O^ta- X 


2520 


TGCGTAGTAT 


GAGCAATTTT 


ATCAAGGTGA 


TGAATAATAT 




Ma 1 1 a\. 1 


2580 


GAAACGTTCC 


CATCATCTTC 


AAATATGTAA 


TTCATTATTT 


TCTTTTCCAT 


ATTTATACTA 


2640 


AGCTCTTCTA 


TCTCATTCTG 


TTTTTGTATA 


ACAACCATAT 


CTAAACATCC 


AGATTGTTCC 


2700 


TCTCTATAAC 


AAGATATAGC 


CCTATTCATA 


TGCAGTCCGA 


TAACTTCATG 


AAGTATTTTT 


2760 


ATTTTTGAAA 


TAATTTTCTT 


CAAAATTTCA 


TTATTTTGAA 


GAATCTGTAG 


ATTTTTTAAA 


2820 


ATTTCAACAA 


TTCTATCCCC 


AATACGTTCA 


ATGTCAGTTG 


ATATTTTTAT 


TACACTAATA 


2880 


ATTCTTCXTA 


AGTCATATGA 


AACAGGATGT 


TGTAAACAAA 


TTAACTCATA 


TCCTTTTTTA 


2940 


TCAATATTTA 


GAACTGACTC 


ATTTATGATT 


AAATCTTCTT 


TAATCAATTC 


TACTCGTTCT 


3000 


TCATTTGATA 


AATATTCAAA 


TAACTTCTCA 


TATTTATCAA 


GCACAGATAC 


CCAAATGGTC 


3060 


TCTAAATTAT 


TTGATAATTC 


TATAATTTCA 


TTTTCTAAAT 


ATAACCTTAA 


CATTTACGTA 


3120 


CCTCTTCTTA 


ACAAAGTTCG 










3140 


(2) INFORMATION FOR SEQ ID NO: 159: 
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ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 9048 base pairs 
IB) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
tD) TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 


159: 






CCGGATGATT 


TCCTGGTCAG 


ATAGGGGGAA AGTGACTTCC 


TCAGCAATCG 


CGCCTAGAGT 


60 


AGGATTCCCT 


TCACGGATAA 


TATCGTTCAT ATCAATTAAG 


TGAGCAGCTT 


TTGTAATACG 


120 


TTCTATTGCA 


GACATTTTCT 


CTCCTTATAT TATGTTTAGT 


GCAGTTAGCT 


ACTGCCAAAG 


1B0 


CCCAAGTGGT 


ATACTTGGAA 


TAACCCACTG TGGATTAGTT 


CATTTTCTTT 


CATTACCTCT 


240 


ACATGATATC 


ACAAAATGAC 


AAGAATTGAA AGCATTATGG 


CATTTAGGAT 


TTATAGAAAA 


300 


TAGATAGGAA 


GTTCAATTCA 


ATTGTGAAAG AAATACTTAT 


CTGTGATATA ATAAAAAGAA 


360 


AAGGCTTGCA 


TAAGAAAGTA 


GGGAGAACGA AGATACAAAG 


AAGACAAAAT 


CGAAATCAGG 


420 


GTGGTTTAGC 


TTTTCGTTTT 


ATGAAGGGCT TGGTAAACTT 


TTTAGGAGTT 


ATCGCAAGTG 


480 


GAGCAATAAG 


GGATTTGTGG 


CGATACTCTT GCTAGCAGTT 


GGTTTATCAA 


TGGGCTTGGT 


540 


CTTGTIX>TTT 


GAAAGCTTCC 


AAGGAATCCC TTGACTAGTC 


AAAAACGAGA 


TACTATTTCT 


600 


CAAGAGGGGA 


CTAAGCAAAA 


GTCTCAGGAG TAGGAAGAGG 


AAAAAACTGC 


CAGAATTATG 


660 


GCCCACGGGG 


ATTTGCTCTA 


CCACGATGGA CTT1TCTTTT 


CAGCTAAAAA 


AGAAGACGGT 


720 


ACCTATGACT 


TTCATGAAAA 


TTTTGAGTAT GTGACTCCTT 


GGCTCAAGCA 


AGGGGACTAA 


780 


GCAGCAGATT 


TAGCTATTGG 


TGATTTTGAA GGAACCATTA 


ATAAGGATCA 


TTATTTAGCG 


o a n 

840 


GGTTATCTTC 


TCTTTAATGC 


TCCTGTTGAA GTTATGGATG 


CTATTAAGGA 


GGCAGGTTAT 


900 


CATGTGCTGG 


ATTTAGCTCA 


TAATCATATT TTGGATTCGC 


AAATTGAGGG 


AGTTATTTCA 


960 


ACGGCCGATA 


TTATTGAGAA 


AGCTGGAATC ACTCCAATCG 


GAGTTTATAC 


GCACGAACCA 


1020 


CGTGATCAGG 


CTCCGCTGGT 


CATTAAGGAA GTGAATGGTA 


TCAAGGTTCC 


ATTGTTACCC 


1080 


TATTCCTATG 


GTTTCAATGG 


AATTGAGCAG TATATTTCTC 


AGGAAGACTA 


TAATCGTTAT 


1140 


CTTTCAGATT 


TAAACGAAGA 


TAAGATGAAG GTTGAAATTG 


AACGGGCAGA 


GAAGGAAGCA 


1200 


GATATCACCA 


TTATCATGCT 


TCAGATGGGT GTTGAGTATC 


GATTGGAACC 


AACTGAAGAA 


1260 


CAAAAAGCTC 


TTTATCACAA 


GATGATCGAT TTGGGAGCGG 


ATATTATCTT 


TGGAGGGCAT 


1320 


CCTCACGTTG 


TTGAACCATC 


TGAAACGGTT GAAAAAGATG 


GAGATAACAA ACTCATTATC 


1380 


TATTAAATGG 


GGAACTTCAT 


TTCCAATCAA CGAATTGAAT 


CTATGGGAGA 


TGAAGAGAAT 


1440 


GCTAAGTGGA 


CTGAACGTGG 


TGTTCTCATG GATGTCACCA 


TCAAGAAGAA 


GGATGGAAAA 


1500 
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AGCTCATCCT 


ACTTGGGTCA 


ATCGAACACC 


AAAGGGAACC 


1560 




aawjATAtci. 


l:" l" l\iT ATC AT 


TACCAAACTT 


ATATTTTGGA 


AGATTTTATA 


1620 


fl Ann A TWIT 1 & 


I* IVATvAaTG A 


CCAGTTAGAT 


GAAGCGACTA 


AGGAACGAAT 


TGATACAGCC 


1680 


1 AlAnAuAAA 


TGAATGAACA 


TGTGGGATTG 


AAGTGGTATT 


AGCTTGAATC 


CAGAGGAAAG 


1740 


T A A & TVt A TV* A 


*TAACGTAAT 


TGCGACAGAT 


ATGGATGGGA 


CCTTGCTGGA 


TGCTAGAGGT 


1800 




TCCCACGATT 


GGAAAAGATT 


TTAGATCAGT 


TGGATCAAAG 


GGGCATTCGT 


1860 


1 1 l\J luAl lVj 


CGACGGGCAA 


TGAAATTCAC 


CGCATGAGAC 


AACTACTGAG 


TCCCTTGGTG 


1920 




TTCTGGTTGT 


TGCTAATGGC 


GCTCGTATTT 


TTGAAAACAA 


TGAATTGATT 


1980 


CAGGCTCAGA 


CATGGGATGA 


CGCCATTGTC 


AACAAGGCTT 


TGACTCATTT 


CAAGGGTCGA 


2040 


GCGTGTCAGG 


ACCAGTTTGT 


TGTAACGGGG 


ATGAAGGGTG 


ATTTTGTCAA 


GGAAGGTACG 


2100 


ATTTTTACAG 


ATCTTGAAAG 


TTTTATGACT 


CCAGAAATGA 


TTGAAAAATT 


CTACCAACGG 


2160 


ATGCAATTTG 


TGGATGAATT 


AACATCTGAC 


CTCTTTGGTG 


GTGTGCTCAA 


GATGAGCATG 


2220 


GTTGTTGGTG 


AGGAACGTTT 


GAGTTCGGTT 


TTGGAAGAAA 


TCAATGCTCT 


CTTTGATGGC 


2280 


CGTGTCCGAG 


CTGTATCCAG 


TGGCTATGGT 


TGCATTGATA 


TCCTCCAAGC 


TGGGATTCAT 


2340 


AAAGCATGGG 


GCTTGGAGGA 


ATTACTCAAG 


CGCTGGGACT 


TGAAATCCCA 


AGAAATCATG 


2400 


GCTTTTGGTG ATAGTGAAAA TGATGTTGAA 


ATGCTTGAAA 


TGGCTGGAAT 


TGCCTATGCG 


2460 


ATGGAAAATG 


CTGATGAGAA 


AGCCAAAGCT 


GTGGCGACTG 


CTCTAGCACC 


AGCCAACAGC 


2520 


CAAGGAGGAG 


TTTATCAAGT 


CTTGGAAAAC 


TGGTTAGAAA 


AAGGAGAATG 


AAGTGGCAGT 


2580 


ACAGTTATTA GAAAATTGGC 


TCCTAAAGGA 


ACAAGAAAAA 


ATTCAAACTA 


AGTATCGTCA 


2640 


CCTAAATCAC 


ATTTCTGTTG 


TAGAACCAAA 


CATTCTTTTT 


ATTGGGGATT 


CCATTGTCGA 


2700 


GTATTATCCT 


CTACAGGAGC 


TATTTGGCAC 


TTCAAAGACG 


ATTGTCAATC 


GAGGAATTCG 


2760 


TGGCTATCAG 


ACAGGACTGT 


TACTAGAGAA 


CCTTGATGCT 


CATCTATATG 


GTGGAGCAGT 


2820 


AGATAAAATT 


TTTCTTCTGA 


TTGGGACAAA 


TGATATCGGA 


AAGGATQTTC 


CTGTGAATGA 


2880 


GGCTCTCAAT 


AATCTCGAAG 


CTATCATTCA 


ATCCGTTGCT 


CGCGATTATC 


CATTGACAGA 


2940 


GATTAAATTG 


CTTTCCATTT 


TGCCTGTCAA 


TGAGAGAGAG 


GAGTACCAGC 


AGGCAGTCTA 


3000 


TATCCGCTCG 


AATGAAAAAA 


TTCAGAACTG 


GAATCAAGCC 


TATCAAGAGC 


TTGCATCTGC 


3060 


CTATATGCAG 


GTGGAATTTG 


TGCCAGTATT 


TGATTGTTTG 


ACAGACCAAG 


CAGGCCAACT 


3120 


CAAAAAAGAA 


TATACAACTG 


ATGGACTGCA 


CCTCAGTATT 


GCTGGTTATC 


AGGCTTTGTC 


3180 


AAAATCCTTG 


AAAGACTATC 


TTTACTAAAT 


AGCTAAATAA 


TGTTAAATTT 


GAGCATAATA 


3240 



WO 98/18931 



PCT/US97/19588 



1034 



TCTTGTAAAA 


AATTCTAAAA 


TCCTTTAAAA 


TAAAAAGTGA 


CGGAGGAATT 


TATGAATGTA 


3300 


AATCAGATTG 


TACGGATTAT 


TCCTACTTTA 


AAAGCTAATA 


ATAGAAAATT 


AAATGAAACA 


3360 


TTTTATATTG 


AAACCCTTGG 


AATGAAGGCC 


TTGTTAGAAG 


AATCGGCCTT 


TCTGTCACTA 


3420 


GGTGACCAAA 


CGGCTCTTGA 


AAAGCTGGTT 


TTAGAAGAAG 


CTCCCAGTAT 


GCGTACTCGT 


3480 


AACGTAGAGG GAAGAAAAAA ACTAGCTAGA TTGATTGTCA - AG6TGGAAAA TCCCTTAGAA 


3S40 


ATTGAAGGAA 


TCTTATCTAA 


AACAGATTCG 


ATTCATCGAT 


TATATAAAGG 


TCAAAATGGC 


3600 


TACGCTTTTG 


AAATTTTCTC 


ACCAGAAGAT 


GATTTGATTT 


TGATTCATGC 


GGAAGATGAC 


3660 


ATAGCAAGTC 


TAGTAGAAGT 


AGGAGAAAAG 


CCTGAATTTC 


AAACAGATTT 


GGCATCAATT 


3720 


TCTTTAAGTA 


AATTTGAGAT 


TTCTATGGAA 


TTACATCTCC 


CAACTGATAT 


CGAAAGTTTC 


3780 


TTGGAATCAT 


CTGAAATTGG 


GGCATCCCTT 


GATTTTATTC 


CAGCTCAGGG 


GCAGGATTTG 


3840 


ACTGTGGACA 


ATACGGTTAC 


CTGGGACTTA 


TCTATGCTCA 


AGTTCTTGGT 


CAATGAATTA 


3900 


GACATAGCAA 


GTCTTCGCCA 


GAAGTTTGAG 


TCTACTGAAT 


ATTTTATTCC 


TAAGTCTGAA 


3960 


AAATTCTTCC 


TTGGTAAAGA 


TAGAAATAAT 


GTTGAATTGT 


GGTTTGAAGA 


AGTATGAAGT 


4020 


GGACCAAGAT 


TATTAAAAAA 


ATAGAAGAAC 


AAATCGAGGC 


AGGGATTTAT 


CCCGGAGCCT 


4080 


CTTTTGCGTA 


TTTTAAGGAC 


AATCAATGGA 


CAGAGTTCTA 


TTTAGGCCAG 


AGTGACCCAG 


4140 


AGCATGCCTT 


GCAGACTGAG 


GCAGGACTAG 


TTTATGACCT 


AGCTAGTGTC 


AGCAAGGTTG 


4200 


TTGGGGTTGG 


CACAGTTTGT 


ACCTTCTTGT 


GGGAAATAGG 


TCAATTAGAT 


ATTGATAGAC 


4260 


TGGTAATAGA 


TTTTTTACCT 


GAGAGTGATT 


ATCCAGACAT 


CACTATTCGC 


CAGCTCTTGA 


4320 


CTCATGCAAC 


AGACCTTGAT 


CCTTTTATTC 


CTAATCGTGA 


TCTTTTAACA 


GCCCCTGAAT 


4380 


TAAAGGAAGC 


GATGTTTCAT 


CTCAACAGAC 


GAAGTCAGCC 


AGCCTTTCTT 


TATTCGGATG 


4440 


TCCATTTTTT 


GCTGTTGGGC 


TTTATTTTGG 


AAAGAATTTT 


TAATCAAGAT 


TTGGATGTGA 


4500 


TTTTAAAGGA 


TCAAGTCTGG 


AAACCTTGGG 


GAATGACGGA 


AACTAAGTTT 


GGGCCAGTTG 


4560 


AGCTTGCTGT 


TCCAACAGTT 


AGAGGTGTAG 


AGGCAGGCAT 


AGTGCATGAT 


CCCAAGGCTC 


4620 


GTCTCCTGGG 


TAGACATGCT 


GGGAGTGCTG 


GTTTATTTTC 


GACTATAAAG 


GATTTACAAA 


4680 


TCTTTTTAGA 


ACACTATTTA 


GCAGATGATT 


TTGCAAGAGA 


CTTAAATCAA 


AATTTTTCTC 


4740 


CTTTGGATGA CAAGGAACGT TCTTTAGCAT 


GGAATTTGGA 


AGGAGATTGG 


CTAGACCATA 


4800 


CGGGCTATAC AGGTACCTTT ATCATGTGGA ATCGTCAGAA GCAAGAAGCC ACTATTTTCC 


4860 


TATCGAATCG 


TACCTATGAA 


AAGGACGAGA 


GAGCTCAATG 


GATATTAGAC 


CGCAATCAAG 


4920 


TGATGAACTT 


GATTCGCAAA 


GAAGAGTAAG 


GAGAGACATG 


TCAAATAGTT 


TAAAAGGGAC 


4980 


TTTACTAACA 


GTTGTGGCTG 


GTATTGCTTG 


GGGGTTGTCA 


GGAACGAGTG 


GCCAATACCT 


5040 
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GGAATTTCGG 


CTCTGGTCTT 


GACTAACTTG 


CGTCTTTTAA 


TCGCTGGTGG 


5X00 




v."*i'\."l"l"GGCTT 


ATGCTACTGC 


AAAGGATAAA 


ATACTGGTC1 


TTTTAAAGGA 


5160 


I MIjAAAVxAvj J 


TTGCTGTCTC 


TTCTTATTTT 


TGCTCTGATT 


GGTCTTTTTC 


TCAACCAATT 


5220 


LuUL 1 AIL I V» 


TCTGCTATTC 


AGGAGACCAA 


TGCGGGAACA 


GCGACGGTCC 


TTCAGTATGT 


5260 




GGAATTTTAA 


TTTATAGCTG 


TATCAAGGAT 


AGGGTGGCAC 


CGACACTGGG 


5340 


AuAuATAuTT 


TCCATCATAT 


TCGCCATCGG 


AGGAACCTTC 


CTGATCGCAA 


CACATGGGCA 


5400 


(TTrGGACCAG 


TTATCCATGA 


CACCTGCTGG 


TCTGTTCTGG 


GGTCTCTTTT 


CTGCCTTGAC 


5460 


TTATGCTCTG 


TATATCATTT 


TACCCATAGC 


CTTGATTAAA 


AAGTGGGGGA 


GCAGCTTGGT 


5520 


CATTGGTGTG 


GGAATGGTCA 


TAGCAGGTTT 


GGTCGCCCTT 


CCTTTTACAG 


GGGTTCTACA 


5580 


GGCCGATATC 


CCGACTAGTC 


TTGATTTTCT 


CCTTGCGTTT 


GCAGGCATTA 


TCCTTATCGG 


5640 


GACTGTCTTT 


GCCTATACAG 


CTTTCCTTAA 


AGGAGCCAGT 


CTGATAGGAC 


CGGTCAAGTC 


5700 


AAGCTTGTTG 


GCTTCAATTG 


AGCCAATATC 


GGCGATTTTC 


TTTGCCTTCT 


TAATAATGAA 


5760 


TGAACAATTT 


TATCCCATTG 


ATTTTCTTGG 


TATGGCAATG 


ATATTGTTTG 


CTGTAACTTT 


5820 


GATTTCTTTG 


AAAGATTTAT 


TCTTAGAAAA 


ATAAAAAAGA 


CTCTTTGTCC 


GTGACAGAGA 


5880 


GTTTTTGCGT 


GGTAATCTAA 


TTATTTTCAA 


GATAAAATTC 


AAAGCGTTCG 


CCTACATATT 


5940 


GACTTTTTAC 


GTATTCAAAA 


GCAGTACCAT 


CTTCTAGGTA 


GGAAACCTGG 


GTCAATCCAA 


6000 


GAATAGCATG 


TCCTTTTTCA 


ACTTCCAAAT 


AGTGGGCAAT 


CTTTTCTTTA 


GCAAGGCGAG 


6060 


CATAGATGGT 


CTGTTGAGAT 


TTGCCGATAC 


GATAGCCATG 


TTTTTGCAAG 


GTTTGGAAGA 


6120 


AATGACTGGT 


GATTTCTTCT 


TTTTTAAAGT 


CCTTAATGAA 


TTTTTCAGGA 


ATAGAAGCAA 


6180 


CTTCATAAAC 


TAGGGGAACT 


TGGTCGGCAT 


AGCGGACCCG 


CTCCATTCGG 


ATAATATTGT 


6240 


CCGTTGGAAA 


AATTCCTAGC 


TTGGCAACTT 


CTTGCTCATT 


GGGAATGGTT 


TTTTTGTAGG 


6300 


AAATGAGCTG 


GCTAGAGGGA 


ACTTTACCTT 


GGGATTTGAC 


AATTTCAGTA 


AAACTGGTTG 


6360 


TCCCTCGCAT 


CTTTTCTTGT 


ACTCGAGTAC 


TGGAAACAAA 


GGTGCCGCTT 


CCTACACGGC 


6420 


GCTCTAAGAC 


GCCTTCTTCG 


ACTAATAGAG 


ATACGGCTTG 


GCGGAGGGTC 


ATGCGACTGA 


6480 


CCGCAAACTG 


CTCAGCTAAA 


TCTCTTTCAC 


TGGGAAGCCT 


CTCACCAATA 


GCCCAACGGT 


6540 


ACTCGTCAAT 


ATCCTTTTTT 


ATCTGATCAT 


GGATTTTTAT 


ATAAGCAGGT 


AGCATATTTT 


6600 


TCACTTCATT 


TCTATCTTTT 


CTCTATTGTA 


CCCCAATAAA 


CTAGAAAAAG 


TCAAACTTCG 


6660 


CCTTGTTTAG 


TTGGTAATTC 


GCCCTTATTT 


GTGATAGAAT 


ATTGAGAAAA 


GATATTTCTT 


6720 


TTGAGAAAGG 


AAAAAGATGA 


GCAACATTTC 


AACTGATTTG 


CAAGATGTAG 


AAAAAATCAT 


6780 
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CGTATTGCAC 


TATGGTAGCC 


AGTACAACCA 


GCTGATTTCA 


CGCCGTATCC 


GTGAGATTGG 


6840 


TGTTTTTTCA GAACTAAAAA 


GCCATAAAAT 


TTCAGCTGCT 


GAAGTTCGTG 


AAGTCAATCC 


6900 


TGTAGGAATT 


ATTCTATCAG 


GTGGTCCAAA 


TTCTGTATAT 


GAAGATGGTT 


CATTTGATAT 


6960 


TGACCCAGAA 


ATCTTCGAAC 


TCGGAATTCC 


AATTTTGGGA 


ATCTGTTATG 


GTATGCAGTT 


7020 


ATTGACCCAT 


AAACTTGGAG 


GAAAAGTTGT 


TCCTGCAGGT 


GATGCTGGAA 


ATCGTGAATA 


7080 


CGGTCAATCA 


ACCCTAACTC 


ACACACCATC 


AGCGCTTTTT 


GAATCAACAC 


CTGATGAACA 


7140 


GACTCTTTTG 


ATGAGCCATG 


GTGATGCGGT 


TACTGAGATT 


CCTGCTGACT 


TTGTTCGTAC 


7200 


AGGTACATCA 


GCTGACTGCC 


CATACGCAGC 


CATCGAAAAC 


CCAGATAAAC 


ACATTTACGG 


7260 


TATCCAATTC 


CACCCAGAAG 


TTCGTCATTC 


TGTATACGGA 


AATGATATCC 


TTCGTAACTT 


7320 


TGCCCTTAAC 


ATTTGTAAGG 


CTAAAGGTGA CTGGTCAATG 


GATAATTTCA 


TTGACATGCA 


7380 


GATCAAAAAA 


ATTCGTGAAA 


CCGTCGGTGA 


TAAACGTGTC 


CTTCTTGGTC 


TATCAGGTGC 


7440 


TGTTGACTCA 


TCTGTCGTTG 


GGGTTCTTCT 


CCAAAAAGCG 


ATTGGCGATC 


AATTGATCTG 


7500 


TATCTTCGTA 


GACCACGGTC 


TTCTTCGTAA 


AGGCGAAGCT 


GATCAAGTTA 


TGGACATGCT 


7560 


CGGTGGTAAG 


TTTGGTTTGA 


ATATCGTCAA AGCAGACGCT 


GCTAAACGTT 


TCCTTGACAA 


7620 


ACTTGCTGGC 


GTTTCTGACC 


CTGAACAAAA 


ACGTAAAATC 


ATCGGTAACG 


AGTTTGTCTA 


7680 


TGTATTCGAT 


GACGAAGCAA 


GCAAGCTCAA 


AGATGTGAAA 


TTCCTTGCTC 


AAGGTACTTT 


7740 


ATATACAGAT 


GTTATCGAGT 


CTGGTACGGA TACAGCTCAA ACTATCAAGT 


CACACCACAA 


7800 


CGTGGtGGTC 


TTCCAGAAGA 


TATGCAGTTT 


GAATTGATTG 


AACCACTCAA 


TACTCTTTAC 


7860 


AAGGATGAAG 


TTCGTGCTCT 


TGGTACAGAG 


CTTGGTATGC 


CAGACCATAT 


CGTATGGCGC 


7920 


CAACCATTCC 


CAGGACCAGG ACTTGCTATC CGTGTCATGG GTGAAATCAC TGAAGAGAAA 


7980 


CTTGAAACCG 


TTCGTGAATC 


AGACGCTATT 


CTTCGTGAAG 


AAATCGCTAA 


AGCTGGACTT 


8040 


GACCGCGATA 


TTTGGCAATA 


CTTCACTGTT 


AACACAGGCG 


TTCGTTCAGT 


CGGTGTTATG 


8100 


GGTGACGGTC 


GTACGTATGA 


CTACACGATT 


GCAATCCGTG 


CTATCACTTC 


TATCGATGGT 


oloO 


ATGACTGCTG 


ATTTTGCCAA 


AATTCCATGG 


GAAGTACTTC 


AAAAAATCTC 


AGTACGTATC 


8220 


GTAAATGAAG 


TGGATCATGT 


TAACCGTATC 


GTCTACGATA 


TTACAAGTAA 


ACCACCTGCA 


8280 


ACAGTTGAGT 


GGGAATAATC 


GCAAAAAAAT 


TAAAAGCTTT 


GTAAAATCAA 


CGGTTACAGA 


8340 


GGATTAAAAA 


CTGTAACTGG 


GATTAAAACG 


GGAACATTTG 


CTAAAAAGAA 


TAAATTGAAT 


8400 


AATAGTTCCA 


AGTGGTTTAC 


ATTTGGACAA 


AAAATTAGAC 


CGTAGTTTTC 


AAGCTGCGGT 


8460 


CTTTTGATAT 


ATATAATGAG 


AATTAATGGC 


TCTTTGTCAA 


CTGTAGTGGG 


TTGAAGTCAG 


8520 


CTAAGCTCGA 


CAAAGGACAA 


ATTTTGTCCT 


TTCTTTTTTG 


ATATTCAGAG 


CGATAAAAAT 


8580 
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CCGTTTTTTG AAGTTTTCAA AGTTCCGAAA ACCAAAGGCA TTGCCCTTGA TAAGTTTGAT 8640 

GAGATTATTG GTCCCTTCCA ATTTGGCGTT AGAATAGTGT ACTTCAAGCG CGTTGACGAT 8700 

TTTCTCTTTG TCCTTTAGAA AGGTTTTAAA GACAGTCTGA AAAAGAGGAT GAACCTGCTT 8760 

TAGATTGTCC TCAATGAGTC CGAAAAATTT CTCCGGTTCC TTATTCTGAA AGTGAAACAG 8820 

CAAGAGTTGA TAGAGCTGAT AGTGATGTTT CAAGTCTTGT GAATAGCTCA AAAGCTTGTT 8880 

TAAAATCTCT TTATTGGTTA AATGCATACG AAAAGTAGGG CGATAAAAAT GTTTATCGCT 8940 

GAGTTTACGA CTATCCTGTT GTATGAGCTT CCAGTAGCGC TTGATAGCCT TGTATTCATG 9000 

AGACTTTCGA TCCAATTGAT TCATGATTTG AACACGCACA CGACTCGG 9048 
(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10399 base pairs 

(B) TYPE: nucleic acid 
fC) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 
GTACCTTTAT TGATGAATGG ACTGTTTAAA TCAGTAGCAC GCCAACCAGA TATGCTTTCT 60 

GAGTTTCGTA GTTTGATGTT TTTAGGTGTT GCCTTTATTG AAGGAACTTT CTTTGTAACT 120 

CTTGTCTTCT CATTTATTAT CAAATAAATA CATGGAACGA GAAGAAAAGG GAGGATTTTA 180 

GATGGAAGAA AGTATTAATC CAATCATCTC TATTGGTCCT GTTATCTTCA ATCTGACTAT 240 

GTTAGCCATG ACTTTGTTGA TTGTGGGAGT TATTTTTGTC TTTATTTATT GGGCAAGCCG 300 

CAATATGACC TTGAAACCCA AAGGAAAGCA AAATGTACTT GAGTATGTCT ATGACTTTGT 360 

TATTGGATTT ACAGAACCTA ACATTGGTTC GCGCTACATG AAAGATTACT CACTCTTTTT 420 

CCTTTGTTTA TTCCTTTTCA TGGTGATTGC CAATAACCTT GGCTTAATGA CAAAGCTTCA 480 

AACGATCGAT GGGACTAACT GGTGGAGTTC GCCAACCGCT AATTTACAGT ATGACTTAAC 54 0 

CTTATCTTTT CTTGTCATTT TGTTGACACA TATAGAAAGC GTTCGTCGTC GTGGATTTAA 600 

AAAAAGTATA AAATCTTTTA TGAGTCCTGT TTTTGTCATA CCGATGAATA TCTTGGAAGA 660 

ATTTACAAAC TTCTTATCTT TGGCTTTGCG GATTTTTGGG AATATCTTTG CAGGAGAGGT 720 

CATGACGAGT TTGTTACTTC TTCTTTCCCA CCAAGCTATT TATTGGTATC CAGTAGCCTT 780 

TGGAGCTAAT TTGGCTTGGA CTGCATTTTC TGTCTTTATT TCCTGCATCC AAGCTTATGT 840 

TTTTACTCTT TTGACATCTG TGTATTTAGG GAATAAGATT AATATTGAAG AGGAATAGAA 900 
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AGGAGTAACT 


GATGCACGTA 


ACAGTAGGTG 


AATTAATTGG 


TAATTTTATT 


TTAATCACTG 


960 


GCTCTTTTAT 


TCTTTTGCTA 


GTCTTCATTA 


AAAAATTTGC 


ATGGTCTAAT 


ATTACAGGCA 


1020 


TTTTCGAAGA 


AAGAGCTGAA 


AAAATTGCTT 


CAGATATTGA 


CAGAGCTGAA 


GAAGCCCGTC 


1080 


AAAAAGCAGA 


AGTATTGGCT 


CAAAAACGCG 


AAGATGAATT 


GGCTGGTAGC 


CGTAAAGAAG 


1140 


CTAAGACAAT 


CATTGAAAAT 


GC AAAGGAAA 


CACCTGAGCA 


AAGTAACGCT 


AATATCTTAG 


1200 


CAGATGCTAA 


ACTAGAAGCA 


GGACACTTAA 


AAGAAAAAGC 


CAATCAAGAA 


ATTGCTCAAA 


1260 


ATAAAGTAGA 


AGCTTTACAG 


AGTGTTAAGG 


GTGAGGTCGC 


AGATTTGACC 


ATCAGCTTAG 


1320 


CTGGTAAAAT 


CATCTCACAA 


AACCTTGACA 


GTCATGCCCA 


TAAAGCACTC 


ATTGATCAGT 


1380 


ATATCGATCA 


GCTAGGAGAA 


GCTTAATGGA 


CAAGAAAACA 


GTAAAGGTAA 


TTGAAAAATA 


1440 


CAGCATGCCT 


TTTGTCCAAT 


TGGTACTTGA 


AAAAGGAGAA 


GAAGACCGTA 


TCTTTTCAGA 


1500 


CTTGACTCAA 


ATCAAGCAAG 


TTGTTGAAAA 


AACAGGTCTG 


CCTTCTTTTT 


TAAAACAAGT 


1560 


GGCAGTAGAC 


GAGTCGGATA 


AGGAAAAAAC 


AATTGCTTTT 


TTCCAAGATT 


CTGTGTCGCC 


1620 


TTTATTACAA 


AACTTTATCC 


AGGTTCTGGC 


CTACAATCAC 


AGAGCAAATC 


TTTTTTATGA 


1680 


TGTGCTTGTA 


GATTGCTTGA 


ACCGACTTGA 


AAAAGAAACA 


AATCGATTTG 


AAGTGACGAT 


1740 


TACGTCTGCT 


CATCCTCTAA 


CTGATGAACA 


GAAGACTCGT 


TTGCTCCCTT 


TGATTGAGAA 


1800 


AAAAATGTCT 


CTGAAAGTAA 


GGAGTGTAAA 


AGAACAAATC 


GATGAAAGTC 


TCATTGGTGC 


1860 


TTTTGTCATT 


TTTGCCAATC 


ACAAGACAAT 


TGATGTGAGT 


ATTAAACAAC 


AACTTAAAGT 


1920 


TGTTAAAGAA 


AATTTGAAAT 


AGAAAGTGGT 


GTTCTTTTGG 


CAATTAACGC 


ACAAGAAATC 


1980 


AGCGCTTTAA 


TTAAGCAACA 


AATTGAAAAT 


TTCAAACCCA 


ATTTTGATGT 


GACTGAAACA 


2040 


GGTGTTGTAA 


CCTATATCGG 


GGACGGTATC 


GCCCGTGCTC 


ACGGCCTTGA 


AAATGTCATG 


2100 


AGTGGAGAGT 


TGTTGAATTT 


TGAAAACGGC 


TCTTATGGTA 


TGGCTCAAAA 


CTTGGAGTCA 


2160 


ACAGACGTTG 


GTATTATCAT 


CCTAGGTGAC 


TTTACAGATA 


TCCGTGAAGG 


CGATACAATC 


2220 




MjtAAAAl LAI 


GGAAGTCCCT 


GTAGGTGAAA 


GTCTGATTGG 


TCGTGTTGiG 


2280 


GATCCGCTTG 


GTCGTCCAGT 


TGACGGTCTT 


GGAGAAATCC 


ACACTGATAA 


AACTCGTCCA 


2340 


GTAGAAGCAC 


CAGCTCCTGG 


TGTTATGCAA 


CGTAAGTCTG 


TTTCAGAACC 


ATTGCAAACT 


2400 


GGTTTGAAAG 


CTATTGACGC 


CCTTGTACCG 


ATTGGTCGTG 


GTCAACGTGA 


GTTGATTATC 


2460 


GGTGACCGTC 


AGACAGGGAA 


AACAACCATT 


GCCATTGATA 


CAATCTTGAA 


CCAAAAAGAT 


2520 


CAAGATATGA 


TCTGTATCTA 


CGTCGCGATT 


GGACAAAAAG 


AATCAACAGT 


TCGTACGCAA 


2S80 


GTAGAAACAC 


TTCGTCAGTA 


CGGTGCCTTG 


GACTACACAA 


TCGTTGTGAC 


AGCCTCTGCT 


2640 


TCACAACCAT 


CTCCATTGCT 


CTTCCTAGCT 


CCTTATGCTG 


GGGTTGCTAT 


GGCGGAAGAA 


2700 
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TTTATGTATC AAGGTAAGCA TGTTTTGATT GTATACGATG ATCTATCAAA ACAAGCGCTA 2760 

GCTTATCGTG AACTGTCGCT CTTGCTTCGT CGTCCTCCAG CTCCTGAAGC CTTCCCAGGG 2820 

GATGTTTTCT ATCTCCACAG CCGTTTGCTT GAGCGCTCAG CTAAAGTTTC TGATGAACTT 2880 

GGTGGTGGAT CAATTACAGC CCTACCATTT ATCGAGACAC AAGCAGGAGA TATCTCAGCC 2940 

TATATCGCAA CCAACGTGAT TTCTATCACT GATGGACAAA TCTTCCTTGG CGATGGCCTC - 3000 

TTCAATGCAG GTATTCGTCC AGCCATCGAT GCGGGTTCAT CTGTATCTCG TGTAGGTGGT 3060 

TCTGCACAAA TCAAAGCCAT GAAGAAGGTT GCTGGTACAC TTCGTATCGA CCTTGCTTCA 3120 

TACCGTGAGT TGGAAGCCTT TACTAAGTTT GGTTCTGACT TGGACGCAGC AACACAGGCT 3180 

AAGTTGAACC GTGGACGTCG TACCGTTGAG GTCTTGAAAC AACCTGTTCA CAAACCATTA 3240 

CCTGTTGAGA AACAAGTAAC CATTCTTTAT GCTTTGACAC ATGGTTTCTT GGATACTGTT 3300 

CCAGTAGATG ATATTGTTCG TTTCGAGGAA GAGTTCCATG CCTTCTTTGA TGCTCAACAT 33 60 

CCAGAGATTT TGGAAACCAT TCGTGATACA AAAGACTTGC CAGAAGAAGC AGTCTTGGAT 3420 

GCTGCGATTA CAGAGTTTCT CAATCAATCT AGCTTCCAAT AAGAATAGAG GTGTCAGATG 3480 

GCAGTATCTC TAAATGATAT TAAAACAAAA ATCGCCTCAA CAAAAAATAC GAGTCAAATC 3540 

ACTAATGCCA TGCAAATGGT ATCGGCTGCT AAGCTAGGTC GTTCTGAAGA AGCTGCTCGC 3600 

AACTTCCAAG TTTACGCTCA GAAAGTGCGT AAACTTTTGA CAGATATCCT TCATGGTAAT 3660 

GGAGCTGGTG CTTCAACTAA TCCGATGTTG ATTAGCCGTT CTGTGAAGAA GACAGGCTAT 3720 

ATCGTTATCA CTTCAGACCG CGGTTTGGTT GGAGGTTATA ATTCCTCTAT TTTGAAAGCT 3780 

GTTATGGAGT TGAAAGAAGA ATACCACCCA GACGGTAAAG GTTTTGAAAT GATCTGTATC 3640 

GGTGGGATGG GAGCTGATTT CTTTAAGGCT CGCGGTATTC AACCACTTTA TGAATTACGT 3900 

GGCTTGTCAG ACCAACCTAG CTTTGATCAA GTTCGTAAGA TTATTTCAAA AACTGTTGAA 3960 

ATGTACCAAA ATGAACTCTT TGATGAGCTT TATGTTTGCT ACAACCACCA TGTCAATACG 4020 

CTAACCAGTC AAATGCGTGT GGAACAAATG CTTCCGATTG TTGACTTGGA TCCAAATGAA 4080 

GCGGATGAAG AGTACAGCTT GACTTTTGAA TTGGAAACCA GCCGAGAAGA AATTCTGGAG 4140 

CAGTTGTTGC CTCAGTTTGC AGAAAGTATG ATTTACGGTG CCATTATCGA TGCCAAGACA 4200 

GCTGAGAATG CTGCGGGCAT GACAGCCATG CAAACAGCGA CAGATAATGC TAAGAAAGTC 4260 

ATCAATGATT TGACAATTCA GTATAACCGT GCCAGACAGG CGGCGATTAC ACAAGAAATT 4320 

ACAGAAATCG TAGCAGGTGC TAGTGCCTTA GAATAGGCTC TAGTCCAGCT CGTATGAAAA 4380 

TGAACTTAGG ACCTAGTTGA GCTAGGAACC GACAGTATCT TATATAGAAT AGGAGAAGGA 4440 
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GATGAGTTCA GGTAAAATTG CTCAGGTTAT CGGTCCCGTT GTAGACGTTT TGTTTGCACC 4500 

AGGGGAAAAA CTTCCTGAGA TTAACAATGC ACTTGTCGTC TACAAAAATG ACGAAAGAAA 4560 

AACAAAAATC GTCCTTCAAG TAGCCTTGGA GTTAGGAGAT GGTATGGTTC GTACTATCGC 4620 

CATGGAATCA ACAGATGGGT TGACTCGTGG AATGGAAGTA TTGGACACAG GTCGTCCAAT 4680 

CTCTGTACCA GTAGGTAAAG AAACTTTGGG ACCTGTCTTC AACGTTTTGG GAGATACCAT 4740 

TGACTTGGAA GCTCCTTTTA CAGAAGACGC AGAGCGTCAC CCAATTCATA AAAAAGCTCC 4800 

AACTTTTGAT GAGTTGTCTA CCTCTTCTGA AATCCTTGAA ACAGGGATCA AGGTTATTCA 4860 

CCTTCTTGCC CCTTACCTTA AAGGTGGTAA AGTTGGACTT TTCGGTGGTG CCGGAGTTGG 4920 

TAAAACTGTC TTAATCCAAG AATTGATTCA CAACATTGCC CAAGAGCACG GTGGTATTTC 4980 

AGTATTTGCT GGTGTTGGGG AACCTACTCG TGAGGGGAAT GACCTTTACT GGGAAATGAA 5040 

AGAATCAGCC GTTATCGAGA AAACAGCCAT GGTCTTTGGT CAGATGAATG AGCCACCAGG 5100 

AGCACGTATG CGTGTTGCCC TTACTGGTTT GACAATCGCT GAATACTTCC GTGATGTGGA 5160 

AGGCCAAGAC GTGCTTCTCT TTATCGATAA TATCTTCCGT TTCACTCAGG CTGGTTCAGA 5220 

AGTATCTGCC CTTTTGGGTC GTATGCCATC AGCCGTTGGT TACCAACCAA CACTTGCTAC 5280 

GGAAATGGGT CAATTGCAAG AACGTATCAC ATCAACCAAG AAGGGTTCTG TAACCTCTAT 5340 

CCAGGCTATC TATGTGCCAG CGGATGACTA TACTGACCCA GCGCCAGCAA CAGCCTTCGC 5400 

TCACTTGGAT TCAACAACAA ACTTGGAACG TAAGTTGGTA CAATTGGGTA TCTACCCAGC 5460 

CGTTGACCCA CTTGCTTCAA GCTCACGTGC CTTGGCACCT GAAATCGTTG GAGAAGAGCA 5520 

CTATGCAGTT GCTGCTGAAG TAAAACGTGT CCTTCAACGT TACCATGAAT TGCAAGATAT 5580 

CATTGCTATC CTTGGTATGG ATGAGCTTTC TGATGAAGAA AAGACCTTGG TTGCTCGCGC 5640 

CCGTCGTATC CAGTTCTTCT TGTCACAAAA CTTCAACGTT GCGGAACAAT TTACTGGTCA 5700 

GCCAGGTTCT TATGTTCCAG TTGCTGAAAC TGTACGTGGC TTTAAGGAAA TCCTTGATGG 5760 

TAAATACGAC CACTTGCCAG AAGATGCCTT CCGTGGTGTA GGTTCTATCG AAGATGTGAT 5820 

TGCAAAAGCT GAAAAAATGG GATTTTAAGA GGTGATCTAT GGCTCAGTTA ACTGTCCAGA 5880 

TCGTGACACC AGATGGTCTC GTCTATGATC ACCATGCCAG CTATGTATCG GTTCGAACTC 5940 

TGGATGGTGA GATGGGGATC TTGCCACGAC ATGAAAATAT GATTGCGGTT TTAGCAGTTG 6000 

ATGAAGTAAA GGTAAAACGT ATCCATGATA AAGATCACGT GAACTGGATT GCAGTAAACG 6060 

GTGGCGTTAT TGAAATTGCC AATGATATGA TCACAATCGT CGCTGACTCT GCAGAACGTG 6120 

CTCGTGATAT CGATATCAGT CGTGCAGAAC GTGCCAAACT TCGTGCAGAA CGTGCAATTG 6180 

AAGAAGCACA AGACAAACAT TTGATTGACC AAGAACGTCG TGCTAAGATT GCTTTCCAAC 6240 
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GTGCTATTAA CCCTATTAAT GTCGGAAATA GACTATAAGA AAAAATGAAC TTGAAAATAC 6300 

CAAGTTCATT TTTTATGGTG TTTTAAGGAG CAAAACGGAT CCACACTGCT TCGGGAACAT 6360 

GGAAGTCGTT GGAGAGTTCT GCTAGACGAC CATTGTCACA ATTACGTTTA AAGACAGTTG 6420 

CATTGTCAGA GTCTTGATGG ACAACAATGA GAAATTTTTG GTCGGGTGTC AAATCAAAAT 6480 

CACCTGGAGT CTGACCATGC GTTGGAACGA TTTCTAATAA CTCTAAGCTA CCGTCCGCAA 6540 

GGATGGTATA TACTGCGATA GAATCATGGC CACGGTTAGA AGCGTAGAGG TATTTACCGT 6600 

CTTTAGAGAG ATGAATAGCA GCGGTTCCAT TAAAGCCTTC GTAAGCTTCC GGTAAAGTTG 6660 

AAATGACCTG CATACGTTCA AATTCGCCAA CGCCATCGTA GATTAAAACT TCGATAGTAC 6720 

TATTGAGTTC ACAAATGAGA TAAGCGATTT TATAGTGGTT ATGGAAAATG ATATGGCGTG 6780 

AGCCTGCTCC TGGCTTGCTG TGATAGGTAT AGAGCTTAGA TAATTTTCCT TCTTGATCGA 6840 

GGTCATAGGT GATGACTTGG TCAGTTCCCA AGTCGCAGGT CACTAGATAG TGGTCAGGTG 6900 

TTAAATCTGT AT AGTGAAC A TGGGGGGAAG CTTGATTTTC ATGTGGACCT TGGCCACTGT 6960 

GTTGATCCAT ATCACTAAGT AGAAGACTAC CATCTTCCTG GCGTTTATAA ACAAGGACTT 7020 

GTCCCTTGTG ATAGTTAGCT GCGTAAACCA AATCACGCTT TTCATCGACA GCAACATAAC 7080 

AGTGGGGAGC TCCTTCTTCA ACAACATGAT TTAACACAGT CCCCTCAGTT TGATAGGCTG 7140 

CAATTCCCCC CTTATCGTCT TGGCTACCAA CAGTGTATAA ATGTTGGTGC TGGTCAAAGC 7200 

CAAGGTAGGT TGGACTTGGC TCACCTGCAA AAAGTTCTAG ATTTGAAAGC TGACCAGTTT 7260 

CTGTATCAAA GTCTGCCTTG TAAATCCCTT GAGAAGTACG ACGTGTATAA GTTCCAAAAT 7320 

AAACAGTTTC TTTCATTACT ATACCTCTGT GTAAAGATAA GACTATTATA TCACAAAAAC 7380 

AAGTAAATTA AAGATATCCA ATTAGATGTA AGCACTTTAA AAAAGAGTTA TTTTGTTTCA 7440 

AAAATGGTAT AATGAGAGAA CAATAGAAAG GAAGTATTTA TGGAGCAAAA AGAGAAACAT 7500 

TTTAGCCTAT CTTGGTTTTT CAAGTGGTTT TTAGATAACA AGGCAATTAC GGTATTTTTA 7560 

GTAACCTTAT TATTGGGACT GAATCTTTTT ATTTTAAGTA AGATTAGTTT TCTATTTTCA 7620 

CCTGTTTTAG ACTTTTTAGC AGTTGTGATG TTGCCAGTCA TTTTGTCTGG TTTGTTATAT 7680 

TATTTGTTGA ATCCTATTGT TGATTGGATG GAGAAGCATA AGGTTAATCG TGTTATAGCT 7740 

ATCACTATTG TCTTTGTTAT CATCGCTCTC TTTATCATTT GGGGCTTGGC AGTCGCCATT 7800 

CCAAATCTGC AACGTCAGGT TTTGACCTTT GCAAGAAACG TTCCTGTTTA CTTAGAAGAT 7860 

ATAGATAGGA TTGTTAATGG ATTGGTAGCC CAGCACCTGC CAGATGATTT CAGACCTCAA 7920 

TTAGAGCAAG TTTTGACCAA TTTTTCTAGC CAGGCTACAG TTTTGGCAAG TAAGGTTTCA 7980 
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TCTCAGGCAG 


TCAACTGGGT 


GAGTGCCTTT 


ATTAGCGGGG 


CTTCTCAAGT 


GATTGTTGCC 


8040 


TTGATTATCG 


TTCCTTTCAT 


GCTCTTTTAT 


CTCTTGCGTG 


ATGGGAAAGG 


CTTGCGTAAC 


8100 


TATTTGACCC 


AATTCATTCC 


AAGAAAATTG 


AAGGAACCTG 


TTGGACAAGT 


TTTATCAGAT 


8160 


GTGAATCAAC 


AGTTGTCCAA 


CTATGTTCGA 


GGGCAAGTGA 


CAGTGGCTAT 


TATTGTAGCA 


B220 


GTAATGTTTA 


TCATCTTCTT 


CAAGATTATT 


GGTCTACGCT 


ATGCGGTTAC 


GCTGGGGGTT 


8280 


ACTGCTGGTA 


TTTTAAATCT 


GGTCCCTTAT 


CTTCGTAGCT 


TTCTAGCCAT 


GCTTCCTGCT 


8340 


CTAGTATTGG 


GTTTGATTGC 


TGGTCCAGTC 


ATGCTTTTGA 


AAGTAGTGAT 


TGTCTTTATC 


8400 


GTAGAACAAA 


CTATTGAAGG 


CCGTTTTGTC 


TCTCCATTGA 


TTTTGGGAAG 


TCAATTAAAC 


6460 


ATCCACCCTA 


TTAATGTTCT 


cittcttttg 


TTAACTTCAG 


GATCTATGTT 


TGGTATCTGG 


8S20 


GGAGTTTTAC 


TTGGTATTCC 


GGTTTATGCC 


TCTGCTAAGG 


TTGTCATTTC 


AGCCATTTTC 


8580 


GAATGGTATA 


AGGTAGTCAG 


TGGTCTATAT 


GAATTAGAGG 


GTGAGGAAGT 


CAAGAGTGAA 


8640 


CAATAGTCAA 


CAGATGTTAC 


AGGCTTTGGA 


GGAGCAAGAT 


TTAACTAAGG 


CTGAGCATTA 


8700 


TTTCGCCAAA 


GCTTTAGAAA 


ATGATTCAAG 


TGATCTTCTG 


TATGAATTGG 


CAACTTATCT 


8760 


TGAAGGGATT 


GGTTTCTATC 


CTCAGGCCAA 


GGAAATTTAC 


CTGAAAATTG 


TAGAGGATTT 


8820 


TCCAGAGGTT 


CATCTTAATC 


TAGCTGCAAT 


TGCTAGCGAG 


GATGGTCAAA 


TAGAAGAAGC 


8880 


CTTTACCTAT 


CTTGAGGAAA 


TCCAAGCTGA 


CAGTGACTGG 


TATGTCTCGT 


CTTTGGCTCT 


8940 


GAAGGCAGAC 


CTTTACCAGC 


TGGAAGGTTT 


GACAGATGTG 


GCACGTGAGA 


AATTATTGGA 


9000 


GGCCTTGACC 


TACTCAGAGG 


ATTCTCTCTT 


GATATTGGGT 


TTGGCAGAGT 


TGGATAGTGA 


9060 


GTTGGAAAAT 


TACCAAGCGG 


CTATTCAAGC 


CTATGCCCAG 


TTAGATAATC 


GCTCGATTTA 


9120 


TGAGCAAACG 


GGCATTTCCA 


CCTATCAACG 


AATTGGCTTT 


GCCTATGCTC 


AGTTAGGGAA 


9180 


ATTTGAAACG 


GCTACTGAGT 


TTTTAGAAAA 


AGCCCTGGAG 


TTAGAATACG 


ATGACTTAAC 


9240 


AGCTTTTGAG 


TTGGCCAGTC 


TTTATTTTGA 


TCAAGAAGAA 


TATCAAAAAG 


CCACCCTCTA 


9300 


■•HI nnu^AW 


OTTTS A T & f*f* » 
\.l luAlnLLA 


TTTCTCCTGA 


CTTTGAAGGC 


TATGAGTATG 


GGTACAG1 CA 


9360 


GGCTTTACAT 


AAGGAACATC 


AAGTTCAAGA 


AGCCCTGCGT 


ATCGCTAAGC 


AAGGATTAGA 


9420 


GAAAAATCCC 


TTTGAAACTC 


GCCTCTTGCT 


AGCTGCTTCA 


CAATTTTCTT 


ATGAATTGCA 


9480 


TGATGCTAGT 


GGTGCAGAAA 


ATTATCTCCT 


TACTGCAAAA 


GAAGACGCTG 


AGGATACAGA 


9S40 


AGAAATCTTG 


CTTCGTTTAG 


CCACTATTTA 


TCTGGAGCAG 


GAGCGTTATG 


AGGATATTCT 


9600 


AGAATTGCAG 


AGTGAGGAGC 


CAGAAAATCT 


TTTGACCAAG 


TGGATGATTG 


CTCGTTCTTA 


9660 


TCAAGAAATG 


GACGATTTGG 


ATACTGCTTA TGAGTATTAT 


CAAGAGTTGA 


CAGGAGATTT 


9720 


GAAGGACAAT 


CCAGAATTTC 


TGGAACACTA 


TATCTATCTC 


TTGCGTGAAT 


TGGGACATTT 


9780 
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TGAAGAAGCA AAAGTCCATG CTCACACTTA CTTAAAACTG GTTCCAGATG ATGTGCAAAT 9840 

GCAAGAACTG TTTGAGAGAT TGTAAGAATG TTTAACCCAA ATCATTCATA CCTCTCTCAA 9900 

CTAGATGTAA CTTACAAAAC CCCTGACCTC ATGAGCCACT TTCTTCCTCC TCATGAGGTC 9960 

AGTTTTACTT TCTGCTGTTC CAGTATCGTT TTTCCTCGCT AGATTTCCTC AAAAGGGCAG 10020 

ACTCCTCCCT TGGTGCGTCA CACGATTTTT TCATCTCGAC TGTTCTTTAA TGCATCATTA 10080 

ACGACGCTTT TCTTCTAGGT GGTTCATAAG GAACAGGAAG ATTCAGGTTG ACTTTTCTAA 10140 

TCCTAGAATA AAGTGCTGAA AACAATTCGG AATAGGCATA GAGACTAGAC AATTTGAGGA 10200 

GCTGCTTGCG TCCTGTTCGA ACACATTTTC CCACCACGTG AAGAAAAAGA TGGCGGAAGC 10260 

GTTTGATTGT TAAAGTTTGG AAGTCACCTC CAGCTAGATG TTTGAGAAAA AGATAGAGAT 10320 

TGTAGGCGAT ACAGCTCATC ATCATACGAA TTCGTTTTTG ATTAAGGTTG AACTATCCGT 10380 

TTTATCGCCA AAAAATCGG 103 99 

C 2 1 INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9409 base pairs 
<B> TYPE: nucleic acid 
(C) STRAND EDNESS : double 
(O) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

GATAAGATTA AGTTAGAAAA GAAAGAACTA GGACATATCT ACCAGATTCA CGTTTTTAAT 60 

AGCTATGGGC AGGAAGAAAT CTATCGTGTG ATTTTGATGG AGACCAATAT TAGTTCGGTT 120 

TCAACCAATA TCAAGTATGC TGCTGTCTTG ATTAATACCA GTCAGTTGGA ACAGGCTAGT 180 

CAAAAGCATG AGCAATTGAT TGTGGTCGTG ATGGCTAGTT TCTGGATTTT G TCTTTACTT 240 

GCCAGTCTCT ATCTAGCTAG GGTCAGTGTT AGGCCCCTGC TTGAGAGTAT GCAGAAGCAA 300 

CAGTCTTTTG TGGAAAATGC CAGTCATGAG TTACGAACTC CACTCGCAGT TTTGCAAAAT 360 

CGCTTAGAGA CCCTTTTTCG TAAGCCAGAA GCTACCATTA TGGATGTGAG CGAAAGCATT 420 

GCATCGAGTT TGGAAGAAGT CCGAAATATG CGTTTTTTAA CGACAAGCTT GCTGAACTTA 480 

GCTCGGAGAG ATGATGGGAT TAAGCCGGAG CTTGCAGAAG TTCCAACTAG CTTTTTTAAT 540 

ACAACTTTCA CAAACTACGA GATGATTGCT TCGGAAAATA ATCGTGTCTT CCGTTTTGAA 600 

AATCGTATCC ATCGAACAAT TGTCACAGAT CAGCTTCTTC TGAAACAACT GATGACCATT 660 

CTTTTCGATA ATGCCGTCAA GTATACTGAG GAGGATGGTG AAATTGATTT TCTTATCTCG 720 
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GCGACCGATC GCAATCTTTA TTTACTTGTT TCTGATAATG GAATCGGTAT 


TTCGACAGAA 


780 


GATAAAAAGA AAATTTTTGA CCGTTTTTAT CGAGTAGACA AGGCTAGAAC 


CCGGCAAAAA 


640 


GGTGCTTTTG GTTTAGGATT ATCCCTAGCC AAGCAAATTG TAGATGCTCT 


AAAAGGAACT 


900 


GTTACTGTCA AAGATAATAA ACCCAAGGGA ACAATCTTTG AAGTGAAGAT 


TGCCATTCAG 


960 


ACACCATCTA AAAAGAAAAA ATAAAAATAT CGCTCCAATT GGGGCGATAT 


TTTGGATTTA 


1020 


TCTTCTACGT TTTCGTTTGA TAATACACCC TTGAACTTTT AAAACAAGTA AGCTGAATCC 


1080 


GATTGCTGCG GCAAAGGCAA GAGCAGTTGA TAATTTTAAT GCTAAAAAGA 


TAAAACTAAA 


1140 


GATAGCAATA CAGATACAAA AAACAGCGAT ATTAATAAAA AATAGGATTT 


CCTTGAGATT 


1200 


GGCATCAGAT TGCGCTTCAG GTGTATAAGC TTGGTAATGA GGAAGCTGCT 


GGTTTAATTC 


1260 


TTCTTGATAG TCTACCTCAT AGGATTGTAA TTTTCTTACG GGCATGATTC 


TCTCCTTAAC 


1320 


AGTACATACC TATTTTATCA TTTTTTCGGC AGAGAATTAT TACAGAAAGG 


TTACAAAAAG 


1380 


AATAAAGTCC CTTTTCATTT TCAAAGCATG GCTGATTTTG GAGAAATGTG 


GTATAATTTT 


1440 


TCTTATGGAA AAGATTGTCA TTACAGCAAC TGCTGAAAGT ATTGAACAAC 


TTGAACAACT 


1500 


ACTCGAAGCT GGCGTAGACC GTATCTATGT CGGTGAGAAA GATTTTGGTC 


TTCGTCTGCC 


1560 


AACGACCTTT AGTTATGACC AATTACGTGA AATCGCTAAG TTGGTTCATG 


ATCCTGGTAA 


1620 


GGAATTGATC GTTGCGGTCA ATGCTCTCAT GCACCAAGAT ATGATGGACC 


GTATCAAGCC 


1680 


TTTCTTAAAC TTCTTGGAAG AAATCAAGAC AGACTATATT ACGATTGGGG 


ATGCAGGCGT 

mm • ^m*^m>m m^mf^m^^mt^m' « 


1740 


CTTTTACGTA GTTAACCGCG ATGGTTATTC ATTTAAGACC ATCTACGATG 


CTTCAACCAT 


1800 


GGTAACTAGC AGTCGTCAGA TTAACTTCTG GGGACAAAAG GCTGGCGCAT 


CTGAGGCTGT 


1860 


TTTGGCGCGT GAAATTCCAT CAGCTGAACT TTTCAAAATG CCAGAGATTT 


TGGAAATTCC 


1920 


TGCTGAAGTT TTGGTTTACG GTGCTAGCGT CATCCATCAT TCTAAACGTC 


CACTCTTGCA 


1980 


AAACTACTAT AACTTTACAC ATATCGATGA TGAAAAGACG CATAAACGTG 


ACCTCTTCTT 


2040 


GGCTGAGCCA AGTGATCCAG AGAGCCACTA TTCCATTTTT GAAGATAATC 


ATGGGACCCA 


2100 


TATCTTTGCC AACAATGACC TTGATTTGAT GATCAAATTA ACAGAATTGG 


TGGAGCATGG 


2160 


CTTTACTCGC TGGAAACTAG AAGGGCTCTA CACTCCTGGT CAGAACTTTG 


TTGAGATTGC 


2220 


AAAACTCTTT ATCCAAGCGC GTAGCTTGAT TCAAGAGGGC AACTTTAGTC 


ATGCTCAAGC 


2280 


CTTCTTGCTG GATGAAGAAG TTCGTAAACT TCACCCTAAA AACOGTTTCC 


TTGATACAGG 


2340 


ATTTTATGAC TACGATCCTG ACATGGTTAG ATAAAATACA TGATTCGTTG 


AGAGAAGGAA 


2400 


GATGCAAACA TTTCTTCTCT CAATTTTTCG TATTTCTTCA CTATTTTACA 


AAAATCAGCA 


2460 


GGCTAGAATG CTCTATTCGA TGGGATTTTT AAGAAAAGTA GTGTTCTTGA 


GTTTGAAAAT 


2520 
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TATCCTATGT TTGCAGGTGC CAAATGGCCC TTTTTTTGGT ATAATTTTTT ATAATGAAAA 2580 

CGATTGGTAA TCGCTATGTT GTGGTGGATT TAGAGGCAAC TAGCACAGGT AGTAAGGCTA 2640 

AAATTATCCA AGTGGGAATT GTCGTGATTG AGGACGGAGA AATCGTCGAT CACTATACGA 2700 

CGGATGTCAA TCCACATGAA CCCTTGGATG CTCATATCAA AGAACTGACA GGATTGACAG 2760 

ACCAACGTCT GGCGCAAGCA CCTGATTTTT CGCAAGTTCC CAGAAAAATA TTTGACTTGG 2820 

TGGAGGATGG GATTTTTGTA GCCCATAATG TTCAGTTTGA TGCTAATCTC TTGGCGGAAA 2880 

ATTTATTTTT TGAAGGCTAT GAGCTAAGAA ACCCTCGTGT TGATACGGTC GAATTGGCCC 2940 

AGGTCTTTTT CCCTGAACTG GAAAAATATA GCTTGCCGAT TTTGTGTCCA GAATTAGGAA 3000 

TTCCTCTTAA ACACGCACAC ACAGCCCTTT CAGATGCCCA AGCTACAGCA GAATTACTTC 3060 

TTTTTTTACG GAAAAAGATG ACCCAGCTTC CTAAAGGTCT CTTGGAACGC TTGCTGGAAA 3120 

TGGCTGACGC TCTCCTATAT GAGTCCTACC TGGTTATTGA GGAAACTTAT CGCAACCAAT 3180 

CTATCCTGAG TTCTCCAGAC TTGGTCCAAG TTCAAGGTCT ATATTTTAAG AAAACGGAAG 3240 

CTTCTCTGGA GCCACGAAAA CTATCTCAAG ACTTTTCTAA AAATATTTCT CTGTTGAACC 3300 

TTGAAGTGAG GGAGGAACAA GAAAGTTTTG CTAAAGAGGT TGGCTTGCTA TTGAAAGATG 3360 

AACCTGTCTC TCTGATTCAA GCGCCGACAG GGATTGGGAA AACCTATGGC TATCTCTTAC 3420 

CCGCTTTATC TCAATCCAAA GAGCGACAAA TTGTTCTTAG TGTTCCGACA AAGATTCTTC 3480 

AAAATCAAAT CATGGAAGAA GAAGGTAAAC GCCTCAAGGA AGTGTTCCAT ACAGATATTC 3540 

ATAGCTTAAA GGGACCACAA AATTATCTGA AGTTGGATGC CTTTTATCAT TCCTTGCAGG 3600 

AAAATGATGA AAATCGCTTA TTTAGACGCT TTAAAATGCA AGTCTTGGTC TGGCTTACTG 3660 

AGACAGAGAC AGGAGATTTG GATGAAATCG GGCAACTCTA CCGTTACCAA CATTTTCTAG 3720 

CAGACCTTCG TCATGATGGG AATTTATCAT CCCAGAGCTT ATTTGTGACG GAAGATTTTT 3780 

GGAAACGTAG TCAAGAAAGG GCAGAGACTT GCAAGCTTTT AGTGACTAAT CATGCCTATC 3840 

TCGTAACCAG ACTTGAAGAT AATCCTGAAT TTGTCAGTGA CCGTTTACTG ATTATTGATG 3900 

AAGTCCAAAA GATTTTGTTA GCTCTAGAAA ATCTGCTTCA AGAGACCTAC GATATACAAT 3960 

CTATTATCGA TTTAATTGAT AAGGCTTTAG TAGGAGAAGA AAACAGGGTT CAACAACGGA 4020 

TACTAGAAAG TATTCGCTTT GAGTGTCTCT ACTTGATAGA ACAATTTCAG TCTGGCAAAT 4080 

CTAGGAAAAA TATCTTAGAT TCTCTGGACA ATCTCCATCA GTATTTTTCA GAATTGGAAG 4140 

TAGAAGACTT TGATGAGCTG GTTCGCTATT TTACAGCTGA AGGTGATTAC TGGCTTGAAG 4200 

TAACTGAAAC GAGTCAAAAG AAAATTCAGA TTTCTTCTAC AAAATCAGGC CGTACTCTTC 4260 
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TGTCCTCTTT 


ACTTCCTGAG 


1 AGTTGCCAAG 


TCTTGGGAGT 


ATCGCCTACT 


CTTGAGATTA 


4320 


GTCAGAGGGT 


TTcrrrGGCA 


> GACCTTTTAG 


GCTATCCTGA 


AGCTAAATTT 


GTCAAGATTG 


4380 


AATCTCGGGG 


AAAACAGGAA 


CAAGAAGTGG 


TCATGGTCAA 


AGATTTCCCT 


CTGGTAACAG 


4440 


AAACCTCCTT 


AGAAGTCTAT 


GCCAGAGAGG 


TAGCTGCTTT 


ACTAGTGGAA 


ATTCAAGCTT 


4S00 


TCCAGCAACC 


GATTTTGGTT 


CTCTTTACCG 


CTAAAGACAT 


GCTTCTAGCA 


GTATCGGATT 


4560 


TACTTACAGT 


TAGCCACTTG 


GCCCAGTATA 


AAAATGGGGA 


TGTTCATCAG 


CTAAAGAAAC 


4620 


GCTTTGAAAA 


AGGTGAACAA 


CAAATCTTGC 


TTGGTGCAGC 


AAGTTTCTGG 


GAGGGAGTTG 


4680 


ATTTTTCAAG 


CCATCCTTCT 


GTGATTCAAG 


TTGTACCGAG 


GCTTCCTTTC 


CAAAATCCTC 


4740 


AAGAACCCTT 


GACGAAAAAG 


ATTAATCAAG 


AACTGAATCA 


AGAAGGGAAA 


AATGCCTTTT 


4800 


ATGATTATCA 


ATTGCCAATG 


GCCATTATTC 


GTTTAAAACA 


GGCTTTGCGA 


AGAAGTATGA 


4860 


GACGTGAATA 


CCAACGTTCC 


TTAACTCTTA 


TTTTGGATAG 


GAGAATCGTC 


GGAAAACGAT 


4920 


ACGGCAAACA 


AATAGTAGCA 


TCTCTAGCAG 


AAGAAGCGAC 


TGTTAAAACC 


ATCTCTCGAT 


4980 


CCGAAGTTGA 


CGAGGCTATT 


GATAGATTTT 


TTAATGAGCT 


TTGATAAATA 


GTATTGTATG 


S04O 


AAAGTATAAG 


GTTAGTATAT 


ATGAAACGTT 


CTCTCGACTC 


AAGAGTCCAT 


TACAGTTTGC 


5100 


TCTTGCCAGT 


ATTTTTTCTA 


CTGGTCATCG 


GTGTGGTGGC 


TATCTATATA 


GCCGTTAGTC 


5160 


ATGATTATCC 


CAATAATATT 


CTGCCCATTT 


TAGGGCAGCA 


GGTCGCCTGG 


ATTCCCTTGG 


5220 


GGCTTGTGAT 


TGGTTTTGTG 


GTCATGCTCT 


TTAATACAGA 


ATTTCTTTGG 


AAGGTGACCC 


5280 


CCTTTCTATA 


TATTTTAGGC 


TTGGGACTTA 


TGATCTTGCC 


GATTGTATTT 


TATAATCCAA 


5340 


GCTTAGTTGC 


ATCAACGGGT 


GCCAAAAACT 


GGGTATCAAT 


AAATGCAATT 


ACCCTATTCC 


5400 


AACCGTCAGA 


ATTTATGAAG 


ATATCCTATA 


TCCTCATGTT 


GCCTCGTGTC 


ATTGTCCAAT 


5460 


TTACAAAGAA 


ACATAAGGAA 


TGGAGACGCA 


CGGTTCCGCT 


GGACTTTTTG 


TTAATTTTCT 


5520 


GGATGATTCT 


CTTTACCATT 


CCAGTCCTAG 


TTCTTTTAGC 


ACTTCAAAGT 


GACTTGGGGA 


5580 


CGGCTTTGGT 


ill i\> i s\l»\~v. 




ijAATtGTTTT 


ATTATCAGGG 


GTTTCTTGCA 


5640 


AAATTATTAT 


CCCAGTATTT 


GTGACTGCTG 


TAACAGGAGT 


TGCTGGTTTC 


TTAGCTATCT 


1.700 


TTATTAGCAA 


GGACGGACGA 


GCTTTTCTTC 


ACCAGATTGG 


AATGCCGACC 


TACCAAATTA 


5760 


ATCGGATTTT 


GGCTTGGCTC 


AATCCCTTTG 


AGTTTGCCCA 


AACAACGACT 


TACCAGCAGG 


5820 


CTCAAGGGCA 


GATTGCCATT 


GGGAGTGGTG 


GCTTATTTGG 


TCAGGGATTT 


AATGCTTCGA 


5880 


ATCTGCTTAT 


CCCAGTTCGA 


GAGTCAGATA 


TGATTTTTAC 


GGTTATTGCA 


GAAGATTTTG 


5940 


GCTTTATTGC 


CTCTGTCCTG 


GTTATTGCCC 


TCTATCTCAT 


GTTGATTTAC 


CGTATGTTGA 


6000 


AGATTACTCT 


TAAATCAAAT 


AACCAGTTCT 


ACACTTATAT 


TTCCACAGGT 


TTGATTATGA 


6060 
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TGTTGCTCTT CCACATCTTT GAGAATATCG CTCCTGTGAC TGGACTACTT CCTTTGACGG 6120 

GGATTCCCTT GCCTTTCATT TCGCAAGGGG GATCAGCTAT TATCAGTAAT CTGATTGGTG 6180 

TTGGTTTGCT TTTATCGATG AGTTACCAGA CTAATCTAGC TGAAGAAAAG AGCGGAAAAG 6240 

TCCCATTCAA ACGGAAAAAG GTTGTATTAA AACAAATTAA ATAAGGAGAA AATCATGGTA 6300 

AAAGTAGCAG TTATATTAGC TCAGGGCTTT GAAGAAATTG AAGCCTTGAC AGTTGTAGAT 6360 

GTCTTGCGTC GAGCCAATAT CACATGTGAT ATGGTTGGTT TTGAAGAGCA AGTAACGGGT 6420 

TCGCATGCAA TCCAAGTAAG AGCAGATCAT GTCTTTGATG GAGATTTATC AGACTATGAT 6480 

ATGATTGTTC TTCCTGGAGG TATGCCTGGT TCTGCACATT TACGTGATAA TCAGACCTTG 6540 

ATTCAAGAAT TGCAAAGCTT CGAGCAAGAA GGGAAGAAAC TAGCAGCCAT TTGTGCGGCA 6600 

CCAATTGCCC TCAATCAAGC AGAGATATTG AAAAATAAGC GATACACTTG TTATGACGGC 6660 

GTTCAAGAGC AAATCCTTGA TGGTCACTAC GTCAAGGAAA CAGTAGTGGT AGATGGTCAG 6720 

TTGACAACCA GTCGGGGTCC TTCAACAGCC CTTGCCTTTG CCTACGAGTT GGTGGAGCAA 6780 

CTAGGAGGGG ACGCAGAGAG TTTACGAACA GGAATGCTCT ATCGAGATGT CTTTGGTAAA 6840 

AATCAGTAAA ACGCGAGTTA TTCTCTCGTT TTTTATGTGG AAAACTCAGG GAAATCATCG 6900 

CTTTTTTCAT AAAAAAATGC TATAATGAAG GGTATGAAAT ATCACGATTA CATCTGGGAT 6960 

TTAGGTGGAA CTTTACTGGA TAATTATGAA ACTTCAACAG CTGCATTTGT TGAAACATTG 7020 

GCACTGTATG GTATCACACA AGACCATGAC AGTGTCTATC AAGCTTTAAA GGTTTCTACT 7080 

CCTTTTGCGA TTGAGACATT CGCTCCCAAT TTAGAGAATT TTTTAGAAAA GTACAAGGAA 7140 

AATGAAGCCA GAGAGCTTGA ACACCCGATT TTATTTGAAG GAGTTTCTGA CCTATTGGAA 7200 

GACATTTCAA ATCAAGGTGG CCGTCATTTT TTGGTCTCTC ATCGAAATGA TCAGGTTTTG 7260 

GAAATTTTAG AAAAAACCTC TATAGCAGCT TATTTTACAG AAGTGGTGAC TTCTAGCTCA 7320 

GGCTTTAAGA GAAAGCCAAA TCCCGAATCC ATGCTTTATT TAAGAGAAAA GTATCAGATT 7380 

AGCTCTGGTC TTGTCATTGG TGATCGGCCG ATTGATATCG AAGCAGGTCA AGCTGCAGGA 7440 

CTTGATACCC ACTTGTTTAC CAGTATCGTG AATTTAAGAC AAGTATTAGA CATATAAGAA 7500 

AAAGGAATAA GATGACAGAA GAAATCAAAA ATCTGCAGGC ACAGGATTAT GATGCCAGTC 7560 

AAATTCAAGT TTTAGAGGGC TTAGAGGCTG TTCGTATGCG TCCAGGGATG TACATTGGAT 7620 

CAACCTCAAA AGAAGGTCTT CACCATCTAG TCTGGGAAAT TGTTGATAAC TCAATTGACG 7680 

AGGCCTTGGC AGGATTTGCC AGCCATATTC AAGTTTTTAT TGAGCCAGAT GATTCGATTA 7740 

CTGTTGTGGA TGATGGGCGT GGTATCCCAG TCGATATTCA GGAAAAAACA GGCCGTCCTG 7800 
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CTGTTGAGAC 


CGTCTTTACA 


GTCCTTCACG 


CTGGAGGAAA 


GTTCGGCGGT 


GGTGGATACA 


7860 


AGGTTTCAGG 


TGGTCTTCAC 


GGGGTGGGGT 


CGTCAGTAGT 


TAATGCCCTT 


TCCACTCAAT 


7920 


TAGACGTTCA 


TGTTCACAAA 


AATGGTAAGA 


TTCATTACCA 


AGAATACCGT 


CGTGGTCATG 


7980 


TTGTCGCAGA 


TCTTGAAATA 


GTTGGAGATA 


CGGATAAAAC 


AGGAACAACT 


GTTCACTTCA 


8040 


CACCGGACCC 


AAAAATCTTC 


ACTGAAACAA 


CAATCTTTGA 


TTTTGATAAA 


TTAAATAAAC 


6100 


GGATTCAACA 


GTTGGCCTTT 


CTAAATCGCG 


GTCTTCAAAT 


TTCAATTACA 


GATAAGOGCC 


8160 


AAGGTTTGGA 


ACAAACCAAG 


CATTATCATT 


ATGAAGGTGG 


GATTGCTAGT 


TACGTTGAAT 


8220 


ATATCAACGA 


GAACAAGGAT 


GTAATCTTTG 


ATACACCAAT 


CTATACAGAC 


GGTGAGATGG 


8280 


ATGATATCAC 


AGTTGAGGTA 


GCCATGCAGT 


ACACAACTGG 


TTACCATGAA 


AATGTCATGA 


8340 


GTTTCGCCAA 


TAATATTCAT 


ACCCATGAAG 


GTGGAACACA 


TGAACAAGGT 


TTCCGTACAG 


8400 


CCTTGACACG 


TGTTATCAAC 


GATTATGCTC 


GTAAAAATAA 


GTTACTGAAA 


GACAATGAAG 


8460 


ATAATTTAAC 


AGGGGAAGAT 


GTTCGCGAAG 


GCTTAACTGC 


AGTTATCTCA 


GTTAAACACC 


8520 


CAAATCCACA 


GTTTGAAGGA 


CAAACCAAGA 


CCAAATTGGG 


AAATAGCGAA 


GTGGTCAAGA 


8580 


TTACCAATCG 


CCTCTTCAGT 


GAAGCTTTCT 


CCGATTTCCT 


CATGGAAAAT 


CCACAGATTG 


8640 


CCAAACGTAT 


CGTAGAAAAA 


GGAATTTTGG 


CTGCCAAGGC 


TCGTGTGGCT 


GCCAAGCGTG 


8700 


CGCGTGAAGT 


CACACGTAAA 


AAATCTGGTT 


TGGAAATTTC 


CAACCTTCCA 


GGGAAACTAG 


8760 


CAGACTGTTC 


TTCTAATAAC 


CCTGCTGAAA 


CAGAACTCTT 


CATCGTCGAA 


GGAGACTCAG 


8820 


CTGGTGGATC 


AGCCAAATCT 


GGTCGTAACC 


GTGAGTTTCA 


GGCTATCCTT 


CCAATTCGCG 


8880 


GTAAGATTTT 


GAACGTTGAA 


AAAGCAAGTA 


TGGATAAGAT 


TCTAGCCAAC 


GAAGAAATTC 


8940 


GTAGTCTTTT 


CACAGCCATG 


GGAACAGGAT 


TTGGCGCAGA 


ATTTGATGTT 


TCGAAAGCCC 


9000 


GTTACCAAAA 


ACTCGTTTTG 


ATGACCGATG 


CCGATGTCGA 


TGGAGCCCAC 




7UOU 


TTCTTTTAAC 


CTTGATTTAT 


CGTTATATGA AACCAATCCT 


AGAAGCTGGT 


TATGTTTATA 


9120 


TTGCCCAACC 


ACCAATCTAT 


GGTGTCAAGG 


TTGGAAGCGA 


GATTAAAGAA 


TATATCCAGC 


9180 


CGGGTGCAGA 


TCAAGAAATC 


AAACTCCAAG 


AAGCTTTAGC 


CCGTTATAGT 


GAAGGTCGTA 


9240 


CCAAACCGAC 


TATTCAGCGT 


TATAAGGGGC 


TAGGTGAAAT 


GGACGATCAT 


CAGCTGTGGG 


9300 


AAACAACCAT 


GGATCCCGAA 


CATCGCTTGA TGGCTAGAGT TTCTGTAGAT 


GATGTGCAGA 


9360 


AGCAGATAAA ATCTTTGATA 


TGTTGATGGG 


GATCGAGTTG TCCTCGTCG 




9409 


(2) INFORMATION FOR SEQ ID NO: 162: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6415 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 
ID> TOPOLOGY: linear 





SEQUENCE DESCRIPTION: 


cert Tt\ \tr\ » 


1 C *1 - 

lo2 : 






v»\- 1 VAMnnnu 


TCTTCAAAAT 


TATGATAGAA 


I\>U I\^GAAGG 


ft ft ft ■ ft Mtott^^ * 

aaaaattcag 


gagagtagta 


60 


HTnAf^TPA ft ft 


ATGTTGAAAG 


TCTTCTCGTA 


TCCATTGTAA 


ffft^^ ft ^4ftt>^4^ft Mb & 

tcagtgcata 


CAATGAAGAA 


120 


ft A ATATrTYtr* 


CTGGTCTAAT 


TGAAGACTTA 


fc ■ ft « ft Jlft^t ft ft ft 

AAAAATCAAA 


cctatcctaa 


AGAGGATATT 


180 


Vj AAA I I \_ 1 AT 


TTATAAATGC 


TATGTCCACA 


GATGGGACCA 


cagctatcat 


TCAGCAATTT 


240 


A 1 nAAuuAAu 


ATACAGAGTT 


TAACTCAATT 


ft ft Jft<*| JhjBt & «k fc 

AGATTGTATA 


acaatcctaa 


GAAAAATCAA 


300 




TTAACCTGGG 


AGTTAAACAT 


TCTGTAGGGG 


accttatttt 


AAAAATTGAT 


360 


trt. 1 UMTLAA 


AAGTTACTGA 


GACTTTTGTA 


ATGAACAATG 


tggctattat 


TCAACAAGGT 


420 


IjAATTTGTCT 


GTGGGGGGCC 


TAGACCGACG 


ATTGTCGAAG 


GAAAAGGAAA 


ATGGGCAGAG 


480 


ALL i IajCATC 


TTGTTGAGGA 


AAATATGTTT 


GGCAGTAGCA 


ttgccaatta 


TCGAAATAGT 


540 


TCTGAGGATA 


GATATGTTTC 


TTCTATTTTT 


CATGGAATGT 


ataaacgaga 


GGTTTTCCAG 


600 


AAGGTTGGTT 


TAGTAAATGA 


GCAACTTGGC 


CGAACTGAAG 


ataatgatat 


TCATTATAGA 


660 


ATTCGAGAAT 


ATGGTTATAA AATCCGCTAT 


AGCCCAAGTA 


ttctatctta 


TCAGTATATT 


720 


CGACCAACAT 


TCAAGAAAAT 


GCTGCATCAA 


AAGTATTCAA 


atggtttgtg 


GATTGGCTTG 


780 


ACAAGTCATG 


TTCAGTTTAA 


GTGTTTATCA 


TTATtTCACT 


atgttccttg 


TTTATTTGTT 


840 


1 1 wvjTV- 1 Tu 


TGTTTAGTCT 


AGCATTGTTA 


CCGATCACAT 


tcgtattcat 


AACTTTACTA 


900 


I rAWSTGCCT 


ATTTTCTACT 


TTTGTCATTA 


ctcactttgc 


tgactttatt 


AAAACATAAA 


960 


AATGGATTTC 


TAATTGTGAT 


GCCCTTTATT 


ttattttcca 


ttcactttcc 


TTATGGCCTT 


1020 


GGGACGATTG 


TAGGTTTAAT 


TAGAGGATTT 


aaatggaaga 


aggagtacaa 


GAGAACAATA 


1080 


ATTTATTTGG 


ATAAAATAAG 


CCAAATAAAT 


caaaatatgc 


tataataaca 


ATATAGTAAA 


1140 


ACTCTTTTAA 


GGAGGAGTAG 


ATTTCTATGA 


ataaaaaact 


aacagattat 


GTGATTGATC 


1200 


TGGTGGAAAT 


TTTAAATAAA 


CAACAAAAGC 


aggttttctg 


gggaatattt 


GATATTTTCA 


1260 


GTATGGTGGT 


TTCCATCATT 


GTATCTTATA 


ttttatttta 


tgggctgatt 


AATCCAGCAC 


1320 


CTGTTGACTA 


CATTATCTAT 


ACGAGTTTGG 


ccttcctgtt 


CTATCAATTG 


ATGATtGGTT 


1380 


TTTGGGGGTT 


GAACGCGAGC 


ATTAGTCGTT 


acagcaagat 


tacggatttc 


ATGAAAATCT 


1440 


TTTTTGGTGT 


GACTGCTAGC 


AGTGTCTTGT 


catatagtat 


ctgttatgcc 


TTCTTGCCAC 


1500 


TCTTCTCCAT 


CCGTTTCATC 


ATTCTCTTTA 


tcttgttgag 


taccttcttg 


ATTTTATTGC 


1S60 
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CACGGATTAC TTGGCAGTTA ATCTACTCCA GACGCAAAAA AGGTAGTGGT GATGGAGAAC 
ACCGTCGGAC CTTCTTGATT GGTGCCGGTG ATGGTGGGCC TCTTTTTATG GATAGTTACC 
AACATCCAAC CACTGAATTA GAACTGCTCG CTATTTTGGA TAAGGATTCT AAGAAAAAGG 
GTCAAAAACT TGGTGGTATT CCTGTTTTGG GCTCTTATGA CAATCTGCCT GAATTACCCA 



AACGCCATCA 
AGCGTATCTT 
AAACTGTTGT 
ACCTTTTGGG 
GTAAGACCAT 
TTAGTCGCTT 



AATCGAGCGT GTCATCGTTG CGATTCCGTC GCTGGATCCC TCAGAATATG 
GCAGATGTGT AATAAGCTGG GTGTCAAATG TTACAAGATG CCTAAGGTTC 
TCAGGGCCTT CACCAAGCAG GTACTGGCTT CCAAAAAATT GATATTACGG 
TCGTCAGGAA ATCCGTCTTG ACGAATCGCG TCTGGGTGCA GAACTGACAG 
CTTAGTCACA GGAGCTGGAG GTTCAATCGG TTCTGAAATC TGTCGTCAAG 
CAATCCTGAA CGCATTGTCT TGCTCGGTCA TGGGGAAAAC TCAATCTACC 



TTGTTTATCA TGAATTGATT CGTAAGTTCC AAGGGATTGA TTATGTACCT GTGATTGCGG 
ACATTCAAGA CTATGATCGT TTGTTGCAAG TCTTTGAGCA GTACAAACCT GCTATTGTTT 
ATCATGCGGC AGCCCACAAG CATGTTCCTA TGATGGAGCG CAATCCAAAA GAAGCCTTCA 
AAAACAATAT CCGTGGAACT TACAATGTTG CTAAGGCTGT TGATGAAGCT AAAGTGTCTA 
AGATGGTTAT GATTTCGACA GATAAGGCAG TCAATCCACC AAATGTTATG GGAGCAACCA 
AGCGCGTGGC GGAGTTGATT GTCACTGGCT TTAACCAACG TAGCCAATCA ACCTACTGTG 
CAGTTCGTTT TGGGAATGTT CTTGGTAGCC GTGGTAGTGT CATTCCAGTC TTTGAACCTC 
AGATTGCTCA AGGTGGGCCT GTAACGGTGA CAGACTTCCG TATGACCCGT TACTTTATGA 
CCATTCCAGA AGCTAGCCGT CTGGTTATCC ATGCTGGTGC TTATGCCAAA GATGGGGAAG 
TCTTTATCCT TGATATGGGC AAACCAGTCA AGATTTATGA CTTGGCCAAG AAGATGGTGC 
TTCTAAGTGG CCACACTGAA AGTGAAATTC CAATCGTTGA AGTTGGAATC CGCCCAGGTG 
AAAAACTCTA CGAAGAACTC TTGGTATCAA CCGAACTCGT TGATAATCAA GTTATGGATA 
AGATTTTCGT TGGTAAGGTT AATGTCATGC CTTTAGAATC CATCAATCAA AAGATTGGAG 
AGTTCCGCAC TCTCAGTGGA GATGAGTTGA AGCAAGCTAT TATCGCCTTT GCTAATCAAA 
CAACCCACAT TGAATAAAAA AGAAAAACGC ATAGTATCAA GTTACACAAC CTTGGTAATA 
TGCGTTTTAT TATGTAGAGA CTTATACTCT TCGAAAATCT CTTCAAACCA CGTCAACGTC 
GCCTTGCCGT ATATGGTTAC TGACTtCGTC ACTTCTATCC ACAACCTCAA AACAGTGTTT 
TGAGytGACT TCGTCAGTTC TATCCACAAC CTCAAAACAG TGTTTTGAGc TGACtTCGTC 
AGTTCTATCC ACAACCTCAA AACAGTGTTT TGAGCTGAcT TCGTCAGTTC CATCCACAAC 
CTTAAAACAG TGTTTTGAGy TGACnTTCGT CAGTTCCATC TACAACCTTA AAACAGTGTT 



1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

.3000 

3060 

3120 

3180 

3240 

3300 

3360 
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TTGAGCTGCC 




i l c 1 nu III 


M. 1L1 1 fuAl 


11 1 \_ATTGAG 


TATTACTTCA 


3420 


TTTTCTTCTG 


AAATGfSAATT 


1 f?*PT AOfT* A fi*P 
ul 1 ALLLAu t 


LIA ITjL 1 I 


/*• ft A Ii kin> oo o 

GaaAATACGC 


^* ft ft ft ft ^^m#H^VK 

CAAAACTTCT 


3480 


AAGGGTTTGT 


G AfSfVSA T A T A 


Al LAuvl 1 VjA 


, TAGTTTAGTA 


GATCTGCTTG 


CTCTCCAAAT 


3540 


CCCCAAGTCA 


iuw.UU\ 111 


Cv|v * ftf1**^*f*n* 


i»i~i"i"\."i"CGAG 


^*W^F^i^ft^ft^^ ft ^ft^ft ft JBfe 

CTCCCAGCAT 


ATCAAACTTG 


3600 


GTATCTCCGA 


TTI ATTI ATVT!/'!/* 1 
1 un lun 


I 1 V» 1 I\. I\A»J 


OtTT AL7TTGAT 


GTGTCTGCAA 


GGCTTGGTGA 


3660 


ATGACATCTG 


1 X A 1 LtVAAv 


1 vrL. 1 1 A_AJLfOv> 


CTAGAACCAT 


AAATGCCATC 


AAAGAAATGA 


3720 


TGGATTTCCA 


ftui 1 1 1 I lov. 


UA IVl^lTGA 


GCAGTAGaTG 


TATCCTTTGT 


CGTGGTGATG 


3780 




A i/T^VTWH 

AAfJTtAJTVliA 


TAACTCCTCA 


AG CAAGTCT A 


TAATCTGAGG 


AAAGAGTTGA 


3840 


v>Vr x j. \» A 1 nun 




CTTATAGTAA 


GAACGATATA 


TCTGCACGGC 


TTCAGAAATT 


3900 


luu IV. 1 1 


AUAGGCAGGT 


^^^m ^% ft m ft ft 

CGCAAAACTA 


CTTTCGAGAG 


GTGGTCCCAT 


AAAACCACGA 


3960 


/* 1 r\\j 1111 


CATCAGGGCT 


AGGCACCCCC 


AGCTCTn'AA 


AGGTATAGGT 


AAAGGCATTG 


4020 


traa TCfi^n a 

1 1 1>\_V»UA 


rAGAACTATC 


AACGAGGGTT 


^^^^ ft iVl^^^V ft ft ft iBfc 

CCATCCAaAT 


CGAAAAAAAT 


CGCTGTGATA 


4080 


A/V**TV*A*1V*/* 


I'l'iVlvCTAT 


TTGATAAGCT 


TATTCTCCGA 


AAATTTCTTT 


TTGGAGGCGA 


4140 




GGGTGGTAGC 


GAGTCCACCT 


TCAGCTGTTT 


CACGAAAGGC 


AGTTGGCATG 


4200 


K. 1 1\X_ 1 LLTA 


CTTCGTACAT 


GGCATCGATC 


ACTTCATCCA 


CAGGGATTTT 


AGATTCGATA 


4260 


riA.l»AA\jb 


CCATGTCTGC 


TGCGATGAAA 


GCAAAGCTAG 


CTCCCATGGC 


ATTACGTTTG 


4320 




Li rCGACCAA 


ft AAffl^A m ft rfta % 

ACCTGCAACA 


GGGTCACAGA 


TGAGGCCTAG 


CATATTTTTA 


4380 


A r* A A Art/* 


t-AATAGCTTG 


ACTGGCCTGA 


JVfe 4. ft ^ft^ftatv^ft^BUta ^ft 

TAAGGTGTTC 


CACCTGCACC 


CAGAGTCAAG 


4440 


« I* Vaj^Au^, At. 


TCATAGCAGA 


GGCTGAACCA 


ft. Jft4^ftfl^h.^ft ft .^^^^ft^^K 

ACTTCAGCTT 


GACACCCACC 


CTCAGCACCT 


4500 


ununluunuu 


CATTGTTTGC 


GATGACTAGT 


^ft^ft ft ft ft AAJt ft <4ft 

CCAAAGGCAC 


CAGCAGCAAA GAGGAAATCC 


4560 


AATTfiTTftPT 

IVi tlUl 1 UV« 1 


LL*i\A*CTGAG 


GTCTAATTTT 


MA* ft #nft ^mjm ft 

TCAATAGCAG 


CAGTGAGAAC 


GGATGGCAGA 


4620 


CAGCCAGCAC 


TTCCAGCGGT 


TGGAGTGGCA 


CAGACCAAGC 


CCATTTTGGC 


ATTGTGTTCA 


4680 


TTGACTGCGA 


TGGCATTTCG 


GGCAGCCGAG 


AGAATCGTAT 


AATCTGACAG 


AGTTTTTCCG 


4740 


TTTTCGATGT 


AGTGATCCAA 


TTTGGCAGCA 


TCPCCACCTG 


TCAGGCCACT 


ACGAGATTTA 


4800 


TTTTCATTGA 


GGCCAAGTTG 


GACAGAGGCT 


TTCATAACTT 


CCAGATTGCG 


TTCCATGAGA 


4860 


AGGAAGACTT 


CTTCACGTTC 


GCGACCGGTC 


AATTCAAACT 


CTGTTGTAAT 


CATGAGTTCT 


4920 


GCGACATTTC 


CTTGAAAGTC 


CAGATCTGCP 


TGCTCGACCA 


ATTCTTTGAT 


AGAATAAAAC 


4980 


ATGCTTCCTC 


CTATTTAAAG 


AAATTGACAT 


TGTGGAGATG 


AGGGATTTTT 


CGAATTTCTT 


5040 


CGATACCCTC 


ATCACAGTTG 


CGACTGTCAA 


CTTCGATAAT 


CATAATGGCT 


TTTTCACCAG 


5100 
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CTTTTTCACG AGTGACATTC ATCTGGGCGA TATTGATACC 


ATAGCGGGAA 


AGCGCCTCTG 


5160 


TAACAAGGGC AATCATACCT GGAATATCTT GATGAACGAT 


GATGATAGTC 


GGTGTATTCA 


5220 


TATTGAGAGA GACGGCAAAA CCATTGAGTT CGGTTACCTG 


AATATTTCCT 


CCACCGATAG 


5280 


AAATACCAGT CACGCTGATG GTCTTGTGGG CATTTTTAAC 


AGTAATTTTA 


GTGGTGTTAG 


5340 


GGTGAGGGGC ATTGCTGTCT TTCTGAATGG TCCAGACAAT 


CTTGATACCA 


CGCTTGTGGG 


5400 


CAATTTCCAG ACTATTTGGA ATTTCAGGAT CATCTGTATC 


CATTCCTAAA 


ATACCTGCAA 


5460 


CAAGGGCTAG GTCTGTTCCG TGACCACGAT AGGTCTTGGC 


AAATGAGTTA 


AAAAGTTGGA 


5520 


ATTCAACTTC TGTCGGAGTA TCATCAAAAA TGGAAGAGAC 


AATCTTCCCA 


ATACGAACAG 


5580 


CACCAGCGGT ATGGCTACTA GATGGGCCAA TCATAACTGG 


TCCGATGATA TCAAAGACAG 


5640 


ATTGAAAACG AAGTGATTTC ATCAGTTTCC CCTTATAAAA 


ATTCTTATCT 


CTATTATATC - 


5700 


AAAGAATGAG GGGCTTGGCT TTAATTGTGG ATGAAAACCT 


TTCTAATACC 


TCAAATAGCA 


5760 


TAAAAATAGT ATCTTTTATG ACAAAAAACA CCTTATTTAG 


GGAAATAAAA 


AATAATTTTG 


5820 


TAATATTTCT ACATAAAAGT GTCAAGAAAC GGTAATATTT 


AAAGGGTATG 


ATAGAACTAT 


5880 


AGAAAGAACG AGAATTTTCG AATATGAAAT CAATAACTAA 


AAAGATTAAA 


GCAACTCTTG 


5940 


CAGGAGTAGC TGCCTTGTTT GCAGTATTTG CTCCATCATT 

* 


TGTATCTGCT 


CAAGAATCAT 


6000 


CAACTTACAC TGTTAAAGAA GGTGATACAC TTTCAGAAAT 


CGCTGAAACT 


CACAACACAA 


6060 


CAGTTGAAAA ATTGGCAGAA AACAACCACA TTGATAACAT 


TCATTTGATT 


TATGTTGATC 


6120 


AAGAGTTGGT TATCGATGGC CCTGTAGCGC CTGTTGCAAC 


ACCAGCGCCA 


GCTACTTATG 


6180 


CGGCACCAGC CGCTCAAGAT GAAACTGTTT CAGCTCCAGT 


AGCAGAAACT 


CCAGTAGTAA 


$240 


GTGAAACAGT TGTTTCAACT GTAAGCGGAT CTGAAGCAGA 


AGCCAAAGAA 


TGGATCGCTC 


6300 


AAAAAGAATC AGGTGCTAGT ATACAGCTAC AAATGGACGT 


TATATCGGAC 


GTTACCAATT 


6360 


AACAGATTCA TACCTGAACG GTGACTACTC AGCTGAAAAC 


CAAGAACGGG 


TACCG 


6415 


(2) INFORMATION FOR SEQ ID NO: 163: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; B494 base pairs 
<B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 
TACCCCTTTC GAATTTTGGC AAAAATTCGG TAAGGCTTTG ATGGTAGTTA TCGCGGTTAT 60 
GCCGGCTGCT GGTTTGATGA TTTCAATCGG TAAGTCTATC GTGATGATTA ACCCAACCTT 120 
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TGCACCACTT GTCATCACAG GTGGAATTCT TGAGCAAATC GGTTGGGGGG TTATCGGTAA 180 

CCTTCACATT TTGTTTGCCC TAGCCATTGG ACGAAGCTGG GCTAAAGAAC GTGCTGGTGG 240 

TGCTTTCGCC GCTGGTCTTG CCTTCATCTT GATTAACCGT ATCACTGGTA CAATCTTTGG 300 

TGTATCACCC GATATGTTGA AAAATCCAGA TGCTATGGTA ACTACTTTCT TTGGTGGTTC 360 

AATCAAAGTT GCTGATTACT TTATCAGTGT TCTTGAAGCT CCAGCCTTGA ACATGGGGGT 420 

ATTCGTAGGG ATTATCTCAG GTTTTGTAGG GGCAACTGCT TACAACAAAT ACTACAACTT 480 

CCCTAAACTT CCTGATGCAC TTTCATTCTT CAACGGGAAA CGTTTCGTAC CATTTGTAGT 540 

TATTCTTCGT TCAGCAATCG CTGCAATTCT ACTTGCTGCT TTCTGGCCAG TAGTTCAAAC 600 

AGGTATCAAT AACTTCGGTA TCTGGATTGC CAACTCACAA GAAACTGCTC CAATTCTTGC 660 

ACCATTCTTG TATGGTACTT TGGAACGTTT GCTCTTGCCA TTTGGTCTTC ACCACATGTT 720 

GACTATCCCA ATGAACTACA CAGCTCTTGG TGGTACTTAT GACATTTTAA CTGGTGCAGC 780 

TAAAGGTACT CAAGTATTCG GTCAAGACCC ACTATGGCTT GCATGCGTAA CAGACCTTGT 840 

AAACCTTAAA GGTACTGATG CTAGTCAATA TCAACACTTG TTAGATACAG TACATCCAGC 900 

TCGTTTCAAA GTTGGACAAA TGATCGGTTC ATTCGGTATC TTGATGGGTC TGATTGTTGC 960 

TATCTACCGT AATGTTGATG CTGACAAGAA ACATAAATAC AAAGGTATGA TGATTGCAAC 1020 

AGCTCTTGCA ACATTCTTGA CAGGGGTTAC TGAACCAATC GAATACATGT TCATGTTCAT 1080 

CGCAACACCT ATGTATCTTG TTTACTCACT TGTTCAAGGT GCTGCCTTCG CTATGGCTGA 1140 

CGTCGTAAAC CTACGTATGC ACTCATTCGG TTCAATCGAG TTCTTGACTC GTACACCTAT 1200 

TGCAATCAGT GCTGGTATTG GTATGGATAT CGTTAACTTC GTTTGGGTAA CTGTTCTCTT 1260 

TGCTGTAATC ATGTACTTTA TCGCAAACTT CATGATTCAA AAATTCAACT ACGCAACTCC 1320 

AGGGCGCAAC GGAAACTACG AAACTGCTGA AGGTTCAGAA GAAACCAGCA GCGAAGTGAA 1380 

AGTTGCAGCA GGCTCTCAAG CTGTAAACAT TATCAACCTT CTTGGTGGAC GTGTAAACAT 1440 

CGTTGATGTT GATGCATGTA TGACTCGTCT TCGTGTAACT GTTAAAGATG CAGATAAAGT 1500 

AGGAAATGCA GACCAATGGA AAGCAGAAGG AGCTATGGGT CTTGTCATGA AAGGACAAGG 1560 

GGTTCAAGCT ATCTACGGTC CAAAAGCTGA CATTTTGAAA TCTGATATCC AAGATATCCT 1620 

TGATTCAGGT GAAATCATTC CTGAAACTCT TCCAAGCCAA ATGACTGAAG CACAACAAAA 1680 

CACTGTTCAC TTCAAAGATC TTACTGAGGA AGTTTACTCA GTAGCAGACG GTCAAGTTGT 1740 

TGCTTTGGAA CAAGTAAAGG ATCCAGTATT TGCTCAAAAA ATGATGGGTG ATGGATTTGC 1800 

AGTAGAACCT GCAAATGGAA ACATTGTATC TCCAGTTTCA GGTACTGTGT CAAGCATCTT I860 
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CCCAACAAAA CATGCTTTTG GTATTGTGAC GGAAGCAGGT CTTGAAGTAT TGGTTCACAT 1920 

TGGTTTGGAC ACAGTAAGTC TTGAAGGTAA ACCATTTACA GTTCATCTTG CTGAAGGACA 1980 

AAAAGTTGCA GCAGGAGATC TCCTTGTCAC AGCTGACTTG GATGCTATCC GTGCAGCAGG 2040 

ACGTGAAACT TCAACAGTAG TTGTCTTCAC AAATGGTGAT GCAATTAAAT CAGTTAAGTT 2100 

AGAAAAAACA GGTTCTCTTG CAGCTAAAAC AGCAGTTGCT AAAGTAGAAT TGTAATATAC 2160 

TTGAGGTTGG AAGCTGTATT CCAACCTCTT ATTTTGGGAG AAAAGAATGA AATTTTTAAC 2220 

ACTCAATACT CACAGTTGGA TGGAGAAAGA AGCAGAGGAA AAATTCCAGA TTTTGCTTGA 2280 

AGATATTCTT GAAAAGGACT ATGATTTGAT TTGTTTTCAA GAAATCAATC AGGAGATGAC 2340 

CTCGTCAGAG GTGGAGGTTA ATCACCTTTA TCAAGCTTTG CCAGCAGCTG AGCCTATTCA 2400 

CCAAGACCAT TATGTTAGAC TCTTGGTTGA AAAGTTCTCT GAGCAAGGGA AAAATTACTA 2460 

CTGGACCTGG GCCTATAACC ATATCGGCTA TAACCGCTAC CACGAAGGTG TGGCTATCTT 2520 

GTCTAAAACA CCTATTGAAG CCAGAGAAAT TTTGGTTTCA GATGTGGATG ATCCAACAGA 2580 

CTATCATACT CGCCGTGTTG CCCTAGCTGA AACTCTACTC GATGGCAAGG AGCTAGCAGT 2640 

TGCCAGTGTT CATCTCTCTT GGTGGGATAA AGGTTTCCAA GAAGAATGGG CACGATTTGA 2700 

GGCTGTCTTG AAAAAATTGA ACAAGCCACT TTTACTAGCT GGAGATTTCA ACAATCCGGC 2760 

TGGACAGGAA GGTTACCAAG CTATTTTAGC TAGTCCATTA GGCTTACAAG ACGCATTTGA 2820 

AGTTGCTCAA GAGAAAAGTG GTAGCTATAC TGTTCCGCCT GAAATTGATG GCTGGAAAGG 2880 

GAACACTGAA CCCCTTCGAA TCGATTATGT CTTTACTACC AAAGAGTTAG CGGTGGAAAA 2940 

TTTACATGTC GTATTTGATG GTAACAAGAG TCCACAAGTG AGTGATCACT ATGGCTTGAA 3000 

TGCTATATTA AACTGGAAAT AATAACTGAA AAGAGGTTGG AACTATAAAA TTCCAGCCTT 3060 

TTCTTACTAG AGAAGCTACT GGAAATAGCC TAAATAAGTG AGACTACTGT AATGGAATAA 3120 

AATATGGTAT AATTGATAAG GTAGATAGAA TCGAGGATGT TATGTCATTT ACGAAATTTC 3180 

AATTTAAAAA CTATATTAGA GAAGCCTTGA AGGAGTTAAA ATTTACAACT CCAACAGAviG 3240 

TGCAAGACAA GTTCATTCCT ATTGTTTTGG CAGGTCGTGA CCTAGTAGGA GAATCAAAAA 3300 

CAGGTTCAGG TAAGACTCAT ACTTTCTTGT TACCGATTTT CCAGCAATTA GATGAAGCTA 3360 

GCGATAGTGT ACAAGCAGTG ATTACTGCAC CGAGTCGTGA GTTGGCTACT CAAATTTACC 3420 

AAGTAGCGCG TCAGATTTCA GCTCACTCAG ATGTCGAAGT TCGTGTGGTT AATTATGTGG 3480 

GTGGTACGGA TAAGGCTCGC CAGATTGAGA AATTGGCAAG CAATCAGCCT CATATTGTTA 3540 

TTGGAACACC AGGCCGTATC TACGACTTGG TTAAATCTGG TGATTTAGCT ATTCATAAAG 3600 

CCAAGACATT TGTTGTTGAT GAAGCAGATA TGACCTTGGA TATGGGATTC TTGGAAACTG 3660 
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TTGATAAGAT TGCTGGCAGT CTTCCAAAAG ACTTGCAATT CATGGTCTTC TCAGCGACTA 3720 

TCCCACAAAA ACTGCAACCA TTCTTGAAAA AATACTTATC AAATCCTGTT ATGGAGAAAA 3780 

TTAAGACCAA AACGGTTATT TCTGACACCA TTGATAATTG CTTCATTTCG ACCAAGGGAC 3840 

ATGATAAGAA TGCTCAAATT TACCAGTTGA CTCAGTTGAT GCAGCCGTAT TTGGCAATGA 3900 

TTTTTGTTAA CACTAAAACG CGTGCTGATG AATTGCATTC ATATCTGACT GCTCAAGGCT 3960 

TGAAGGTTGC AAAAATCCAT GGCGATATTG CCCCTCGTGA ACGCAAGCGA ATCATGAATC 4020 

AGGTGCAAAA TCTGGATTTT GAGTATATTG TCGCAACAGA TTTGGCAGCG CGTGGGATTG 4080 

ACATTGAAGG TGTCAGCCAT GTCATCAATG ATGCCATTCC GCAAGACTTA TCTTTTTTTG 4140 

TTCATCGTGT TGGTCGTACT GGACGAAATG GCCTACCAGG TACAGCTATT ACCCTTTATC 4200 

AGCCAAGTGA TGACTCGGAT ATCCGTGAGT TGGAGAAATT GGGAATCAAG TTTAGTCCTA 4260 

AGATGGTCAA AGACGGGGAA TTTCAAGATA CCTATGACCG TGATCGTCGT GCCAACCGTG 4320 

AGAAAAAACA AGATAAACTT GATATCGAAA TGATTGGTTT GGTTAAAAAG AAAAAGAAAA 4380 

AAGTCAAACC GGGTTATAAG AAGAAAATTC AATGGGCGGT TGATGAAAAG CGCCGTAAAA 4440 

CCAAGCGTGC TGAAAATCGC GCTCGCGGTC GTGCAGAGCG TAAAGCTAAA CGCCAAACAT 4500 

TTTAATAGAA ATTGTTGGAG TATTGAGCTC CAACTTTTTT ATTTATGAGA ACGAACTATC 4560 

TAAACCGAAA CACTACATTA AAGACTGCAA ATTGCGATTA AAAATGGTAT AATGATAAAG 4620 

TTATATAGTC CCGATAAGAT GGTAGGTATT TATTACGAAG AGTTTTCCTA TCAGTACTTT 4680 

GTAACTCTAT AACAATATTT TTTAAGGGGG GACATTTTTA TGTCAGAGCG TAAATTATTC 4740 

ACGTCTGAAT CTGTATCTGA GGGGCATCCG GATAAGATTG CAGACCAAAT TTCAGATGCG 4800 

ATTTTGGATG CTATTTTAGC AAAGGATCCA GAGGCGCACG TTGCTGCTGA AACAGCTGTA 4860 

TATACTGGTT CTGTCCACGT TTTTGGTGAA ATTTCTACAA ATGCCTATGT GGATATTAAC 4 920 

CGTGTGGTTC GTGATACCAT TGCAGAGATT GGTTATACCA ATACAGAATA TGGATTTTCT 4980 

GCTGAGACGG TGGGAGTACA CCCATCTTTG GTGGAACAAT CTCCTGACAT CGCTCAAGGT 5040 

GTTAACGAAG CCTTGGAGGT TCGTGGAAAT GCTGATCAAG ATCCACTGGA CTTGATTGGA 5100 

GCAGGTGACC AAGGGCTCAT GTTTGGATTT GCAGTAGATG AAACAGAAGA GCTTATGCCA 5160 

TTGCCAATTG CACTCAGTCA TAAATTGGTT CGTCGTCTGG CAGAACTTCG TAAGTCTGGA 5220 

GAAATTAGCT ATCTCCGTCC AGATGCAAAA TCACAAGTTA CAGTTGAGTA CGATGAAAAT 5280 

GACCGTCCGG TACGTGTAGA TACAGTCGTT ATTTCTACTC AGCATGATCC AGAGGCCACT 5340 

AATGAACAAA TCCATCAAGA TGTGATTGAC AAGGTCATCA AAGAAGTTAT TCCATCTTCT 5400 
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TATCTTGATG 


ATAAGACAAA 


ATTCTTTATC 


AATCCGACAG 


GTCGTTTTGT 


AATCGGTGGT 


S4G0 


CCTCAAGGGG 


ACTCAGGTTT 


GACTGGTCGT 


AAGATTATTG 


TAGATACTTA 


TGGTGGCTAC 


5520 


TCTCGTCATG 


GTGGTGGTGC 


CTTCTCTGGT 


AAAGATGCGA 


CTAAGGTGGA 


TCGTTCAGCC 


5S80 

4* 4* W V 


TCTTATGCGG 


CTCGCTATAT 


TGCCAAGAAT 


ATCGTTGCAG 


CAGACCTTGC 


TAAGAAGGCA 


5640 


GAAGTGCAGT 


TGGCCTATGC 


TATCGGTGTT 


GCGCAACCTG 


TTTCTGTTCG 


TATCGATACT 

4 • 4 4 ^4^rr4 4 4V^> m 


5700 


TTCGGTACAG 


GAACAGTAGC 


TGAAAGTCAA 


CTTGAAAAAG 


CGGCTCGTCA 


AATCTTTGAC 

* 4» 4 -4 ^4 4 4 4 V« 444 


5760 


CTTCGCCCTG 


CAGGGATTAT 


CCAAATGCTG 


GACCTCAAGC 


GTCCAATTTA 


CCGTCAAACA 


5820 


TCGGCTTACG 


GTCACATGGG 


ACGTACAGAT 


ATTGATCTTC 


CATGGGAACG 


TTTGGATAAG 

4 4 4 Wfl 4 


5880 

J V W V 


GTAGATGCTT 


TGAAAGAAGC 


AGTAAAATAA 


GATTTTAAGA 


GGGGAACGTC 


CTCTPCTTTTT 

4 V 4 4 4 4 4 A 


5940 

J j v w 


TATAGTTTTT 


AACTATACTG 


GGATACTGTT 


CTGAAAATCC 


ATTTTGCGAA 


AGTAGAGATT 


6000 

W 4F V W 


TACATGTATA 


GTAGATTGAA 


ACTAGAATAG 


TACACCTCAA 


CTTCTAAAAC 


ATTGTTAGCA 

■ • 4 4- N# 4 4 nwo 


- 6060 


ATCAATTTGA 


CTGTCCTGAT 


CGATTTCTCC 


TGTTCTTGTT 


TCATTTTACT 


AT ATTT CTTT 

rl 4 « 4 4*4 A 4 4 


6120 

U 44 V 


AAAAATGATA 


AAGGTTAAGA 


TTTCTCCTCG 


TAATAGATAA 


TCTTGGGGAT 


ATTTCAATCC 

* » 4> 4 4 44#>n ^ *v *w 


6180 

W X V w 


AAAGTTTTAT 


TCGTTATCAC 


TTGACTATTG 


CAAGGTTTTC 


TAGAGCAACA 


GAGTCATGGA 

1 Wl A \#\^^4 


6240 

W A 1 W 


ATGGACTCAT 


GGTTGAGATT 

^4 ^» • 4 nH 4 4 4 


TCTCCTTGTT 

4 4) ^P^V 4 4 4) 4 


GCTTGGACTT 

^^^b- 4 4 ™^4> 4 4 


CATTCAAAAG 




OJVV 


AAGCCTTGTT 


CAAACTTCTA 


ATACACTAGC 


TGTTTCCATA 


GCATGACTTC 


TGTACTAGAC 

4 \# • iiv 4 n^#r*w 


6360 


TTTCTTTTCC 


GAATAAATAG 


ATAGAACCAC 


AGAATCTAGT 


AAACCTAGAA 


TTAAAATTAT 


6420 

V *1 4> w 


GGTATAATAT 


TAGCAATAAA 


AGAAATCTGG 


AGGATTAGAA 


TCATGGTATC 


AACGAAAACA 


6480 


CAAATTGCTG 


GTTTTGAGTT 


TGACAATTGC 


TTGATGAATG 


CAGCAGGTGT 


GGCTTGTATG 

^4^*4j» 4 4 W 4 4^B 4 


6540 


ACGATAGAGG 


AGTTAGAAGA 


GGTCAAAAAC 


TCAGCGGCAG 


GAACCTTTGT 

4> 4 4 4 


TACTAAGACA 


6600 


GCGACCTTGG 


ACTTCCGTCA 


GGGGAATCCT 


GAGCCACGCT 


ACCAAGATGT 

* ^44 4> >VI 4 V V 4 


TCCACTTGGT 

4 44 4-^4 4 4 W 4 


6660 


TCCATCAACT 


CTATGGGCTT 


GCCAAATAAT 


GGCTTAGACT 


ATTATTTGGA 


TTATCTTTTA 


6720 


GATTTGCAGG 


AAAAAGAGTC 


GAACCGAACT 


TTCTTCTTAT 


CTCTGGTCGG 


CATGTCTCCA 


6780 


GAGGAAACCC 


ATACTATTTT 


GAAAAAAGTC 


CAAGAGAGTG 


ATTTTCGTGG 


TCTGACTGAG 


6840 


CTAAATCTTT 


CCTGTCCAAA 


TGTTCCAGGT 


AAACCTCAGA 


TTGCCTATGA 


TTTTGAGACA 


6900 


ACAGACCGGA 


TTTTGGCAGA 


AGTGTTTGCT 


TACTTCACCA 


AACCTCTTGG 


AATTAAATTG 


6960 


CCACCTTATT 


TTGATATTGT 


TCACTTTGAC 


CAAGCGGCAG 


CTATTTTCAA 


CAAATATCCG 


7020 


CTCAAGTTTG 


TCAACTGCGT 


TAACTCTATC 


GGAAACGGCC 


TCTATATAGA 


AGACGAATCT 


7080 


GTCGTTATTC 


GGCCTAAGAA 


TGGTTTTGGT 


GGAATTGGTG 


GAGAATACAT 


CAAACCGACT 


7140 


GCTTTAGCCA 


ATGTTCACGC 


CTTTTATCAA 


CGTTTAAATC 


CTCAAATCCA 


AATTATCGGA 


7200 



